WTF is the Semantic Web
Juan F. Sequeda
Semantic Web Austin
Oct 1, 2011
WTF is the Semantic Web?
Internet != Web
What is the Web?
“… the Web, is a system of interlinked hypertext documents accessed via the Internet. With a web browser, one can view web pages that may contain text, images […] and navigate between them via hyperlinks”
http://en.wikipedia.org/wiki/World_Wide_Web
Current Web = internet + links + docs
History of the Web
Created by Tim Berners-Lee at CERN in 1989
Mosaic browser in 1993
W3C created in 1994
Exponential growth mid 90s
Amazon, eBay – 1995
Search engines – Google 1998
Dot-com boom 1997 – 2001
Web 2.0 – blogs, Facebook, Twitter, etc.
What is the problem?
WHAT’S THE WEATHER IN AUSTIN TODAY?
http://www.flickr.com/photos/jamieca/31631256/
What is the problem?
The web is full of documents
We aren’t always interested in documents
We are interested in THINGS
These THINGS might be in documents
We can read an HTML document rendered in a browser and find what we are searching for
This is hard for computers. Computers have to guess (even though they are pretty good at it)
The Web is a Data Shredder
Structured Data
Unstructured Data
Thanks Martin Hepp
What would we like?
Make it easy for computers/software to find THINGS
Do you SEARCH or do you FIND?
Search for
Football players who went to the University of Texas at Austin and played for the Dallas Cowboys as cornerback
Why can’t we just FIND it…
Guess how I FOUND out?
On a Semantic Web
Besides publishing documents on the web, which computers can’t understand easily
Let’s publish on the web something that computers can understand
DATA
The Semantic Web is a web of data
The current web is a web of documents
But wait… doesn’t the web already have data?
Current Data on the Web
Relational Databases
APIs
XML
CSV
XLS
…
Can’t computers and applications already consume that data on the web?
Yes! But it is all in different formats and data models!
This makes it hard to integrate data
The data in different data sources aren’t linked
For example, how do I know that the Juan Sequeda in Facebook is the same as the Juan Sequeda in Twitter?
Or if I create a mashup from different services, I have to learn different APIs and I get different formats of data back
Data is Siloed
Wouldn’t it be great if we had a standard way of publishing data on the Web?
We have a standardized way of publishing documents on the web, right?
HTML
Then why can’t we have a standard way of publishing data on the Web?
Good question! And the answer is YES. There is!
RDF
Resource Description Framework (RDF)
Data Model = a way to model data
e.g., relational databases use the relational data model
RDF is a graph data model
Key Value vs Graph
Key Values
firstName Juan
lastName Sequeda
livesIn Austin
knows StephaneCorlosquet
But what are these key/values describing?
ME!
RDF is a Graph
Let’s group the Key/Values together
<JuanSequeda> <firstName> “Juan”
<JuanSequeda> <lastName> “Sequeda”
<JuanSequeda> <livesIn> “Austin”
<JuanSequeda> <knows> <StephaneCorlosquet>
..
<StephaneCorlosquet> <firstName> “Stephane”
<StephaneCorlosquet> <lastName> “Corlosquet”
<StephaneCorlosquet> <livesIn> “Boston”
In each line, the first term is the identifier for the “group”; the remaining two terms are the Key/Value
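The grouping step on this slide can be sketched in a few lines of Python. This is only an illustration: the function name `to_triples` and the plain-tuple representation are assumptions made for this sketch, not part of RDF or of the original talk, and the sketch does not distinguish literals such as “Juan” from identifiers such as <StephaneCorlosquet>.

```python
from typing import Dict, List, Tuple

# A triple is (identifier of the "group", key, value).
Triple = Tuple[str, str, str]


def to_triples(subject: str, properties: Dict[str, str]) -> List[Triple]:
    """Attach every key/value pair to the thing it describes."""
    return [(subject, key, value) for key, value in properties.items()]


# The bare key/values from the slides.
juan = {
    "firstName": "Juan",
    "lastName": "Sequeda",
    "livesIn": "Austin",
    "knows": "StephaneCorlosquet",
}
stephane = {
    "firstName": "Stephane",
    "lastName": "Corlosquet",
    "livesIn": "Boston",
}

# Once each pair carries its subject, both sets can live in one graph.
graph = to_triples("JuanSequeda", juan) + to_triples("StephaneCorlosquet", stephane)

for subject, key, value in graph:
    print(subject, key, value)
```

Because the value of `knows` is itself the identifier of another group, following it leads from one set of key/values to the next, which is what makes this a graph rather than a flat table.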
RDF can be serialized in different ways
RDF/XML
RDFa (RDF in HTML)
N3
Turtle
JSON
RDFa
RDF/XML
RDF/N-triples
RDF/Turtle
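To make one of these serializations concrete, here is a rough Python sketch that writes the example triples as N-Triples lines, one statement per line ending in a period. The `http://example.org/` namespace, the `Ref` marker class, and the helper names are assumptions for illustration only, and the literal escaping covers just backslashes and double quotes rather than the full N-Triples grammar.

```python
BASE = "http://example.org/"  # hypothetical namespace, not from the slides


class Ref(str):
    """Marks an object that is an identifier rather than a literal value."""


def iri(name: str) -> str:
    # N-Triples writes full IRIs inside angle brackets.
    return f"<{BASE}{name}>"


def literal(text: str) -> str:
    # Minimal escaping only: backslash and double quote.
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_ntriples(triples):
    """Return one N-Triples line per (subject, predicate, object) tuple."""
    lines = []
    for subject, predicate, obj in triples:
        obj_text = iri(obj) if isinstance(obj, Ref) else literal(obj)
        lines.append(f"{iri(subject)} {iri(predicate)} {obj_text} .")
    return lines


triples = [
    ("JuanSequeda", "firstName", "Juan"),
    ("JuanSequeda", "livesIn", "Austin"),
    ("JuanSequeda", "knows", Ref("StephaneCorlosquet")),
]

lines = to_ntriples(triples)
print("\n".join(lines))
```

The other formats listed above carry the same triples; only the surface syntax changes.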
So does that mean that I have to publish my data in RDF now?
You don’t have to… but we would like you to
Schema.org
Rich Snippets
…
An example
Document on the Web
Databases back up documents
THINGS have PROPERTIES:
A Book has a Title, an author, …
This is a THING:
A book titled “Programming the Semantic Web” by Toby Segaran, …
Let’s represent the data in RDF
<book> <title> “Programming the Semantic Web”
<book> <author> “Toby Segaran”
<book> <isbn> “978-0-596-15381-6”
<book> <publisher> <Publisher>
<Publisher> <name> “O’Reilly”
Remember that we are on the web
Everything on the web is identified by a URI
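And now let’s link the data to other data
<http://…/isbn978> <title> “Programming the Semantic Web”
<http://…/isbn978> <author> “Toby Segaran”
<http://…/isbn978> <isbn> “978-0-596-15381-6”
<http://…/isbn978> <publisher> <http://…/publisher1>
<http://…/publisher1> <name> “O’Reilly”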
And now consider the data from Revyu.com
<http://…/isbn978> <hasReview> <http://…/review1>
<http://…/review1> <description> “Awesome Book”
<http://…/review1> <reviewer> <http://…/reviewer>
<http://…/reviewer> <name> “Juan Sequeda”
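Let’s start to link data
<http://…/isbn978> <hasReview> <http://…/review1>
<http://…/review1> <description> “Awesome Book”
<http://…/review1> <hasReviewer> <http://…/reviewer>
<http://…/reviewer> <name> “Juan Sequeda”
<http://…/isbn978> <sameAs> <http://…/isbn978> (the book node in the review data and the book node in the book data)
<http://…/isbn978> <title> “Programming the Semantic Web”
<http://…/isbn978> <author> “Toby Segaran”
<http://…/isbn978> <isbn> “978-0-596-15381-6”
<http://…/isbn978> <publisher> <http://…/publisher1>
<http://…/publisher1> <name> “O’Reilly”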
Juan Sequeda publishes data too
<http://juansequeda.com/id> <livesIn> <http://dbpedia.org/Austin>
<http://juansequeda.com/id> <name> “Juan Sequeda”
Let’s link more data
<http://…/isbn978> <hasReview> <http://…/review1>
<http://…/review1> <description> “Awesome Book”
<http://…/review1> <hasReviewer> <http://…/reviewer>
<http://…/reviewer> <name> “Juan Sequeda”
<http://…/reviewer> <sameAs> <http://juansequeda.com/id>
<http://juansequeda.com/id> <livesIn> <http://dbpedia.org/Austin>
<http://juansequeda.com/id> <name> “Juan Sequeda”
And more
[Figure: the book data, the Revyu.com review data, and Juan Sequeda’s own data combined into one graph, joined by the two sameAs links]
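One way to picture what the sameAs links buy us is to merge the separate data sets and treat linked identifiers as a single node. The Python sketch below does this with a small union-find. The prefixed names (`books:`, `revyu:`, `juan:`) are hypothetical stand-ins for the elided URIs on the slides, and the sketch assumes no literal value happens to equal an identifier; real RDF stores handle sameAs in more varied ways.

```python
def merge_same_as(triples):
    """Collapse identifiers joined by sameAs and return the merged triples."""
    parent = {}

    def find(node):
        # Follow parent pointers to the representative of this node's group.
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the alphabetically smaller name as the representative.
            parent[max(root_a, root_b)] = min(root_a, root_b)

    for subject, predicate, obj in triples:
        if predicate == "sameAs":
            union(subject, obj)

    return {
        (find(subject), predicate, find(obj))
        for subject, predicate, obj in triples
        if predicate != "sameAs"
    }


book_data = [
    ("books:isbn978", "title", "Programming the Semantic Web"),
    ("books:isbn978", "author", "Toby Segaran"),
]
review_data = [
    ("revyu:isbn978", "hasReview", "revyu:review1"),
    ("revyu:review1", "description", "Awesome Book"),
    ("revyu:review1", "hasReviewer", "revyu:reviewer"),
    ("revyu:reviewer", "name", "Juan Sequeda"),
]
person_data = [
    ("juan:id", "livesIn", "dbpedia:Austin"),
    ("juan:id", "name", "Juan Sequeda"),
]
links = [
    ("revyu:isbn978", "sameAs", "books:isbn978"),
    ("revyu:reviewer", "sameAs", "juan:id"),
]

merged = merge_same_as(book_data + review_data + person_data + links)
for triple in sorted(merged):
    print(triple)
```

After the merge, the review hangs off the same node as the book’s title, and the reviewer’s name sits next to where he lives, even though three different publishers supplied the pieces.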
Data on the Web that is in RDF and is linked to other RDF data is
LINKED DATA
Linked Data Principles
1) Use URIs as names for things.
2) Use HTTP URIs so that people can look up (dereference) those names.
3) When someone looks up a URI, provide useful information.
4) Include links to other URIs so that they can discover more things.
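Looking up a name, as in the second and third principles, is an ordinary HTTP request in which the client says which data format it would like back. The Python sketch below only builds such a request and does not send it. The URI is the simplified one used on these slides, the helper name `lookup_request` is made up for this example, and whether a given server actually answers with RDF is an assumption that varies by publisher.

```python
from urllib.request import Request

# Media types for two RDF serializations, with Turtle preferred.
RDF_ACCEPT = "text/turtle, application/rdf+xml;q=0.9"


def lookup_request(uri: str) -> Request:
    """Build (but do not send) an HTTP GET that asks for RDF about `uri`."""
    return Request(uri, headers={"Accept": RDF_ACCEPT})


# Simplified URI taken from the slides; treat it as illustrative.
request = lookup_request("http://dbpedia.org/Austin")

print(request.get_method(), request.full_url)
print("Accept:", request.get_header("Accept"))
```

Passing the request to `urllib.request.urlopen` would perform the actual lookup; a Linked Data publisher is expected to respond with useful data about the thing, including links to further URIs.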
Linked Data makes the web appear as ONE GIANT HUGE GLOBAL DATABASE!
I can query a database with SQL. Is there a way to query Linked Data with a query language?
Yes! There is actually a standardized language for that
SPARQL
FIND all the reviews on the book “Programming the Semantic Web” by people who live in Austin
SPARQL
SELECT ?review ?comment
WHERE {
  isbn:978 ex:hasReview ?review .
  ?review ex:description ?comment .
  ?review ex:hasReviewer ?person .
  ?person ex:livesIn dbpedia:Austin .
}
SELECT ?review ?comment
WHERE {
  isbn:978 ex:hasReview ?review .
  ?review ex:description ?comment .
  ?review ex:hasReviewer ?person .
  ?person ex:livesIn dbpedia:Austin .
}
[Figure: the query pattern overlaid on the linked graph of the book, its review, the reviewer, and Austin]
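To show roughly how a query like this is answered, the Python sketch below matches the four triple patterns against a small in-memory graph. It is a toy matcher, not a SPARQL engine: the identifiers `ex:review1` and `ex:juan` are hypothetical, the predicate is assumed to be `ex:livesIn` as in the graph, and the sameAs links are assumed to be already resolved, because a plain pattern match does not follow them by itself.

```python
def match(patterns, triples, binding=None):
    """Yield every variable binding that satisfies all triple patterns."""
    binding = binding or {}
    if not patterns:
        yield dict(binding)
        return
    first, rest = patterns[0], patterns[1:]
    for triple in triples:
        candidate = dict(binding)
        matched = True
        for term, value in zip(first, triple):
            if term.startswith("?"):
                # A variable binds on first use and must agree afterwards.
                if candidate.setdefault(term, value) != value:
                    matched = False
                    break
            elif term != value:
                matched = False
                break
        if matched:
            yield from match(rest, triples, candidate)


graph = [
    ("isbn:978", "ex:title", "Programming the Semantic Web"),
    ("isbn:978", "ex:hasReview", "ex:review1"),
    ("ex:review1", "ex:description", "Awesome Book"),
    ("ex:review1", "ex:hasReviewer", "ex:juan"),
    ("ex:juan", "ex:name", "Juan Sequeda"),
    ("ex:juan", "ex:livesIn", "dbpedia:Austin"),
]

query = [
    ("isbn:978", "ex:hasReview", "?review"),
    ("?review", "ex:description", "?comment"),
    ("?review", "ex:hasReviewer", "?person"),
    ("?person", "ex:livesIn", "dbpedia:Austin"),
]

# SELECT ?review ?comment
results = [(b["?review"], b["?comment"]) for b in match(query, graph)]
print(results)  # → [('ex:review1', 'Awesome Book')]
```

The shared variables do the joining: `?review` ties the first three patterns together and `?person` carries the match over to the data about where the reviewer lives.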
This looks cool, but let’s be realistic. What is the incentive to publish Linked Data?
What was your incentive to publish an HTML page in 1990?
1) Share data in documents
2) Because your neighbor was doing it
… later on …
3) Marketing, Advertising, …, SEO
So why should we publish Linked Data in 2011?
1) Share data as data
2) Because your neighbor is doing it…
3) Marketing, Advertising, SEO ++
Linked Data Publishers
UK Government
US Government
BBC
Open Calais – Thomson Reuters
Freebase/Google
NY Times
Best Buy
Sears
Kmart
Overstock.com
CNET
DBpedia
O’Reilly Media
…
May 2007
Oct 2007
Nov 2007
Feb 2008
Mar 2008
Sept 2008
Mar 2009 (1)
Mar 2009 (2)
July 2009
September 2010
September 2011
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.
http://lod-cloud.net/
YOU GET THE PICTURE
IT’S BIG and getting BIGGER and BIGGER
QUESTIONS?
