Querying the Web of DataKennissystemen, December 2010Rinke Hoekstra
OverviewLinked (Open) DataThe Web of DataScalability issuesTechnologyRDF SyntaxesRDF Storage and QueryingKennissystemen 2010
The Semantic Web IdeologyIdentity is everythingPartial solutions are great too!Layer cakeOWLKennissystemen 2010
The Web of Data… does it exist?Kennissystemen 2010
Linked DataKennissystemen 2010
Semantic WebIntially
`Metadata’ for web pages
Since ~2006
`Web of Data’
Semantic web as data source in its own right
Linked Data
A ‘Databaseesque’ Web
RDF Triple stores
Query languagesKennissystemen 2010
Storage (on the web)As documents.rdf, .n3, .turtle, .htmlRDF triple storesSesame, Joseki, 4Store, AllegroGraph, OpenLink Virtuoso, SDB/TDB, Open Calais, SWI Prolog Reasoners ‘on top’, or via DIGPellet, OWLIM, etc.SPARQL EndpointsResults as JSON, XML, CSV etc.Kennissystemen 2010
Data and the WebNeed to add this ‘meta’ to my ‘data’‘Linking’ data across sitesWeb of Documents and the Web of DataOld fashioned HTML:<link rel='meta' type='application/rdf+xml' href='http://www.leibnizcenter.org/~hoekstra/foaf.rdf' title='FOAF'> URL-basedHTTP 303 `see other’http://www.w3.org/TR/swbp-vocab-pub/RDFaKennissystemen 2010
BBC MusicKennissystemen 2010
Kennissystemen 2010
Kennissystemen 2010
Kennissystemen 2010
Integration: 303 See OtherKennissystemen 2010
Integration: Inline RDFaAttributes on XHTML elementshttp://www.w3.org/TR/xhtml-rdfa-primerKennissystemen 2010
Integration: RDFa ExampleIn XHTML:<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"          "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"><html xmlns:cal="http://www.w3.org/2002/12/cal/ical#">    <head><title>Jo's Friends and Family Blog</title></head>    <body>    <pinstanceof="cal:Vevent">        I'm holding        <span property="cal:summary">one last summer Barbecue</span>,        on        <span property="cal:dtstart" content="20070916T1600-0500">            September 16th at 4pm.        </span>    </p>    </body></html>In RDF:_:blanknode0rdf:typecal:Vevent; cal:summary ”one last summer Barbecue";cal:dtstart "20070916T1600-0500" .Kennissystemen 2010
Legal InformationRetrievalforLaymenKennissystemen 2010
VoorbeeldKennissystemen 2010
So, where’s that data?I repeat: does it really exist?Kennissystemen 2010
Linked Open DataKennissystemen 2010
November 2009: 13.1 Billion triples, 142 Million links Kennissystemen 2010
September 2010: 25 Miljard triples, 395 Miljoenlinks Kennissystemen 2010
ScalabilityHow to deal with massive amounts of data?Consequences for reasoningBillion Triple Challenge(864.8 Million Triples)Consequences for queryingTable lookups, joins etc.… and what about …Dealing with change, provenance, trust?Kennissystemen 2010
A rough idea…I can crash a DL reasoner using an ontology of ~15 classes and 5 individuals (honestly)What if my ontology contains thousands of classes and billions of individuals?Kennissystemen 2010
ReasoningReasoning with inconsistent knowledgeincomplete knowledgeComplete vs. incomplete reasoningKennissystemen 2010
ReasoningWhen?
Realtime vs. in advance
Lightweight reasoning (RDFS, OWL 2 RL)
Implementable using forward chaining rules
Still problems with scalability
Distributed reasoning (DAS-3)
MaRVIN
‘SpeedDate’ distrubution of triples across nodes
MapReduce
Full closure of BTC in 57 minutes
Output: 30B triples
And what to do with the results?Kennissystemen 2010
2 Degrees from Kevin BaconPREFIX p: http://dbpedia.org/property/SELECT ?film1 ?actor1 ?film2 ?actor2WHERE {  	?film1 p:starring <http://dbpedia.org/resource/Kevin_Bacon> .  	?film1 p:starring ?actor1 .  	?film2 p:starring ?actor1 .  	?film2 p:starring ?actor2 .}DBPedia: 150M triplesKennissystemen 2010
Another rough idea…1 Billion triples in MySQL
Load time
… a couple of hours
Simple table lookup (one-variable query)
… about 5 minutes

More Related Content

PDF
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
PDF
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
PDF
Debunking some “RDF vs. Property Graph” Alternative Facts
PPTX
4 sw architectures and sparql
PPTX
Semantic web meetup – sparql tutorial
PDF
Two graph data models : RDF and Property Graphs
PPTX
A Little SPARQL in your Analytics
PDF
ParlBench: a SPARQL-benchmark for electronic publishing applications.
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Debunking some “RDF vs. Property Graph” Alternative Facts
4 sw architectures and sparql
Semantic web meetup – sparql tutorial
Two graph data models : RDF and Property Graphs
A Little SPARQL in your Analytics
ParlBench: a SPARQL-benchmark for electronic publishing applications.

What's hot (20)

PDF
RDF Tutorial - SPARQL 20091031
PDF
Jesús Barrasa
PDF
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
PDF
Property graph vs. RDF Triplestore comparison in 2020
PDF
LDQL: A Query Language for the Web of Linked Data
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
PPTX
Triple Stores
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
PPT
Ks2008 Semanticweb In Action
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
ODP
Graph Data -- RDF and Property Graphs
PPTX
Semantic Variation Graphs the case for RDF & SPARQL
PDF
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
PPTX
Democratizing Big Semantic Data management
PPT
Ist16-04 An introduction to RDF
PPTX
RDF Data Model
PDF
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
PPTX
Challenges and applications of RDF shapes
PPT
Aidan's PhD Viva
PDF
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
RDF Tutorial - SPARQL 20091031
Jesús Barrasa
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Property graph vs. RDF Triplestore comparison in 2020
LDQL: A Query Language for the Web of Linked Data
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Triple Stores
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Ks2008 Semanticweb In Action
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Graph Data -- RDF and Property Graphs
Semantic Variation Graphs the case for RDF & SPARQL
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Democratizing Big Semantic Data management
Ist16-04 An introduction to RDF
RDF Data Model
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
Challenges and applications of RDF shapes
Aidan's PhD Viva
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
Ad

Viewers also liked (6)

ODP
Summary of SIGIR 2011 Papers
PDF
Enterprise Use of Twitter by Doug Ross
KEY
Concept- en Definitie Extractie
ODP
SIGIR 2011
PDF
Experience aware Item Recommendation in Evolving Review Communities
PDF
QBer - Connect your data to the cloud
Summary of SIGIR 2011 Papers
Enterprise Use of Twitter by Doug Ross
Concept- en Definitie Extractie
SIGIR 2011
Experience aware Item Recommendation in Evolving Review Communities
QBer - Connect your data to the cloud
Ad

Similar to Querying the Web of Data (20)

ODP
State of the Semantic Web
PPTX
20100614 ISWSA Keynote
PPTX
CSHALS 2010 W3C Semanic Web Tutorial
PPT
(Re-) Discovering Lost Web Pages
PPTX
Ks2009 Semanticweb In Action
PDF
Visualize open data with Plone - eea.daviz PLOG 2013
PPT
Information Extraction and Linked Data Cloud
PPT
Semantic Web
PPT
SemanticWeb Nuts 'n Bolts
PDF
Spark Community Update - Spark Summit San Francisco 2015
PPTX
Linked Data and Locah, UKSG2011
PPT
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
PPT
Linked data and voyager
PPT
Linked Data Tutorial
PPT
Lee Iverson - How does the web connect content?
PDF
Sustainable queryable access to Linked Data
PPT
Getting Started With The Talis Platform
PDF
The Lonesome LOD Cloud
PDF
Explicit Semantics in Graph DBs Driving Digital Transformation With Neo4j
PDF
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
State of the Semantic Web
20100614 ISWSA Keynote
CSHALS 2010 W3C Semanic Web Tutorial
(Re-) Discovering Lost Web Pages
Ks2009 Semanticweb In Action
Visualize open data with Plone - eea.daviz PLOG 2013
Information Extraction and Linked Data Cloud
Semantic Web
SemanticWeb Nuts 'n Bolts
Spark Community Update - Spark Summit San Francisco 2015
Linked Data and Locah, UKSG2011
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
Linked data and voyager
Linked Data Tutorial
Lee Iverson - How does the web connect content?
Sustainable queryable access to Linked Data
Getting Started With The Talis Platform
The Lonesome LOD Cloud
Explicit Semantics in Graph DBs Driving Digital Transformation With Neo4j
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...

More from Rinke Hoekstra (20)

PDF
Knowledge Representation on the Web
PDF
Managing Metadata for Science and Technology Studies: the RISIS case
PDF
An Ecosystem for Linked Humanities Data
PDF
Jurix 2014 welcome presentation
PDF
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
PDF
Prov-O-Viz: Interactive Provenance Visualization
PDF
Linkitup: Link Discovery for Research Data
PDF
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
PDF
Linked (Open) Data - But what does it buy me?
PDF
Linked Science - Building a Web of Research Data
PDF
COMMIT/VIVO
PDF
Semantic Representations for Research
PDF
A Slightly Different Web of Data
PDF
The Knowledge Reengineering Bottleneck
PDF
Linked Census Data
KEY
SIKS 2011 Semantic Web Languages
PDF
The MetaLex Document Server - Legal Documents as Versioned Linked Data
PPTX
History of Knowledge Representation (SIKS Course 2010)
PPTX
Making Sense of Design Patterns
PPTX
Publicatie van Linked Open Overheids Data
Knowledge Representation on the Web
Managing Metadata for Science and Technology Studies: the RISIS case
An Ecosystem for Linked Humanities Data
Jurix 2014 welcome presentation
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Prov-O-Viz: Interactive Provenance Visualization
Linkitup: Link Discovery for Research Data
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
Linked (Open) Data - But what does it buy me?
Linked Science - Building a Web of Research Data
COMMIT/VIVO
Semantic Representations for Research
A Slightly Different Web of Data
The Knowledge Reengineering Bottleneck
Linked Census Data
SIKS 2011 Semantic Web Languages
The MetaLex Document Server - Legal Documents as Versioned Linked Data
History of Knowledge Representation (SIKS Course 2010)
Making Sense of Design Patterns
Publicatie van Linked Open Overheids Data

Recently uploaded (20)

PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
Unlock new opportunities with location data.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPTX
observCloud-Native Containerability and monitoring.pptx
PPTX
Tartificialntelligence_presentation.pptx
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PPT
What is a Computer? Input Devices /output devices
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
DOCX
search engine optimization ppt fir known well about this
PPTX
Benefits of Physical activity for teenagers.pptx
PPTX
Chapter 5: Probability Theory and Statistics
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Developing a website for English-speaking practice to English as a foreign la...
A comparative study of natural language inference in Swahili using monolingua...
A novel scalable deep ensemble learning framework for big data classification...
DP Operators-handbook-extract for the Mautical Institute
WOOl fibre morphology and structure.pdf for textiles
Unlock new opportunities with location data.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
observCloud-Native Containerability and monitoring.pptx
Tartificialntelligence_presentation.pptx
Hindi spoken digit analysis for native and non-native speakers
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
What is a Computer? Input Devices /output devices
Getting started with AI Agents and Multi-Agent Systems
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
search engine optimization ppt fir known well about this
Benefits of Physical activity for teenagers.pptx
Chapter 5: Probability Theory and Statistics
sustainability-14-14877-v2.pddhzftheheeeee
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Developing a website for English-speaking practice to English as a foreign la...

Querying the Web of Data