SlideShare a Scribd company logo
Linked Data Query Processing
Tutorial at the 22nd International World Wide Web Conference (WWW 2013)
May 14, 2013
http://db.uwaterloo.ca/LDQTut2013/
Olaf Hartig
University of Waterloo
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 2
Tutorial Outline
(1) Introduction
(2) Theoretical Foundations
(3) Source Selection Strategies
(4) Execution Process
(5) Query Planning and Optimization
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 3
Linked Data Query Processing
Tutorial at the 22nd International World Wide Web Conference (WWW 2013)
May 14, 2013
http://db.uwaterloo.ca/LDQTut2013/
1. Introduction
Olaf Hartig
University of Waterloo
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 4
Outline
 The Linked Data Principles
 Paradigms for Querying Linked Data
 Characteristics of the “Database System”
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 5
The Traditional, Hypertext Web
MovieDB
Data exposed
to the Web
via HTML
CIA World
Factbook
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 6
Towards a Web of Linked Data
MovieDB
:
( Albania , unemployment rate , 13.2% )
:
Data model: RDF
( War Child , release date , 12 July 1999 )
( War Child , filming location , Albania )
( Michael Davie , directed , War Child )
:
CIA World
Factbook
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 7
Towards a Web of Linked Data
MovieDB
( http://...imdb.../WarChild , release date , 12 July 1999 )
( http://...imdb.../WarChild , filming location , http://cia.../Albania )
( http://...imdb.../MichaelDavie , directed , http://...imdb.../WarChild )
:
( http://cia.../Albania ,
unemployment rate , 13.2% )
:
Data model: RDF
Global identifier: URI
CIA World
Factbook
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 8
Towards a Web of Linked Data
MovieDB
( http://cia.../Albania ,
unemployment rate , 13.2% )
:
Data model: RDF
Global identifier: URI
Access mechanism: HTTP
( http://...imdb.../WarChild , release date , 12 July 1999 )
( http://...imdb.../WarChild , filming location , http://cia.../Albania )
( http://...imdb.../MichaelDavie , directed , http://...imdb.../WarChild )
:
CIA World
Factbook
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 9
Towards a Web of Linked Data
MovieDB
CIA World
Factbook
( http://...imdb.../WarChild , release date , 12 July 1999 )
( http://...imdb.../WarChild , filming location , http://cia.../Albania )
( http://...imdb.../MichaelDavie , directed , http://...imdb.../WarChild )
:
( http://cia.../Albania ,
unemployment rate , 13.2% )
:
Data model: RDF
Global identifier: URI
Access mechanism: HTTP
Connection: data links
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 10
Supplementary Access Methods
● RDF dump: the whole dataset provided as a big file
● SPARQL endpoint: Web service that allows for executing
SPARQL queries over the dataset
● Caveat: these access method cannot be assumed
to be available for all datasets
● Creating dumps is not feasible if data changes very frequently
● Dumps or endpoints may not be feasible if Linked Data
interface is simply a wrapper for some other data source
● Providing and maintaining a reliable SPARQL endpoint
is a significant additional effort
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 11
Outline
 The Linked Data Principles
 Paradigms for Querying Linked Data
 Characteristics of the “Database System”
√
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 12
Traditional Paradigm 1: Warehousing
● Copy data into a centralized repository
● Query this repository
+ Almost instant results
– Misses unknown or new sources
– Collection possibly out of date
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 13
?
??
?
Traditional Paradigm 2: Federation
● Distribute query execution over a
federation of SPARQL endpoints
+ Current data
– Misses sources without
SPARQL endpoint
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 14
Principle 1: Rely on the Linked Data principles only
Principle 2: On-line execution
Linked Data Query Processing
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 15
Principle 1: Rely on the Linked Data principles only
Principle 2: On-line execution
Consequence: Obtain data for executing a given query by
looking up URIs during the query execution process itself
Linked Data Query Processing
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 16
“Ingredients” for LD Query Execution
Query-local data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 17
“Ingredients” for LD Query Execution
● Data retrieval approach
● Data source selection
● Data source ranking
(optional, for optimization)
Query-local data
GET http://.../movie2449
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 18
“Ingredients” for LD Query Execution
● Data retrieval approach
● Data source selection
● Data source ranking
(optional, for optimization)
Query-local data
http://mdb.../Paul http://geo.../Berlin
http://mdb.../Ric http://geo.../Rome
?loc?actor
GET http://.../movie2449
● Result construction approach
● i.e., query-local data processing
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 19
“Ingredients” for LD Query Execution
● Data retrieval approach
● Data source selection
● Data source ranking
(optional, for optimization)
Query-local data
http://mdb.../Paul http://geo.../Berlin
http://mdb.../Ric http://geo.../Rome
?loc?actor
GET http://.../movie2449
● Result construction approach
● i.e., query-local data processing
● Combining data retrieval
and result construction
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 20
+ Current data
+ May make use of any Linked Data available on the Web
– Least efficient due to data shipping
Use cases: live querying where freshness and discovery of
results is more important than an almost instant answer
Properties of LD Query Processing
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 21
Combination with other Paradigms
● Linked Data query processing with a query-local dataset
● Query-local dataset contains additional data [LT11]
● Query-local dataset for caching [Har11b, HH11]
● Linked Data query processing with a SPARQL endpoint
● SPARQL endpoint exposes a cache of Linked Data [UKH+12]
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 22
Our Topic Today …
… pure Linked Data query processing
Linked Data query: a query that ranges over
data made available using
the Linked Data principles
Web of Linked Data: network of data that evolves
by publishing data according
to the Linked Data principles
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 23
Outline
 The Linked Data Principles
 Paradigms for Querying Linked Data
 Characteristics of the “Database System”
√
√
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 24
An Analogy ...
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 25
Traditional, Central Database Systems
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 26
Distributed Database Systems
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 27
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 28
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 29
● Number of
potential data
sources infinite
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 30
● Number of
potential data
sources infinite
● No (a priori)
information
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 31
● Number of
potential data
sources infinite
● No (a priori)
information
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 32
● Number of
potential data
sources infinite
● No (a priori)
information
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 33
● Number of
potential data
sources infinite
● No (a priori)
information
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 34
● Number of
potential data
sources infinite
● No (a priori)
information
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 35
● Number of
potential data
sources infinite
● No (a priori)
information
● Number of
actual data
sources infinite
The Web of Linked Data
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 36
Issues due to the Openness
● Data quality issues
● Accuracy
● Freshness / timeliness
● Believability / trustworthiness
● Data source quality issues
● Availability
● Reliability
● Data integration issues
● Coreferences: Publishers may use different URIs
for denoting the same entity
● Schema heterogeneity: Publishers may use different
vocabularies for their data
For the purpose of discussing
execution of queries in this tutorial,
we largely ignore these issues.
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 37
Outline
 The Linked Data Principles
 Paradigms for Querying Linked Data
 Characteristics of the “Database System”
√
√
√
Next part: 2. Theoretical Foundations ...
WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 38
These slides have been created by
Olaf Hartig
for the
WWW 2013 tutorial on
Link Data Query Processing
Tutorial Website: http://db.uwaterloo.ca/LDQTut2013/
This work is licensed under a
Creative Commons Attribution-Share Alike 3.0 License
(http://creativecommons.org/licenses/by-sa/3.0/)

More Related Content

What's hot (20)

PPTX
“Open Data Web” – A Linked Open Data Repository Built with CKAN
Chengjen Lee
 
PPTX
The FLuID Meta Model: Incrementally Compute Schema-level Indices for the Web...
Till Blume
 
PDF
Interactive exploration of complex relational data sets in a web - SemWeb.Pro...
Logilab
 
PPTX
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
Ontotext
 
PPTX
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Mikel Emaldi Manrique
 
PPTX
eNanoMapper database, search tools and templates
Nina Jeliazkova
 
PPTX
Querying Linked Data
EUCLID project
 
PPTX
Providing Linked Data
EUCLID project
 
PPTX
RDF-Gen: Generating RDF from streaming and archival data
Giorgos Santipantakis
 
PPT
euclid_linkedup WWW tutorial (Besnik Fetahu)
Besnik Fetahu
 
PPTX
Interaction with Linked Data
EUCLID project
 
PDF
Instance Matching
Robert Isele
 
PPTX
Building Linked Data Applications
EUCLID project
 
PDF
Adventures in Linked Data Land (presentation by Richard Light)
jottevanger
 
ODP
Graph databases
Karol Grzegorczyk
 
PPTX
Big Linked Data - Creating Training Curricula
EUCLID project
 
PPTX
2015 TaPP - Interoperability for Provenance-aware Databases using PROV and JSON
Boris Glavic
 
PPTX
Scaling up Linked Data
EUCLID project
 
PDF
Oshs_9_11_2015
Béatrice Bouchou
 
PDF
SF Python Meetup: TextRank in Python
Paco Nathan
 
“Open Data Web” – A Linked Open Data Repository Built with CKAN
Chengjen Lee
 
The FLuID Meta Model: Incrementally Compute Schema-level Indices for the Web...
Till Blume
 
Interactive exploration of complex relational data sets in a web - SemWeb.Pro...
Logilab
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
Ontotext
 
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Mikel Emaldi Manrique
 
eNanoMapper database, search tools and templates
Nina Jeliazkova
 
Querying Linked Data
EUCLID project
 
Providing Linked Data
EUCLID project
 
RDF-Gen: Generating RDF from streaming and archival data
Giorgos Santipantakis
 
euclid_linkedup WWW tutorial (Besnik Fetahu)
Besnik Fetahu
 
Interaction with Linked Data
EUCLID project
 
Instance Matching
Robert Isele
 
Building Linked Data Applications
EUCLID project
 
Adventures in Linked Data Land (presentation by Richard Light)
jottevanger
 
Graph databases
Karol Grzegorczyk
 
Big Linked Data - Creating Training Curricula
EUCLID project
 
2015 TaPP - Interoperability for Provenance-aware Databases using PROV and JSON
Boris Glavic
 
Scaling up Linked Data
EUCLID project
 
Oshs_9_11_2015
Béatrice Bouchou
 
SF Python Meetup: TextRank in Python
Paco Nathan
 

Similar to Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.) (20)

PPTX
Linked Energy Data Generation
Filip Radulovic
 
PPTX
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014
Robert Meusel
 
PDF
Web Data Extraction: A Crash Course
Giorgio Orsi
 
PDF
Wed roman tut_open_datapub
eswcsummerschool
 
PDF
LOP – Capturing and Linking Open Provenance on LOD Cycle
rogers.rj
 
PDF
Answers to usual issues in getting started with consuming Linked Data
Olaf Hartig
 
PPTX
Creating knowledge out of interlinked data
Sören Auer
 
PDF
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
Neo4j
 
PPTX
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Ig Bittencourt
 
PPTX
SSSW2015 Data Workflow Tutorial
SSSW
 
PDF
Tutorial Data Management and workflows
SSSW
 
PDF
Sq lite module1
Highervista
 
PDF
Using Linked Data Resources to generate web pages based on a BBC case study
Leila Zemmouchi-Ghomari
 
PDF
Linked Data for the Masses: The approach and the Software
IMC Technologies
 
PPSX
The Web of data and web data commons
Jesse Wang
 
PDF
Disrupting Data Discovery
markgrover
 
PDF
Achieving the digital thread through PLM and ALM integration using oslc
Axel Reichwein
 
PDF
Achieving the Digital Thread through PLM and ALM Integration using OSLC
Koneksys
 
PDF
WWW2014 Overview of W3C Linked Data Platform 20140410
Arnaud Le Hors
 
Linked Energy Data Generation
Filip Radulovic
 
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014
Robert Meusel
 
Web Data Extraction: A Crash Course
Giorgio Orsi
 
Wed roman tut_open_datapub
eswcsummerschool
 
LOP – Capturing and Linking Open Provenance on LOD Cycle
rogers.rj
 
Answers to usual issues in getting started with consuming Linked Data
Olaf Hartig
 
Creating knowledge out of interlinked data
Sören Auer
 
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
Neo4j
 
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Ig Bittencourt
 
SSSW2015 Data Workflow Tutorial
SSSW
 
Tutorial Data Management and workflows
SSSW
 
Sq lite module1
Highervista
 
Using Linked Data Resources to generate web pages based on a BBC case study
Leila Zemmouchi-Ghomari
 
Linked Data for the Masses: The approach and the Software
IMC Technologies
 
The Web of data and web data commons
Jesse Wang
 
Disrupting Data Discovery
markgrover
 
Achieving the digital thread through PLM and ALM integration using oslc
Axel Reichwein
 
Achieving the Digital Thread through PLM and ALM Integration using OSLC
Koneksys
 
WWW2014 Overview of W3C Linked Data Platform 20140410
Arnaud Le Hors
 
Ad

More from Olaf Hartig (20)

PDF
A Context-Based Semantics for SPARQL Property Paths over the Web
Olaf Hartig
 
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Olaf Hartig
 
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Olaf Hartig
 
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Olaf Hartig
 
ODP
An Overview on PROV-AQ: Provenance Access and Query
Olaf Hartig
 
PDF
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
Olaf Hartig
 
PDF
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Olaf Hartig
 
PDF
The Impact of Data Caching of on Query Execution for Linked Data
Olaf Hartig
 
PDF
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
Olaf Hartig
 
PDF
A Main Memory Index Structure to Query Linked Data
Olaf Hartig
 
PDF
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Olaf Hartig
 
PDF
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Olaf Hartig
 
PDF
Querying Linked Data with SPARQL (2010)
Olaf Hartig
 
PDF
Answers to usual issues in getting started with consuming Linked Data (2010)
Olaf Hartig
 
PDF
Linked Data on the Web
Olaf Hartig
 
PDF
Executing SPARQL Queries of the Web of Linked Data
Olaf Hartig
 
PDF
Using Web Data Provenance for Quality Assessment
Olaf Hartig
 
PDF
Querying Linked Data with SPARQL
Olaf Hartig
 
PDF
Querying Trust in RDF Data with tSPARQL
Olaf Hartig
 
PDF
Database Researchers Map
Olaf Hartig
 
A Context-Based Semantics for SPARQL Property Paths over the Web
Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Olaf Hartig
 
An Overview on PROV-AQ: Provenance Access and Query
Olaf Hartig
 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
Olaf Hartig
 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Olaf Hartig
 
The Impact of Data Caching of on Query Execution for Linked Data
Olaf Hartig
 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
Olaf Hartig
 
A Main Memory Index Structure to Query Linked Data
Olaf Hartig
 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Olaf Hartig
 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Olaf Hartig
 
Querying Linked Data with SPARQL (2010)
Olaf Hartig
 
Answers to usual issues in getting started with consuming Linked Data (2010)
Olaf Hartig
 
Linked Data on the Web
Olaf Hartig
 
Executing SPARQL Queries of the Web of Linked Data
Olaf Hartig
 
Using Web Data Provenance for Quality Assessment
Olaf Hartig
 
Querying Linked Data with SPARQL
Olaf Hartig
 
Querying Trust in RDF Data with tSPARQL
Olaf Hartig
 
Database Researchers Map
Olaf Hartig
 
Ad

Recently uploaded (20)

PPTX
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
ICONIQ State of AI Report 2025 - The Builder's Playbook
Razin Mustafiz
 
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
ICONIQ State of AI Report 2025 - The Builder's Playbook
Razin Mustafiz
 

Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)

  • 1. Linked Data Query Processing Tutorial at the 22nd International World Wide Web Conference (WWW 2013) May 14, 2013 http://db.uwaterloo.ca/LDQTut2013/ Olaf Hartig University of Waterloo
  • 2. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 2 Tutorial Outline (1) Introduction (2) Theoretical Foundations (3) Source Selection Strategies (4) Execution Process (5) Query Planning and Optimization
  • 3. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 3 Linked Data Query Processing Tutorial at the 22nd International World Wide Web Conference (WWW 2013) May 14, 2013 http://db.uwaterloo.ca/LDQTut2013/ 1. Introduction Olaf Hartig University of Waterloo
  • 4. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 4 Outline  The Linked Data Principles  Paradigms for Querying Linked Data  Characteristics of the “Database System”
  • 5. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 5 The Traditional, Hypertext Web MovieDB Data exposed to the Web via HTML CIA World Factbook
  • 6. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 6 Towards a Web of Linked Data MovieDB : ( Albania , unemployment rate , 13.2% ) : Data model: RDF ( War Child , release date , 12 July 1999 ) ( War Child , filming location , Albania ) ( Michael Davie , directed , War Child ) : CIA World Factbook
  • 7. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 7 Towards a Web of Linked Data MovieDB ( http://...imdb.../WarChild , release date , 12 July 1999 ) ( http://...imdb.../WarChild , filming location , http://cia.../Albania ) ( http://...imdb.../MichaelDavie , directed , http://...imdb.../WarChild ) : ( http://cia.../Albania , unemployment rate , 13.2% ) : Data model: RDF Global identifier: URI CIA World Factbook
  • 8. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 8 Towards a Web of Linked Data MovieDB ( http://cia.../Albania , unemployment rate , 13.2% ) : Data model: RDF Global identifier: URI Access mechanism: HTTP ( http://...imdb.../WarChild , release date , 12 July 1999 ) ( http://...imdb.../WarChild , filming location , http://cia.../Albania ) ( http://...imdb.../MichaelDavie , directed , http://...imdb.../WarChild ) : CIA World Factbook
  • 9. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 9 Towards a Web of Linked Data MovieDB CIA World Factbook ( http://...imdb.../WarChild , release date , 12 July 1999 ) ( http://...imdb.../WarChild , filming location , http://cia.../Albania ) ( http://...imdb.../MichaelDavie , directed , http://...imdb.../WarChild ) : ( http://cia.../Albania , unemployment rate , 13.2% ) : Data model: RDF Global identifier: URI Access mechanism: HTTP Connection: data links
  • 10. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 10 Supplementary Access Methods ● RDF dump: the whole dataset provided as a big file ● SPARQL endpoint: Web service that allows for executing SPARQL queries over the dataset ● Caveat: these access method cannot be assumed to be available for all datasets ● Creating dumps is not feasible if data changes very frequently ● Dumps or endpoints may not be feasible if Linked Data interface is simply a wrapper for some other data source ● Providing and maintaining a reliable SPARQL endpoint is a significant additional effort
  • 11. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 11 Outline  The Linked Data Principles  Paradigms for Querying Linked Data  Characteristics of the “Database System” √
  • 12. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 12 Traditional Paradigm 1: Warehousing ● Copy data into a centralized repository ● Query this repository + Almost instant results – Misses unknown or new sources – Collection possibly out of date
  • 13. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 13 ? ?? ? Traditional Paradigm 2: Federation ● Distribute query execution over a federation of SPARQL endpoints + Current data – Misses sources without SPARQL endpoint
  • 14. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 14 Principle 1: Rely on the Linked Data principles only Principle 2: On-line execution Linked Data Query Processing
  • 15. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 15 Principle 1: Rely on the Linked Data principles only Principle 2: On-line execution Consequence: Obtain data for executing a given query by looking up URIs during the query execution process itself Linked Data Query Processing
  • 16. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 16 “Ingredients” for LD Query Execution Query-local data
  • 17. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 17 “Ingredients” for LD Query Execution ● Data retrieval approach ● Data source selection ● Data source ranking (optional, for optimization) Query-local data GET http://.../movie2449
  • 18. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 18 “Ingredients” for LD Query Execution ● Data retrieval approach ● Data source selection ● Data source ranking (optional, for optimization) Query-local data http://mdb.../Paul http://geo.../Berlin http://mdb.../Ric http://geo.../Rome ?loc?actor GET http://.../movie2449 ● Result construction approach ● i.e., query-local data processing
  • 19. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 19 “Ingredients” for LD Query Execution ● Data retrieval approach ● Data source selection ● Data source ranking (optional, for optimization) Query-local data http://mdb.../Paul http://geo.../Berlin http://mdb.../Ric http://geo.../Rome ?loc?actor GET http://.../movie2449 ● Result construction approach ● i.e., query-local data processing ● Combining data retrieval and result construction
  • 20. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 20 + Current data + May make use of any Linked Data available on the Web – Least efficient due to data shipping Use cases: live querying where freshness and discovery of results is more important than an almost instant answer Properties of LD Query Processing
  • 21. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 21 Combination with other Paradigms ● Linked Data query processing with a query-local dataset ● Query-local dataset contains additional data [LT11] ● Query-local dataset for caching [Har11b, HH11] ● Linked Data query processing with a SPARQL endpoint ● SPARQL endpoint exposes a cache of Linked Data [UKH+12]
  • 22. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 22 Our Topic Today … … pure Linked Data query processing Linked Data query: a query that ranges over data made available using the Linked Data principles Web of Linked Data: network of data that evolves by publishing data according to the Linked Data principles
  • 23. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 23 Outline  The Linked Data Principles  Paradigms for Querying Linked Data  Characteristics of the “Database System” √ √
  • 24. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 24 An Analogy ...
  • 25. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 25 Traditional, Central Database Systems
  • 26. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 26 Distributed Database Systems
  • 27. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 27 The Web of Linked Data
  • 28. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 28 The Web of Linked Data
  • 29. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 29 ● Number of potential data sources infinite The Web of Linked Data
  • 30. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 30 ● Number of potential data sources infinite ● No (a priori) information The Web of Linked Data
  • 31. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 31 ● Number of potential data sources infinite ● No (a priori) information The Web of Linked Data
  • 32. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 32 ● Number of potential data sources infinite ● No (a priori) information The Web of Linked Data
  • 33. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 33 ● Number of potential data sources infinite ● No (a priori) information The Web of Linked Data
  • 34. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 34 ● Number of potential data sources infinite ● No (a priori) information The Web of Linked Data
  • 35. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 35 ● Number of potential data sources infinite ● No (a priori) information ● Number of actual data sources infinite The Web of Linked Data
  • 36. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 36 Issues due to the Openness ● Data quality issues ● Accuracy ● Freshness / timeliness ● Believability / trustworthiness ● Data source quality issues ● Availability ● Reliability ● Data integration issues ● Coreferences: Publishers may use different URIs for denoting the same entity ● Schema heterogeneity: Publishers may use different vocabularies for their data For the purpose of discussing execution of queries in this tutorial, we largely ignore these issues.
  • 37. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 37 Outline  The Linked Data Principles  Paradigms for Querying Linked Data  Characteristics of the “Database System” √ √ √ Next part: 2. Theoretical Foundations ...
  • 38. WWW 2013 Tutorial on Linked Data Query Processing [ Introduction ] 38 These slides have been created by Olaf Hartig for the WWW 2013 tutorial on Link Data Query Processing Tutorial Website: http://db.uwaterloo.ca/LDQTut2013/ This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/)