Towards Answering Provenance-Enabled SPARQL Queries over
RDF Data Cubes
February 7, 2017
Kim Ahlstrøm, Katja Hose, and Torben Bach Pedersen
{kah|khose|tbp}@cs.aau.dk
Department of Computer Science
Aalborg University
Denmark
Agenda
Preliminaries
Resource Description Framework
RDF Data Cube
Provenance
Provenance-enabled Query
The Problem
Provenance-enabled Queries
Example
Our Solution
Materialization Strategy
Context Index
Experiments
Setup
Evaluation
Preliminaries
Resource Description Framework
What is RDF?
RDF is a standard model for data
interchange on the Web
What is it good for?
Linked data
Heterogeneous data
Publishing data in a searchable manner
What does it look like?
subject --predicate--> object
subject --predicate--> literal
[Figure: the Linked Open Data cloud diagram, http://lod-cloud.net/]
RDF
Example
[Figure: Graph A, in which sale1 has price 7 and product book1, and book1 has name “Dune” and author Frank_Herbert.]
Subject  Predicate  Object         Graph
sale1    price      7              A
sale1    product    book1          A
book1    name       “Dune”         A
book1    author     Frank_Herbert  A
SPARQL
SELECT (COUNT(?sale) AS ?sales)
FROM A
WHERE {
  ?sale product ?book .
  ?book author Frank_Herbert .
}
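To make the semantics of this query concrete, the following is a minimal Python sketch that evaluates the same two triple patterns by hand over the four quads of Graph A. It is only an illustration: the function name `count_sales_by_author` and the in-memory list representation are assumptions of this sketch, not part of the presented system, and a real deployment would use a SPARQL engine.

```python
# Quads copied from the Graph A table: (subject, predicate, object, graph).
QUADS = [
    ("sale1", "price", 7, "A"),
    ("sale1", "product", "book1", "A"),
    ("book1", "name", "Dune", "A"),
    ("book1", "author", "Frank_Herbert", "A"),
]


def count_sales_by_author(quads, graph, author):
    """Count matches of: ?sale product ?book . ?book author <author> ."""
    # FROM <graph>: restrict evaluation to the triples of one named graph.
    in_graph = [(s, p, o) for (s, p, o, g) in quads if g == graph]
    # Second pattern: books written by the given author.
    books = {s for (s, p, o) in in_graph if p == "author" and o == author}
    # First pattern joined on ?book: sales whose product is one of those books.
    sales = [s for (s, p, o) in in_graph if p == "product" and o in books]
    return len(sales)


print(count_sales_by_author(QUADS, "A", "Frank_Herbert"))  # prints 1
```

Restricting the triples to one graph first mirrors what the FROM clause does; this restriction is the hook that provenance-enabled querying relies on later in the talk.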
RDF Data Cube
[Figure: a three-dimensional data cube with the axes product, location, and time; each cell holds a numeric measure value.]
Observations, Measures,
Dimensions, and Levels
Pure RDF data cubes [a, b]
Hybrid RDF data cubes [c]
[a] RDF Analytics: Lenses over Semantic Graphs, WWW’14.
[b] Modeling and querying data warehouses on the semantic web using QB4OLAP, DaWaK’14.
[c] Fusion Cubes: Towards Self-Service Business Intelligence, IJDWM’13.
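Example
[Figure: cube instance graph. Observation :sale1 links via :shop to :shop1 (:title “Barnes & Noble”), via :product to :book1 (:name “Dune”), via :price to 7, and via :date to :date31012016, which rolls up through skos:broader to :january2016 (:month “January”) and then to :2016 (:year “2016”).]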
Provenance Data
Provenance
Provenance (from the French provenir, "to come from") is the chronology of the
ownership, custody or location of a historical object [1]
Where does it come from?
Generated when data is manipulated [2]
What do we use it for?
Quality & Trust
Access control
Debugging
How do we represent provenance data?
PROV-O vocabulary [3]
[1] Oxford Dictionary.
[2] Automated metadata generation for linked data generation and publishing workflows, LDOW’16.
[3] https://www.w3.org/TR/prov-o/
RDF and provenance data
Reification
Named graphs
Singleton properties
N-ary relations
Nanopublications
Named graphs
[Figure: Graph A, in which sale1 has price 7 and product book1, and book1 has name “Dune” and author Frank_Herbert.]
Subject         Predicate            Object
:A              rdf:type             prov:Entity
:A              prov:wasGeneratedBy  :DataExtractor
:DataExtractor  prov:used            :DBpedia
:DBpedia        :status              "trusted"
Provenance-enabled Query
Provenance query
Selects the provenance graphs that satisfy conditions on the provenance data.
Analytical query
Queries the RDF data cube, restricted to the data in the selected provenance graphs.
The Problem
Provenance-enabled Queries
How do we answer provenance-enabled queries over RDF data cubes?
Cube Instance Data
[Figure: cube instance graph with the observations :sale1 and :sale2, the shops :shop1 (“Barnes & Noble”) and :shop2 (“Amazon”), the books :book1 (“Dune”) and :book2 (“Dracula”), the prices 7 and 12, and the date :date31012016, which rolls up via skos:broader to :january2016 (“January”) and :2016 (“2016”). Edge labels: :shop, :product, :name, :title, :price, :date, skos:broader, :month, :year.]
Provenance Data
Subject Predicate Object
:Blue prov:wasGeneratedBy :BookExtractor
:BookExtractor prov:used :DBpedia
:DBpedia :status "trusted"
:Purple prov:wasGeneratedBy :ShopExtractor
:ShopExtractor prov:used :Database
:Database :status "trusted"
:Teal prov:wasGeneratedBy :CalenderExtractor
:CalenderExtractor prov:used :CSVFile
:CSVFile :status "trusted"
:Brown prov:wasGeneratedBy :WebTableExtractor
:WebTableExtractor prov:used :WebTable
:WebTable :status "unknown"
Provenance-enabled Query
Provenance Query
All provenance graphs generated from trusted sources
SELECT ?provenanceGraph
WHERE {
  ?provenanceGraph prov:wasGeneratedBy ?activity .
  ?activity prov:used ?entity .
  ?entity :status "trusted" .
}
Analytical Query
Revenue of books sold in shop “Barnes & Noble”
SELECT (SUM(?price) AS ?revenue)
WHERE {
  ?observation :price ?price .
  ?observation :shop ?shop .
  ?shop :title "Barnes & Noble" .
}
Naive Strategy
1. Evaluate Provenance Query
2. Reformulate Analytical Query
3. Execute Analytical Query
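Naive Strategy
Step 1: provenance triples that match the provenance query (the :Brown graph drops out because :WebTable has status "unknown")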
Subject Predicate Object
:Blue prov:wasGeneratedBy :BookExtractor
:BookExtractor prov:used :DBpedia
:DBpedia :status "trusted"
:Purple prov:wasGeneratedBy :ShopExtractor
:ShopExtractor prov:used :Database
:Database :status "trusted"
:Teal prov:wasGeneratedBy :CalenderExtractor
:CalenderExtractor prov:used :CSVFile
:CSVFile :status "trusted"
Naive Strategy
Step 1
Provenance Query
All provenance graphs generated from trusted sources
SELECT ?provenanceGraph
WHERE {
  ?provenanceGraph prov:wasGeneratedBy ?activity .
  ?activity prov:used ?entity .
  ?entity :status "trusted" .
}
Result
:Blue, :Purple, and :Teal
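The result above can be checked by hand. The following Python sketch joins the three triple patterns of the provenance query over the provenance triples from the example. The helper name `trusted_graphs` and the tuple representation are assumptions made for illustration; they are not taken from the presented system.

```python
# Provenance triples copied from the "Provenance Data" table.
PROV = [
    (":Blue", "prov:wasGeneratedBy", ":BookExtractor"),
    (":BookExtractor", "prov:used", ":DBpedia"),
    (":DBpedia", ":status", "trusted"),
    (":Purple", "prov:wasGeneratedBy", ":ShopExtractor"),
    (":ShopExtractor", "prov:used", ":Database"),
    (":Database", ":status", "trusted"),
    (":Teal", "prov:wasGeneratedBy", ":CalenderExtractor"),
    (":CalenderExtractor", "prov:used", ":CSVFile"),
    (":CSVFile", ":status", "trusted"),
    (":Brown", "prov:wasGeneratedBy", ":WebTableExtractor"),
    (":WebTableExtractor", "prov:used", ":WebTable"),
    (":WebTable", ":status", "unknown"),
]


def trusted_graphs(triples, status="trusted"):
    """Evaluate the provenance query by joining its three patterns."""
    # ?entity :status "trusted"
    entities = {s for (s, p, o) in triples if p == ":status" and o == status}
    # ?activity prov:used ?entity
    activities = {s for (s, p, o) in triples if p == "prov:used" and o in entities}
    # ?provenanceGraph prov:wasGeneratedBy ?activity
    return sorted(
        s for (s, p, o) in triples
        if p == "prov:wasGeneratedBy" and o in activities
    )


print(trusted_graphs(PROV))  # prints [':Blue', ':Purple', ':Teal']
```

:Brown is excluded because its source, :WebTable, has the status "unknown".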
Naive Strategy
Step 2
Analytical Query
Revenue of books sold in shop “Barnes & Noble”
SELECT (SUM(?price) AS ?revenue)
FROM :Blue
FROM :Purple
FROM :Teal
WHERE {
  ?observation :price ?price .
  ?observation :shop ?shop .
  ?shop :title "Barnes & Noble" .
}
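The reformulation in this step is mechanical: one FROM clause is added per graph returned by the provenance query. A minimal sketch of that rewriting is shown below, assuming the query is available as a SELECT line and a WHERE block; the function `add_from_clauses` is a hypothetical helper for illustration and does not parse SPARQL.

```python
def add_from_clauses(select_line, where_block, graphs):
    """Insert one FROM clause per allowed provenance graph."""
    from_lines = [f"FROM {graph}" for graph in graphs]
    return "\n".join([select_line, *from_lines, where_block])


WHERE = """WHERE {
  ?observation :price ?price .
  ?observation :shop ?shop .
  ?shop :title "Barnes & Noble" .
}"""

# The graph list is the result of step 1 in the running example.
query = add_from_clauses(
    "SELECT (SUM(?price) AS ?revenue)",
    WHERE,
    [":Blue", ":Purple", ":Teal"],
)
print(query)
```

The number of FROM clauses grows with the number of allowed provenance graphs, so the rewritten query grows with the size of the provenance query result.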
Naive Strategy
Step 3
[Figure: the cube instance data (:sale1, :sale2, their shops, books, prices, and dates) over which the reformulated analytical query is executed.]
Query Tree
Π(?price)
AGG(?price)
σ(?obs :price ?price)              over :Blue, :Purple, :Teal
σ(?obs :shop ?shop)                over :Blue, :Purple, :Teal
σ(?shop :title “Barnes & Noble”)   over :Blue, :Purple, :Teal
Overhead per join: every triple pattern is evaluated against all allowed provenance graphs
Our Solution
Materialization Strategy
1. Evaluate Provenance Query
2. Materialize Provenance Graphs
3. Execute Analytical Query
Materialization
Step 1
Provenance Query
All provenance graphs generated from trusted sources
SELECT ?provenanceGraph
WHERE {
  ?provenanceGraph prov:wasGeneratedBy ?activity .
  ?activity prov:used ?entity .
  ?entity :status "trusted" .
}
Result
:Blue, :Purple, and :Teal
Materialization
Step 2
[Figure: the cube instance data; the triples that belong to the selected provenance graphs are materialized as Cube’.]
Materialization
Step 2
Materialize Cube’
All triples from the trusted provenance graphs
CONSTRUCT { ?subject ?predicate ?object }
FROM :Blue
FROM :Purple
FROM :Teal
WHERE {
  ?subject ?predicate ?object
}
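The effect of materializing Cube’ and then querying it can be sketched in a few lines of Python. The sketch is illustrative only: the assignment of triples to provenance graphs is not given in the example, so the graph column below is an assumption, and :sale3 is a hypothetical observation added to show that data from the untrusted :Brown graph is left out.

```python
# Quads: (subject, predicate, object, provenance graph).
# ASSUMPTION: the graph assigned to each triple is invented for this sketch.
CUBE = [
    (":sale1", ":price", 7, ":Blue"),
    (":sale1", ":shop", ":shop1", ":Purple"),
    (":shop1", ":title", "Barnes & Noble", ":Purple"),
    (":sale2", ":price", 12, ":Blue"),
    (":sale2", ":shop", ":shop2", ":Purple"),
    (":shop2", ":title", "Amazon", ":Purple"),
    # HYPOTHETICAL observation from the untrusted graph.
    (":sale3", ":price", 5, ":Brown"),
    (":sale3", ":shop", ":shop1", ":Brown"),
]


def materialize(quads, allowed_graphs):
    """Step 2: copy every triple of an allowed graph into one triple set."""
    allowed = set(allowed_graphs)
    return {(s, p, o) for (s, p, o, g) in quads if g in allowed}


def revenue(triples, shop_title):
    """Step 3: sum ?price over observations whose shop has the given title."""
    shops = {s for (s, p, o) in triples if p == ":title" and o == shop_title}
    observations = {s for (s, p, o) in triples if p == ":shop" and o in shops}
    return sum(o for (s, p, o) in triples if p == ":price" and s in observations)


cube_prime = materialize(CUBE, [":Blue", ":Purple", ":Teal"])
print(revenue(cube_prime, "Barnes & Noble"))  # prints 7
```

Once Cube’ exists, the analytical query no longer needs to consider provenance graphs at all; the cost of filtering is paid once, during materialization.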
Materialization
Step 3
Analytical Query
Revenue of books sold in shop “Barnes & Noble”
SELECT (SUM(?price) AS ?revenue)
FROM :CUBE’
WHERE {
  ?observation :price ?price .
  ?observation :shop ?shop .
  ?shop :title "Barnes & Noble" .
}
Context Index
1. Analyze analytical query
2. Look up predicates
3. Filter provenance graphs
Context Index
Query Analysis
Analytical Query
Revenue of books sold in shop “Barnes & Noble”
SELECT (SUM(?price) AS ?revenue)
WHERE {
  ?observation :price ?price .
  ?observation :shop ?shop .
  ?shop :title "Barnes & Noble" .
}
Context Index
Query Analysis
[Figure: the cube instance data, showing where the predicates used by the analytical query (:price, :shop, :title) occur.]
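Context Index
Lookup
Root
  :product
    :name
  :shop
    :title
  :price
  :date
    skos:broader
      :month
      skos:broader
        :year
Row labels in the figure: observations, Level 1, Level 2, Level 3, provenance
Provenance graphs at the leaves: :Teal, :Teal, :Blue, :Brown, :Purple, :Purple
Paths looked up for the analytical query: Root → :shop → :title and Root → :price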
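The three steps of the context index (analyze the query, look up its predicates, filter the provenance graphs) can be sketched as follows. This is a deliberately flattened simplification: the index on the slide is a tree of predicate paths, whereas the sketch uses a plain dictionary from predicate to provenance graphs. The dictionary contents and the function names are assumptions made for illustration, since the example does not state which graph is attached to which leaf.

```python
# ASSUMPTION: hypothetical index content, predicate -> graphs containing it.
CONTEXT_INDEX = {
    ":price": {":Blue", ":Brown"},
    ":shop": {":Purple", ":Brown"},
    ":title": {":Purple"},
    ":name": {":Blue"},
    ":month": {":Teal"},
    ":year": {":Teal"},
}

# Triple patterns of the analytical query from the example.
PATTERNS = [
    ("?observation", ":price", "?price"),
    ("?observation", ":shop", "?shop"),
    ("?shop", ":title", "Barnes & Noble"),
]


def query_predicates(triple_patterns):
    """Step 1: collect the predicates used by the analytical query."""
    return {predicate for (_s, predicate, _o) in triple_patterns}


def relevant_graphs(index, predicates, allowed_graphs):
    """Steps 2 and 3: look up each predicate, then keep only allowed graphs."""
    found = set()
    for predicate in predicates:
        found |= index.get(predicate, set())
    return found & set(allowed_graphs)


graphs = relevant_graphs(
    CONTEXT_INDEX,
    query_predicates(PATTERNS),
    [":Blue", ":Purple", ":Teal"],  # result of the provenance query
)
print(sorted(graphs))  # prints [':Blue', ':Purple']
```

Under these assumed index contents, :Brown is dropped because it is not trusted and :Teal is dropped because it holds no predicate that the query touches, so fewer graphs need to be read than with the provenance query alone.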
Context Index
Combining Provenance Query & Context Index
[Figure: cube instance graph with the observations :sale1 and :sale2, the shops :shop1 (“Barnes & Noble”) and :shop2 (“Amazon”), the books :book1 (“Dune”) and :book2 (“Dracula”), the prices 7 and 12, and the date hierarchy :date31012016, :january2016 (“January”), :2016 (“2016”), connected by :shop, :product, :name, :title, :price, :date, skos:broader, :month, and :year edges.]
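One plausible reading of this combination step is that the graphs returned by the provenance query are intersected with the graphs the context index reports for the analytical query, and the analytical query is then restricted to that intersection. The sketch below illustrates this reading only. The `http://example.org/` namespace, the helper names, and the set of indexed graphs are assumptions; the trusted graphs (:Blue, :Purple, :Teal) follow the provenance data of the running example.

```python
PREFIX = "PREFIX : <http://example.org/>"  # hypothetical namespace


def combine(allowed_graphs, indexed_graphs):
    """Keep graphs that pass the provenance query and matter for the analytical query."""
    return set(allowed_graphs) & set(indexed_graphs)


def restrict_to_graphs(graphs):
    """Build the analytical query with one FROM clause per selected graph."""
    if not graphs:
        raise ValueError("no graph left to query")
    from_clauses = "\n".join(f"FROM <{iri}>" for iri in sorted(graphs))
    return (
        f"{PREFIX}\n"
        "SELECT (SUM(?price) AS ?revenue)\n"
        f"{from_clauses}\n"
        "WHERE {\n"
        "  ?observation :price ?price .\n"
        "  ?observation :shop ?shop .\n"
        '  ?shop :title "Barnes & Noble" .\n'
        "}"
    )


# Result of the provenance query in the running example: the trusted graphs.
trusted = {
    "http://example.org/Blue",
    "http://example.org/Purple",
    "http://example.org/Teal",
}
# Assumed context index result for the query's predicate paths.
indexed = {"http://example.org/Brown", "http://example.org/Purple"}

selected = combine(trusted, indexed)
query = restrict_to_graphs(selected)
```

With these assumed inputs only the :Purple graph survives, so the generated query carries a single FROM clause instead of one per trusted graph.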
Experiments
Setup
Dataset
Evaluation
[Figure: evaluation time in seconds (log scale, 10^1 to 10^4) against dataset size (1.0, 1.8, 8.0, and 13.5 million triples) for the Naive, Naive+Index, Materialized, and Materialized+Index strategies.]
Evaluation
[Figure: evaluation time in seconds (axis from 50 to 200) against the percentage size of the provenance query (10, 30, 50, 70, 90) for the Materialized and Materialized+Index strategies.]
Thank you for your attention


Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes

  • 1. Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes February 7, 2017 Kim Ahlstrøm, Katja Hose, and Torben Bach Pedersen {kah|khose|tbp}@cs.aau.dk Department of Computer Science Aalborg University Denmark
  • 2. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Agenda Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation
  • 3. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm 2 Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Preliminaries
  • 4. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 3 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Resource Description Framework What is RDF RDF is a standard model for data interchange on the Web
  • 5. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 3 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Resource Description Framework What is RDF RDF is a standard model for data interchange on the Web What is it good for? Linked data Hetrogenious data Publish data in a searchable manner
  • 6. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 3 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Resource Description Framework What is RDF RDF is a standard model for data interchange on the Web What is it good for? Linked data Hetrogenious data Publish data in a searchable manner What does it look like? subject object subject literal predicate predicate
  • 7. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 4 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark http://lod-cloud.net/
  • 8. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 5 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Example sale1 7 book1Frank_Herbert “Dune” product nameauthor price product nameauthor price Graph A
  • 9. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 5 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Example sale1 7 book1Frank_Herbert “Dune” product nameauthor price product nameauthor price Graph A
  • 10. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 5 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Example sale1 7 book1Frank_Herbert “Dune” product nameauthor price product nameauthor price Graph A
  • 11. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 5 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Example sale1 7 book1Frank_Herbert “Dune” product nameauthor price product nameauthor price Graph A
  • 12. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 5 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Example sale1 7 book1Frank_Herbert “Dune” product nameauthor price product nameauthor price Graph A Subject Predicate Object Graph sale1 price 7 A sale1 product book1 A book1 name “Dune” A book1 author Frank_Herbert A
  • 13. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries 6 Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark SPARQL SELECT ( count (? sale ) as ?sales ) FROM A WHERE { ?sale product ?book . ?book author Frank_Herbert . }
  • 14. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework 7 RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Data Cube 9 7 12 4 85 3 6 product location 1 88 9 8 1 5 6 2 4 9 7 1 5 6 88 2 3 9 tim e
  • 15. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework 7 RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Data Cube 9 7 12 4 85 3 6 product location 1 88 9 8 1 5 6 2 4 9 7 1 5 6 88 2 3 9 tim e Observations, Measures, Dimensions, and Levels
  • 16. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework 7 RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF Data Cube 9 7 12 4 85 3 6 product location 1 88 9 8 1 5 6 2 4 9 7 1 5 6 88 2 3 9 tim e Observations, Measures, Dimensions, and Levels Pure RDF data cubesab hybrid RDF data cubesc aRDF Analytics: Lenses over Semantic Graphs, WWW’14. bModeling and querying data warehouses on the semantic web using QB4OLAP, DaWaK’14. cFusion Cubes: Towards Self-Service Business Intelligence, IJDWM’13.
  • 17. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework 8 RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Example :sale1:shop1 “Barnes & Noble” :book1“Dune” 7 “January” “2016” :january2016 :2016 :date31012016 :shop :product :name :title :price :date skos:broader skos:broader :month :year
  • 18. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube 9 Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance Data Provenance Provenance (from the French provenir, "to come from"), is the chronology of the ownership, custody or location of a historical object1 1Oxfort Dictionary. 2Automated metadata generation for linked data generation and publishing workflows, LDOW’16 . 3https://www.w3.org/TR/prov-o/.
  • 19. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube 9 Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance Data Provenance Provenance (from the French provenir, "to come from"), is the chronology of the ownership, custody or location of a historical object1 Where does it come from? Generated when data is manipulated2 1Oxfort Dictionary. 2Automated metadata generation for linked data generation and publishing workflows, LDOW’16 . 3https://www.w3.org/TR/prov-o/.
  • 20. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube 9 Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance Data Provenance Provenance (from the French provenir, "to come from"), is the chronology of the ownership, custody or location of a historical object1 Where does it come from? Generated when data is manipulated2 What do we use it for? Quality & Trust Access control Debugging 1Oxfort Dictionary. 2Automated metadata generation for linked data generation and publishing workflows, LDOW’16 . 3https://www.w3.org/TR/prov-o/.
  • 21. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube 9 Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance Data Provenance Provenance (from the French provenir, "to come from"), is the chronology of the ownership, custody or location of a historical object1 Where does it come from? Generated when data is manipulated2 What do we use it for? Quality & Trust Access control Debugging How do we represent provenance data? Prov-o vocabulary3 1Oxfort Dictionary. 2Automated metadata generation for linked data generation and publishing workflows, LDOW’16 . 3https://www.w3.org/TR/prov-o/.
  • 22. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube 10 Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark RDF and provenance data Reification Named graphs Singleton properties n-ary Nano publications
  • 23. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube 11 Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Named graphs sale1 7 book1Frank_Herbert “Dune” product nameauthor price Graph A
  • 24. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube 11 Provenance Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Named graphs sale1 7 book1Frank_Herbert “Dune” product nameauthor price Graph A Subject Predicate Object :A rdf:type prov:entity :A prov:wasGeneratedBy :DataExtractor :DataExtractor prov:used :DBpedia :DBpedia :status "trusted"
  • 25. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance 12 Provenance-enabled Query The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance-enabled Query Provenance query Filters the provenance data. Analytical query Query the RDF data cube only containing allowed provenance data.
  • 26. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query 13 The Problem Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark The Problem
  • 27. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem 14 Provenance-enabled Queries Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance-enabled Queries How do we answer provenance-enabled queries over RDF data cubes?
  • 28. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries 15 Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Cube Instance Data :sale1:shop1 “Barnes & Noble” :book1 :sale2 “Dune” :shop2“Amazon” :book2“Dracula” 12 7 “January” “2016” :january2016 :2016 :date31012016 :shop:shop :shop:shop :product:product :product:product :name:name :name:name :title:title :title:title :price:price :price:price :date:date :date:date skos:broaderskos:broader skos:broaderskos:broader :month:month :year:year
  • 29. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries 15 Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Cube Instance Data :sale1:shop1 “Barnes & Noble” :book1 :sale2 “Dune” :shop2“Amazon” :book2“Dracula” 12 7 “January” “2016” :january2016 :2016 :date31012016 :shop:shop :shop:shop :product:product :product:product :name:name :name:name :title:title :title:title :price:price :price:price :date:date :date:date skos:broaderskos:broader skos:broaderskos:broader :month:month :year:year
  • 30. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries 16 Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance Data Subject Predicate Object :Blue prov:wasGeneratedBy :BookExtractor :BookExtractor prov:used :DBpedia :DBpedia :status "trusted" :Purple prov:wasGeneratedBy :ShopExtractor :ShopExtractor prov:used :Database :Database :status "trusted" :Teal prov:wasGeneratedBy :CalenderExtractor :CalenderExtractor prov:used :CSVFile :CSVFile :status "trusted" :Brown prov:wasGeneratedBy :WebTableExtractor :WebTableExtractor prov:used :WebTable :WebTable :status "unknown"
  • 31. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries 17 Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance-enabled Query Provenance Query All sources that are trusted SELECT ?provenanceGraph WHERE { ?provenanceGraph prov : wasGeneratedBy ? a c t i v i t y . ? a c t i v i t y prov : used ? e n t i t y . ? e n t i t y : status " trusted " . }
  • 32. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries 17 Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Provenance-enabled Query Provenance Query All sources that are trusted SELECT ?provenanceGraph WHERE { ?provenanceGraph prov : wasGeneratedBy ? a c t i v i t y . ? a c t i v i t y prov : used ? e n t i t y . ? e n t i t y : status " trusted " . } Analytical Query Revenue of books sold in shop “Barnes & Noble” SELECT sum(? price ) WHERE { ?observation : price ? price . ?observation : shop ?shop . ?shop : t i t l e " Barnes & Noble " . }
  • 33. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries 18 Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Naive Strategy 1. Evaluate Provenance Query 2. Reformulate Analytical Query 3. Execute Analytical Query
  • 34. 39 Towards Answering Provenance-Enabled SPARQL Queries over RDF Data Cubes Kim Ahlstrøm Preliminaries Resource Description Framework RDF Data Cube Provenance Provenance-enabled Query The Problem Provenance-enabled Queries 19 Example Our Solution Materialization Strategy Context Index Experiments Setup Evaluation Dept. of Computer Science Aalborg University Denmark Naive Strategy Step 1 Provenance Query All sources that are trusted SELECT ?provenanceGraph WHERE { ?provenanceGraph prov : wasGeneratedBy ? a c t i v i t y . ? a c t i v i t y prov : used ? e n t i t y . ? e n t i t y : status " trusted " . }
  • 35. Naive Strategy Step 1
    Subject              Predicate            Object
    :Blue                prov:wasGeneratedBy  :BookExtractor
    :BookExtractor       prov:used            :DBpedia
    :DBpedia             :status              "trusted"
    :Purple              prov:wasGeneratedBy  :ShopExtractor
    :ShopExtractor       prov:used            :Database
    :Database            :status              "trusted"
    :Teal                prov:wasGeneratedBy  :CalenderExtractor
    :CalenderExtractor   prov:used            :CSVFile
    :CSVFile             :status              "trusted"
    :Brown               prov:wasGeneratedBy  :WebTableExtractor
    :WebTableExtractor   prov:used            :WebTable
    :WebTable            :status              "unknown"
  • 37. Naive Strategy Step 1
    Subject              Predicate            Object
    :Blue                prov:wasGeneratedBy  :BookExtractor
    :BookExtractor       prov:used            :DBpedia
    :DBpedia             :status              "trusted"
    :Purple              prov:wasGeneratedBy  :ShopExtractor
    :ShopExtractor       prov:used            :Database
    :Database            :status              "trusted"
    :Teal                prov:wasGeneratedBy  :CalenderExtractor
    :CalenderExtractor   prov:used            :CSVFile
    :CSVFile             :status              "trusted"
  • 38. Naive Strategy Step 1
    Provenance Query: All sources that are trusted
      SELECT ?provenanceGraph
      WHERE {
        ?provenanceGraph prov:wasGeneratedBy ?activity .
        ?activity prov:used ?entity .
        ?entity :status "trusted" .
      }
    Result: :Blue, :Purple, and :Teal
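    The provenance query on this slide is a join of three triple patterns. A minimal Python sketch of that join over the example provenance triples follows; the function name `trusted_graphs` and the in-memory list representation are assumptions made for illustration, not part of the presented system.

```python
# Provenance triples (subject, predicate, object) as listed in the example table.
PROVENANCE = [
    (":Blue", "prov:wasGeneratedBy", ":BookExtractor"),
    (":BookExtractor", "prov:used", ":DBpedia"),
    (":DBpedia", ":status", "trusted"),
    (":Purple", "prov:wasGeneratedBy", ":ShopExtractor"),
    (":ShopExtractor", "prov:used", ":Database"),
    (":Database", ":status", "trusted"),
    (":Teal", "prov:wasGeneratedBy", ":CalenderExtractor"),
    (":CalenderExtractor", "prov:used", ":CSVFile"),
    (":CSVFile", ":status", "trusted"),
    (":Brown", "prov:wasGeneratedBy", ":WebTableExtractor"),
    (":WebTableExtractor", "prov:used", ":WebTable"),
    (":WebTable", ":status", "unknown"),
]


def trusted_graphs(triples):
    """Hypothetical helper: join the three patterns of the provenance query."""
    # ?activity prov:used ?entity
    used = {(s, o) for s, p, o in triples if p == "prov:used"}
    # ?entity :status "trusted"
    trusted = {s for s, p, o in triples if p == ":status" and o == "trusted"}
    result = []
    # ?provenanceGraph prov:wasGeneratedBy ?activity
    for graph, p, activity in triples:
        if p != "prov:wasGeneratedBy":
            continue
        if any(a == activity and e in trusted for a, e in used):
            result.append(graph)
    return result


print(trusted_graphs(PROVENANCE))
```

    On the example data this selects :Blue, :Purple, and :Teal; :Brown is dropped because its source has status "unknown".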
  • 39. Naive Strategy Step 2
    Analytical Query: Revenue of books sold in shop “Barnes & Noble”
      SELECT sum(?price)
      FROM :Blue
      FROM :Purple
      FROM :Teal
      WHERE {
        ?observation :price ?price .
        ?observation :shop ?shop .
        ?shop :title "Barnes & Noble" .
      }
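    The reformulation step can be sketched as plain string rewriting: one FROM clause per graph returned by the provenance query is placed before WHERE. This is only an illustration under simplifying assumptions; `add_from_clauses` is a hypothetical helper, and a real engine would more likely rewrite the parsed query than its text.

```python
ANALYTICAL_QUERY = """SELECT sum(?price)
WHERE {
  ?observation :price ?price .
  ?observation :shop ?shop .
  ?shop :title "Barnes & Noble" .
}"""


def add_from_clauses(query, graphs):
    """Hypothetical helper: insert one FROM clause per graph before WHERE."""
    head, keyword, tail = query.partition("WHERE")
    if not keyword:
        raise ValueError("query has no WHERE clause")
    from_block = "".join(f"FROM {graph}\n" for graph in graphs)
    return f"{head.rstrip()}\n{from_block}WHERE{tail}"


rewritten = add_from_clauses(ANALYTICAL_QUERY, [":Blue", ":Purple", ":Teal"])
print(rewritten)
```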
  • 40. Naive Strategy Step 3
    [Figure: example data cube graph. Nodes: :sale1, :sale2, :shop1, :shop2, :book1, :book2, :date31012016, :january2016, :2016. Literals: “Barnes & Noble”, “Amazon”, “Dune”, “Dracula”, 12, 7, “January”, “2016”. Edge labels: :shop, :product, :name, :title, :price, :date, skos:broader, :month, :year.]
  • 42. Query Tree
    Operators: Π(?price), AGG(?price), σ(?obs :price ?price), σ(?obs :shop ?shop), σ(?shop :name “Barnes & Noble”)
    Graph inputs, shown three times in the tree: :Blue, :Purple, :Teal
    Overhead per join
  • 43. Our Solution
  • 44. Materialization Strategy
    1. Evaluate Provenance Query
    2. Materialize Provenance Graphs
    3. Execute Analytical Query
  • 45. Materialization Step 1
    Provenance Query: All sources that are trusted
      SELECT ?provenanceGraph
      WHERE {
        ?provenanceGraph prov:wasGeneratedBy ?activity .
        ?activity prov:used ?entity .
        ?entity :status "trusted" .
      }
    Result: :Blue, :Purple, and :Teal
  • 46. Materialization Step 2
    [Figure: example data cube graph. Nodes: :sale1, :sale2, :shop1, :shop2, :book1, :book2, :date31012016, :january2016, :2016. Literals: “Barnes & Noble”, “Amazon”, “Dune”, “Dracula”, 12, 7, “January”, “2016”. Edge labels: :shop, :product, :name, :title, :price, :date, skos:broader, :month, :year.]
  • 48. Materialization Step 2
    Materialize Cube’: Revenue of books sold in shop “Barnes & Noble”
      CONSTRUCT { ?subject ?predicate ?object }
      FROM :Blue
      FROM :Purple
      FROM :Teal
      WHERE { ?subject ?predicate ?object }
  • 49. Materialization Step 3
    Analytical Query: Revenue of books sold in shop “Barnes & Noble”
      SELECT sum(?price)
      FROM :CUBE’
      WHERE {
        ?observation :price ?price .
        ?observation :shop ?shop .
        ?shop :title "Barnes & Noble" .
      }
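    The materialization steps can be illustrated with a toy in-memory store: the trusted graphs are copied once into a single graph (the role of Cube’), and the analytical query then runs against that one graph. The assignment of triples to graphs, the :sale3 observation, and the helper names `materialize` and `revenue` are all hypothetical; they are not taken from the slides.

```python
# Hypothetical named graphs; which triple lives in which graph is assumed.
GRAPHS = {
    ":Blue": {(":sale1", ":price", 12), (":sale2", ":price", 7)},
    ":Purple": {
        (":sale1", ":shop", ":shop1"),
        (":sale2", ":shop", ":shop2"),
        (":shop1", ":title", "Barnes & Noble"),
        (":shop2", ":title", "Amazon"),
    },
    ":Teal": {(":sale1", ":date", ":date31012016")},
    # Untrusted graph with an invented observation, to show the filtering effect.
    ":Brown": {(":sale3", ":price", 99), (":sale3", ":shop", ":shop1")},
}


def materialize(graphs, selected):
    """Union the selected named graphs into one triple set (like CONSTRUCT)."""
    cube = set()
    for name in selected:
        cube |= graphs[name]
    return cube


def revenue(triples, shop_title):
    """Evaluate the three patterns of the analytical query and sum the prices."""
    shops = {s for s, p, o in triples if p == ":title" and o == shop_title}
    sales = {s for s, p, o in triples if p == ":shop" and o in shops}
    return sum(o for s, p, o in triples if p == ":price" and s in sales)


cube_prime = materialize(GRAPHS, [":Blue", ":Purple", ":Teal"])
print(revenue(cube_prime, "Barnes & Noble"))
```

    After materialization the analytical query touches a single graph, so the per-graph handling is paid once at materialization time and not again for every triple pattern.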
  • 50. Context Index
    1. Analyze analytical query
    2. Lookup predicates
    3. Filter provenance graphs
  • 51. Context Index Query Analysis
    Analytical Query: Revenue of books sold in shop “Barnes & Noble”
      SELECT sum(?price)
      WHERE {
        ?observation :price ?price .
        ?observation :shop ?shop .
        ?shop :title "Barnes & Noble" .
      }
  • 52. Context Index Query Analysis
    [Figure: example data cube graph. Nodes: :sale1, :sale2, :shop1, :shop2, :book1, :book2, :date31012016, :january2016, :2016. Literals: “Barnes & Noble”, “Amazon”, “Dune”, “Dracula”, 12, 7, “January”, “2016”. Edge labels: :shop, :product, :name, :title, :price, :date, skos:broader, :month, :year.]
  • 53. Context Index Lookup
    [Figure: context index tree. Labels: Root; Level 1, Level 2, Level 3; observations; provenance. Predicate nodes: :product, :name, :shop, :title, :price, :date, skos:broader, :month, :year. Graph labels: :Teal, :Blue, :Brown, :Purple. A second tree shows Root with :shop, :title, and :price.]
  • 55. Context Index: Combining Provenance Query & Context Index
    [Figure: example data cube graph. Nodes: :sale1, :sale2, :shop1, :shop2, :book1, :book2, :date31012016, :january2016, :2016. Literals: “Barnes & Noble”, “Amazon”, “Dune”, “Dracula”, 12, 7, “January”, “2016”. Edge labels: :shop, :product, :name, :title, :price, :date, skos:broader, :month, :year.]
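    The three context index steps (analyze the query, look up its predicates, filter the provenance graphs) can be sketched with a flat predicate-to-graphs map. This is a deliberate simplification: the slides show a tree with several levels, whereas the sketch below uses a single dictionary. The graph contents and the helper names `build_context_index` and `filter_graphs` are hypothetical.

```python
# Hypothetical named graphs; which triple lives in which graph is assumed.
GRAPHS = {
    ":Blue": [(":book1", ":title", "Dune")],
    ":Purple": [
        (":sale1", ":shop", ":shop1"),
        (":shop1", ":title", "Barnes & Noble"),
        (":sale1", ":price", 12),
    ],
    ":Teal": [
        (":date31012016", "skos:broader", ":january2016"),
        (":january2016", ":month", "January"),
    ],
    ":Brown": [(":sale2", ":price", 7)],
}

# Result of the provenance query and the patterns of the analytical query.
TRUSTED = [":Blue", ":Purple", ":Teal"]
PATTERNS = [
    ("?observation", ":price", "?price"),
    ("?observation", ":shop", "?shop"),
    ("?shop", ":title", "Barnes & Noble"),
]


def build_context_index(graphs):
    """Map each predicate to the set of graphs that contain it."""
    index = {}
    for name, triples in graphs.items():
        for _s, predicate, _o in triples:
            index.setdefault(predicate, set()).add(name)
    return index


def filter_graphs(index, patterns, candidate_graphs):
    """Keep only candidate graphs that contain a predicate used by the query."""
    predicates = {p for _s, p, _o in patterns}      # step 1: analyze the query
    relevant = set()
    for predicate in predicates:                    # step 2: look up predicates
        relevant |= index.get(predicate, set())
    return [g for g in candidate_graphs if g in relevant]  # step 3: filter


index = build_context_index(GRAPHS)
print(filter_graphs(index, PATTERNS, TRUSTED))
```

    In this toy data :Teal is trusted but holds none of the query's predicates, so it is skipped; :Brown holds :price but is excluded because the provenance query did not return it.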
  • 58. Experiments
  • 59. Setup: Dataset
  • 60. Evaluation
    [Plot: evaluation time (sec; axis ticks 10^1 to 10^4) versus millions of triples (1.0, 1.8, 8.0, 13.5). Series: Naive, Naive+Index, Materialized, Materialized+Index.]
  • 61. Evaluation
    [Plot: evaluation time (sec; axis ticks 50 to 200) versus percentage size of provenance query (10, 30, 50, 70, 90). Series: Materialized, Materialized+Index.]
  • 62. Thank you for your attention