SlideShare a Scribd company logo
TermPicker:	Enabling	the	Reuse	of	
Vocabulary	Terms	by	Exploi:ng	Data	
from	the	Linked	Open	Data	Cloud	
Johann	Schaible,	Thomas	Go2ron,	and	Ansgar	Scherp.	
at	ESWC	2016
Problem	statement	
2
§  When	modeling	LOD,	it	is	accustomed	to	reuse	
vocabulary	terms	(à	classes	and	proper;es)		
§  However,	it	is	a	challenging	task	
swrc:Publication
http://ex.com/001
http://ex.com/002
http://ex.com/003
http://ex.com/p/001
http://ex.com/p/002
http://ex.com/p/003
swrc:Person
?
8
>><
>>:
dc:creator
foaf:maker
dcterms:creator
swrc:author
Need	for	Vocabulary	Term	
Recommenda;ons
Term	recommenda:ons		
based	on…?	
3
§  rdfs:domain,	rdfs:range,	and	other	informa;on	
encoded	in	vocabularies?	
§  Popularity	of	a	vocabulary	term?	
§  Classes	and	proper;es	from	domain	specific	
vocabularies?	
§  etc.	
Which	vocabulary	terms	did	other	data	providers	
on	the	LOD	cloud	use	in	a	similar	scenario?
How	to	capture	a	Scenario?	
4
§  A	scenario	is	defined	by	vocabulary	terms	used	
for	a	part	of	a	model	à	paKerns	on	schema	level	
Example:	
slp = ({swrc:Publication}, {dc:creator}, {foaf:Person})
Resources	of	type	swrc:PublicaFon	are	connected	to	
resources	of	type	foaf:Person	via	the	property	dc:creator	
slp = (sts, ps, ots)
Schema-Level	PaIerns	(SLPs)	
A	tuple	describing	the	connec;on	between	two	
sets	of	classes	via	a	set	of	proper;es		
In	General:
Vocabulary	Term	
Recommenda:ons	Based	on	LOD	
5
Recommender of vocabulary terms:{x1, ..., xn}
query input I
query-SLP: slpq = ({mo:SoloMusicArtist}, ?, ?)
Feature Computation
{F(slpq, x1), ..., F(slpq, xn)}{F(slpq, x1), ..., F(slpq, xn)}
II
Ranking Model
III
%({F(slpq, x1), ..., F(slpq, xn)})
query output IV
Classes for subject:
properties:
Clases for object:<..., mo:Record, mo:MusicGroup,...>
<..., mo:MusicArtist, foaf:Person,...>
<..., foaf:made,..., mo:member of,...>
Overview	
6
Feature Computation
{F(slpq, x1), ..., F(slpq, xn)}{F(slpq, x1), ..., F(slpq, xn)}
Recommender of vocabulary terms:
Ranking Model
query input
query output IV
I
II
III
Classes for subject:
properties:
Clases for object:
query-SLP:
{x1, ..., xn}
%({F(slpq, x1), ..., F(slpq, xn)})
<..., mo:Record, mo:MusicGroup,...>
slpq = ({mo:SoloMusicArtist}, ?, ?)
<..., mo:MusicArtist, foaf:Person,...>
<..., foaf:made,..., mo:member of,...>
Feature	Computa:on:	
The	SLP-Feature	
7
slpq = ({mo:SoloMusicArtist}, {}, {})
If slpq ✓ slpi (slpi 2 SLPLOD)
Then Sets of recommendations: slpi slpq
Collabora;ve	
Filtering	
Classes for subject: < mo:MusicArtist, dbo:Actor >
Properties: < mo:member of, foaf:made, mo:recorded >
Classes for object: < mo:MusicBand, mo:Record >
SLPLOD = {({mo:SoloMusicArtist, mo:MusicArtist}, {mo:member of}, {mo:MusicBand})
({mo:SoloMusicArtist, dbo:Actor}, {foaf:made, mo:recorded}, {mo:Record})
({foaf:Person}, {foaf:knows}, {foaf:Person})
}
SLPLOD = SPLs	computed	from	datasets	on	the	LOD	cloud
Feature	Computa:on:	
State	of	the	Art	Features1	
8
Feature Definition of the Feature
f1 Number of datasets on the LOD cloud using the recommendation
candidate x
f2 Number of datasets on the LOD cloud using the vocabulary Vx of
recommendation candidate x
f3 Total number of occurrences of recommendation candidate x on the
LOD cloud
f4 Whether the recommendation candidate x is from a vocabulary that
is already used in query-SLP slpq
f1 f3: Reusing	popular	vocabularies/vocabulary	terms	
f4: Reusing	vocabulary	terms	from	the	same	vocabulary	
1)	Schaible,	GoKron,	and	Scherp:	Survey	on	Common	Strategies	of	Vocabulary	Reuse	in	Linked	Open	Data	Modeling	
(ESWC	2104)
Overview	
9
Feature Computation
{F(slpq, x1), ..., F(slpq, xn)}{F(slpq, x1), ..., F(slpq, xn)}
Recommender of vocabulary terms:
Ranking Model
query input
query output IV
I
II
III
Classes for subject:
properties:
Clases for object:
query-SLP:
{x1, ..., xn}
%({F(slpq, x1), ..., F(slpq, xn)})
<..., mo:Record, mo:MusicGroup,...>
slpq = ({mo:SoloMusicArtist}, ?, ?)
<..., mo:MusicArtist, foaf:Person,...>
<..., foaf:made,..., mo:member of,...>
Calcula:ng	a	Ranking	Model	
10
How	to	weight	
the	feature	
values?	
§  Learning	to	Rank	(L2R):		
}  Family	of	supervised	machine	learning	algorithms	
based	on	data	with	relevance	annota;ons	
}  state	of	the	art	in	IR	to	compute	a	generalized	ranking	
model	over	a	given	set	of	features	
}  Ranking	model	is	derived	by	observing	correla;ons	
between	feature	values	and	candidate	relevance	
F f1 f2 f3 f4 SLP-feature
(slpq, x1) 7 9 20 1 4
(slpq, x2) 3 3 5 0 6
(slpq, x3) 10 20 80 0 2
(slpq, x4) 4 20 29 1 4
Overview	
11
Feature Computation
{F(slpq, x1), ..., F(slpq, xn)}{F(slpq, x1), ..., F(slpq, xn)}
Recommender of vocabulary terms:
Ranking Model
query input
query output IV
I
II
III
Classes for subject:
properties:
Clases for object:
query-SLP:
{x1, ..., xn}
%({F(slpq, x1), ..., F(slpq, xn)})
<..., mo:Record, mo:MusicGroup,...>
slpq = ({mo:SoloMusicArtist}, ?, ?)
<..., mo:MusicArtist, foaf:Person,...>
<..., foaf:made,..., mo:member of,...>
12
Evalua:on	
§  Baseline	POP:	Reuse	popular	vocabulary	terms	
}  Based	on	features:	
§  Baseline	SAME:	Reuse	terms	from	same	vocabulary	
}  Based	on	features:		
	
§  SLP-feature-based:	U;lizing	the	SLP-feature	
}  Based	on	features:	
What	is	the	benefit	of	the	SLP-feature?	
f1 f3
f1 f4
f1 f4 (+ SLP-feature )
13
Evalua:on	Procedure	
§  Offline	evalua;on	with	hidden	informa;on	
§  Measuring	quality	of	recommenda;ons	
}  Mean	Average	Precision	(MAP	)	
}  Mean	Reciprocal	Rank	at	the	first	5	posi;on	(MRR@5)	
§  Use	of	the	RankLib2	library	
slpq = ({mo:SoloMusicArtist}, {foaf:made}, {mo:Record})
Example:	
Randomly	hidden	term:	“foaf:made”	
< foaf:name, mo:remixed, foaf:made, ... >Result	list:	
2)	hKps://sourceforge.net/p/lemur/wiki/RankLib/
14
Evalua:on	Data	for	
Recommenda:ons	
§  Two	evalua;ons	based	on	BTC	20143	and	DyLDO4	
BTC 2014 DyLDO
# of triples first 34 mio. (reduce overhead) 10.8 mio
# of PLDs 3, 500 382
# of distinct terms 5.5 mio. 2.3 mio.
# of vocabularies 1, 500 600
# of computed SLPs 227, 000 118, 000
§  10-fold	leave-one-out	valida;on	based	on	PLDs	
3)	hKp://km.aid.kit.edu/projects/btc-2014/		
4)	hKp://swse.deri.org/dyldo/
15
Results	–	Box	Plots	MAP	
L2R	Algorithm	
Features	used
16
Discussion	
§  Using	“from	same	vocabulary”-feature	not	
significant	
}  à	only	in	few	cases	terms	from	same	vocabulary	are	
used	
§  Using	SLPs	significant	improvement	(ca.	35%	in	
MAP)	
}  à	already	now:	looking	at	how	others	model	their	data	
§  BeKer	performance	on	BTC	2014	
}  More	data	in	BTC	2014	to	train	the	ranking	model	
}  37%	more	relevant	candidates
17
Conclusion	
§  Using	SLPs,	relevant	recommenda;ons	are	ranked	
significantly	higher	in	the	result	list	
}  Can	aid	the	engineer	even	more	in	modeling	data	in	a	
way	how	other	data	providers	do	
§  Using	L2R,	the	more	relevant	candidates	correlate	
with	a	feature,	the	beKer	the	results	
However,	
Offline	evalua;ons	do	not	observe	actual	
user	behavior	à	online	evalua;on	needed5	
5)	Performed	in	Schaible,	Szekely,	Scherp:	“Comparing	Vocabulary	Term	Recommenda;ons	using	Associa;on	Rules	
and	Learning	To	Rank:	A	User	Study”	at	ESWC	2016
Thank	You!	
18
Tool	URL:	hKp://termpicker.lodrec.org	
Evalua;on	data	and	raw	results:	hKps://github.com/WanjaSchaible/l2r_eval_material

More Related Content

What's hot (20)

PPTX
Semantic web meetup – sparql tutorial
AdonisDamian
 
PPT
From SQL to SPARQL
George Roth
 
PDF
Data translation with SPARQL 1.1
andreas_schultz
 
PPT
SPARQL in a nutshell
Fabien Gandon
 
ODP
Graph Data -- RDF and Property Graphs
andyseaborne
 
PPTX
Democratizing Big Semantic Data management
WU (Vienna University of Economics and Business)
 
PDF
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Olaf Hartig
 
PPTX
SPARQL-DL - Theory & Practice
Adriel Café
 
PPT
Slides
butest
 
PPTX
SPARQL 1.1 Status
LeeFeigenbaum
 
PDF
Introduction to R for Data Science :: Session 8 [Intro to Text Mining in R, M...
Goran S. Milovanovic
 
PPT
Twinkle: A SPARQL Query Tool
Leigh Dodds
 
PDF
useR! 2012 Talk
rtelmore
 
PPT
SPARQL Tutorial
Leigh Dodds
 
PDF
Revealing Entities From Texts With a Hybrid Approach
Julien PLU
 
PPTX
Efficient source selection for sparql endpoint federation
Muhammad Saleem
 
PPTX
Introduction to SPARQL
Jose Emilio Labra Gayo
 
PDF
Link Discovery Tutorial Part I: Efficiency
Holistic Benchmarking of Big Linked Data
 
PDF
Linked Open Data Visualization
Laura Po
 
Semantic web meetup – sparql tutorial
AdonisDamian
 
From SQL to SPARQL
George Roth
 
Data translation with SPARQL 1.1
andreas_schultz
 
SPARQL in a nutshell
Fabien Gandon
 
Graph Data -- RDF and Property Graphs
andyseaborne
 
Democratizing Big Semantic Data management
WU (Vienna University of Economics and Business)
 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Olaf Hartig
 
SPARQL-DL - Theory & Practice
Adriel Café
 
Slides
butest
 
SPARQL 1.1 Status
LeeFeigenbaum
 
Introduction to R for Data Science :: Session 8 [Intro to Text Mining in R, M...
Goran S. Milovanovic
 
Twinkle: A SPARQL Query Tool
Leigh Dodds
 
useR! 2012 Talk
rtelmore
 
SPARQL Tutorial
Leigh Dodds
 
Revealing Entities From Texts With a Hybrid Approach
Julien PLU
 
Efficient source selection for sparql endpoint federation
Muhammad Saleem
 
Introduction to SPARQL
Jose Emilio Labra Gayo
 
Link Discovery Tutorial Part I: Efficiency
Holistic Benchmarking of Big Linked Data
 
Linked Open Data Visualization
Laura Po
 

Viewers also liked (20)

PDF
RailsGirls Rotterdam - Github (and Octocats!)
Anton Bangratz
 
PPTX
Mood-based Classification of TV Programmes - Jana Eggink, Sam Davies, Denise...
sebastianewert
 
PPTX
BBC R&D Datasets - Jana Eggink (Semantic Media @ BBC, Feb 2013)
sebastianewert
 
PPTX
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Sören Auer
 
PPTX
Semantic Media Project Introduction - Mark Sandler (Barbican Arts Centre, Oct...
sebastianewert
 
ODP
DBpedia: A Public Data Infrastructure for the Web of Data
Sebastian Hellmann
 
PPT
DBpedia Framework - BBC Talk
Georgi Kobilarov
 
PDF
Semantic Linking of Information, Content and Metadata for Early Music (SLICKM...
sebastianewert
 
PDF
Large-Scale Capture of Producer-Defined Musical Semantics - Ryan Stables (Sem...
sebastianewert
 
PPTX
Exploring the British Library's audio collections - Richard Ranft (Semantic M...
sebastianewert
 
PDF
Inducing Predictive Clustering Trees for Datatype properties Values
Giuseppe Rizzo
 
PDF
One Web of pages, One Web of peoples, One Web of Services, One Web of Data, O...
Fabien Gandon
 
PDF
DBpedia/association Introduction The Hague 12.2.2016
Sebastian Hellmann
 
PPTX
OWL-based validation by Gavin Mendel Gleasonand Bojan Bozic, Trinity College,...
semanticsconference
 
PPT
Lecture 11 Unstructured Data and the Data Warehouse
phanleson
 
PDF
Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...
semanticsconference
 
PDF
Apache Any23 - Anything to Triples
Michele Mostarda
 
PDF
Semantic Web and Web 3.0 - Web Technologies (1019888BNR)
Beat Signer
 
PPTX
Semantic Web Intro - St. Patrick's Day 2016 Update
Eric Franzon
 
PDF
DBpedia Tutorial - Feb 2015, Dublin
m_ackermann
 
RailsGirls Rotterdam - Github (and Octocats!)
Anton Bangratz
 
Mood-based Classification of TV Programmes - Jana Eggink, Sam Davies, Denise...
sebastianewert
 
BBC R&D Datasets - Jana Eggink (Semantic Media @ BBC, Feb 2013)
sebastianewert
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Sören Auer
 
Semantic Media Project Introduction - Mark Sandler (Barbican Arts Centre, Oct...
sebastianewert
 
DBpedia: A Public Data Infrastructure for the Web of Data
Sebastian Hellmann
 
DBpedia Framework - BBC Talk
Georgi Kobilarov
 
Semantic Linking of Information, Content and Metadata for Early Music (SLICKM...
sebastianewert
 
Large-Scale Capture of Producer-Defined Musical Semantics - Ryan Stables (Sem...
sebastianewert
 
Exploring the British Library's audio collections - Richard Ranft (Semantic M...
sebastianewert
 
Inducing Predictive Clustering Trees for Datatype properties Values
Giuseppe Rizzo
 
One Web of pages, One Web of peoples, One Web of Services, One Web of Data, O...
Fabien Gandon
 
DBpedia/association Introduction The Hague 12.2.2016
Sebastian Hellmann
 
OWL-based validation by Gavin Mendel Gleasonand Bojan Bozic, Trinity College,...
semanticsconference
 
Lecture 11 Unstructured Data and the Data Warehouse
phanleson
 
Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...
semanticsconference
 
Apache Any23 - Anything to Triples
Michele Mostarda
 
Semantic Web and Web 3.0 - Web Technologies (1019888BNR)
Beat Signer
 
Semantic Web Intro - St. Patrick's Day 2016 Update
Eric Franzon
 
DBpedia Tutorial - Feb 2015, Dublin
m_ackermann
 
Ad

Similar to TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from the Linked Open Data Cloud (20)

PPTX
Sparql
Tamrat Amare
 
PDF
A Generic Mapping-based Query Translation from SPARQL to Various Target Datab...
Franck Michel
 
PPTX
SPARQL introduction and training (130+ slides with exercices)
Thomas Francart
 
PDF
Sparklis exploration et interrogation de points d'accès sparql par interactio...
SemWebPro
 
PPTX
Introduction to SPARQL
Jose Emilio Labra Gayo
 
PDF
Linked (Open) Data
Bernhard Haslhofer
 
PDF
Comparing Vocabulary Term Recommendations using Association Rules and Learnin...
JohannWanja
 
PPTX
semantic web & natural language
Nurfadhlina Mohd Sharef
 
PPTX
Bringing It All Together: Mapping Continuing Resources Vocabularies for Linke...
NASIG
 
PPTX
Slides
shahriar-ro
 
PPT
Sparql
Serge Garlatti
 
PPT
Wi presentation
Saeedeh Shekarpour
 
PDF
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
semanticsconference
 
PDF
BioSD Tutorial 2014 Editition
Rothamsted Research, UK
 
PPTX
20130622 okfn hackathon t2
Seonho Kim
 
PPT
Semantic Web: From Representations to Applications
Guus Schreiber
 
KEY
Creating web applications with LODSPeaKr
Alvaro Graves
 
PPTX
Loupe model - Use Cases and Requirements
Nandana Mihindukulasooriya
 
PDF
Metaprograms and metadata (as part of the the PTT lecture)
Ralf Laemmel
 
PPT
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Takeshi Morita
 
Sparql
Tamrat Amare
 
A Generic Mapping-based Query Translation from SPARQL to Various Target Datab...
Franck Michel
 
SPARQL introduction and training (130+ slides with exercices)
Thomas Francart
 
Sparklis exploration et interrogation de points d'accès sparql par interactio...
SemWebPro
 
Introduction to SPARQL
Jose Emilio Labra Gayo
 
Linked (Open) Data
Bernhard Haslhofer
 
Comparing Vocabulary Term Recommendations using Association Rules and Learnin...
JohannWanja
 
semantic web & natural language
Nurfadhlina Mohd Sharef
 
Bringing It All Together: Mapping Continuing Resources Vocabularies for Linke...
NASIG
 
Slides
shahriar-ro
 
Wi presentation
Saeedeh Shekarpour
 
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
semanticsconference
 
BioSD Tutorial 2014 Editition
Rothamsted Research, UK
 
20130622 okfn hackathon t2
Seonho Kim
 
Semantic Web: From Representations to Applications
Guus Schreiber
 
Creating web applications with LODSPeaKr
Alvaro Graves
 
Loupe model - Use Cases and Requirements
Nandana Mihindukulasooriya
 
Metaprograms and metadata (as part of the the PTT lecture)
Ralf Laemmel
 
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Takeshi Morita
 
Ad

Recently uploaded (20)

PDF
thesis dr Zahida and samia on plasma physics.pdf
HamzaKhalid267437
 
PPTX
CNS.pptx Central nervous system meninges ventricles of brain it's structure a...
Ashwini I Chuncha
 
PPTX
Slideshow 2 about cows and how they procreate
chig22222
 
PDF
Preserving brand authenticity amid AI-driven misinformation: Sustaining consu...
Selcen Ozturkcan
 
PDF
20250603 Recycling 4.pdf . Rice flour, aluminium, hydrogen, paper, cardboard.
Sharon Liu
 
PDF
Rapid protoplanet formation in the outer Solar System recorded in a dunite fr...
Sérgio Sacani
 
PDF
A High-Caliber View of the Bullet Cluster through JWST Strong and Weak Lensin...
Sérgio Sacani
 
PDF
Plant growth promoting bacterial non symbiotic
psuvethapalani
 
DOCX
Critical Book Review (CBR) - "Hate Speech: Linguistic Perspectives"
Sahmiral Amri Rajagukguk
 
PPTX
Q1_Science 8_Week3-Day 1.pptx science lesson
AizaRazonado
 
PPTX
Microbiome_Engineering_Poster_Fixed.pptx
SupriyaPolisetty1
 
PDF
Carbonate formation and fluctuating habitability on Mars
Sérgio Sacani
 
PDF
Pharmakon of algorithmic alchemy: Marketing in the age of AI
Selcen Ozturkcan
 
PPTX
SCHOOL HOLIDAY REVISION CHAPTER 8.pptx science kssm
SITIATHIRAHBINTISULA
 
PDF
RANKING THE MICRO LEVEL CRITICAL FACTORS OF ELECTRONIC MEDICAL RECORDS ADOPTI...
hiij
 
PPTX
770043401-q1-Ppt-pe-and-Health-7-week-1-lesson-1.pptx
AizaRazonado
 
PDF
2025 Insilicogen Company English Brochure
Insilico Gen
 
PDF
soil and environmental microbiology.pdf
Divyaprabha67
 
PPTX
Systamatic Acquired Resistence (SAR).pptx
giriprasanthmuthuraj
 
PDF
The ALMA-CRISTAL survey: Gas, dust, and stars in star-forming galaxies when t...
Sérgio Sacani
 
thesis dr Zahida and samia on plasma physics.pdf
HamzaKhalid267437
 
CNS.pptx Central nervous system meninges ventricles of brain it's structure a...
Ashwini I Chuncha
 
Slideshow 2 about cows and how they procreate
chig22222
 
Preserving brand authenticity amid AI-driven misinformation: Sustaining consu...
Selcen Ozturkcan
 
20250603 Recycling 4.pdf . Rice flour, aluminium, hydrogen, paper, cardboard.
Sharon Liu
 
Rapid protoplanet formation in the outer Solar System recorded in a dunite fr...
Sérgio Sacani
 
A High-Caliber View of the Bullet Cluster through JWST Strong and Weak Lensin...
Sérgio Sacani
 
Plant growth promoting bacterial non symbiotic
psuvethapalani
 
Critical Book Review (CBR) - "Hate Speech: Linguistic Perspectives"
Sahmiral Amri Rajagukguk
 
Q1_Science 8_Week3-Day 1.pptx science lesson
AizaRazonado
 
Microbiome_Engineering_Poster_Fixed.pptx
SupriyaPolisetty1
 
Carbonate formation and fluctuating habitability on Mars
Sérgio Sacani
 
Pharmakon of algorithmic alchemy: Marketing in the age of AI
Selcen Ozturkcan
 
SCHOOL HOLIDAY REVISION CHAPTER 8.pptx science kssm
SITIATHIRAHBINTISULA
 
RANKING THE MICRO LEVEL CRITICAL FACTORS OF ELECTRONIC MEDICAL RECORDS ADOPTI...
hiij
 
770043401-q1-Ppt-pe-and-Health-7-week-1-lesson-1.pptx
AizaRazonado
 
2025 Insilicogen Company English Brochure
Insilico Gen
 
soil and environmental microbiology.pdf
Divyaprabha67
 
Systamatic Acquired Resistence (SAR).pptx
giriprasanthmuthuraj
 
The ALMA-CRISTAL survey: Gas, dust, and stars in star-forming galaxies when t...
Sérgio Sacani
 

TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from the Linked Open Data Cloud