Sahar Vahdati
Christoph Lange
Giorgos Alexiou
George Papastefanatos
Making Use of the Linked Open Data
Services for OpenAIRE:
Querying Data about Research Results, Person, Projects and
Organizations
Digital Infrastructure for Research (DI4R)
28-30 September 2016
Kraków, Poland
University of Bonn, Germany; Athena Research Center
Session outline
• Introduction to OpenAIRE
• Technical Concepts
• Hands-on Session
Open Access Infrastructure for Research
in Europe
Need for digital research infrastructures for all kinds of
research outputs, across disciplines and countries!
• comprises a database of all EC FP7 and H2020 funded research projects, publications and datasets
• manages scientific publications and associated scientific material
• aggregates Open Access publications and links them to research data and funding bodies
• supports the Open Access principles via national helpdesks and comprehensive guidelines
http://www.openaire.eu
OpenAIRE Services
OpenAIRE focuses on:
• Workflows and processes of scholarly communication rather than resources,
• Research data and other research outputs rather than only publications,
• The links between considered entities,
• Relationship of European OA infrastructures with other regions of the world.
enables search, discovery and monitoring of the publications and
datasets resulting from:
>100k research projects
>17m publications
>23k datasets
>5k repositories.
OpenAIRE Data Model
[Figure: data model diagram showing core entities and linking entities]
Example of data about Core Entities
An entity of type Result:
Entity type            Result
openaireID             od_______908::fac3db85bbcb1f52ae07c5868d8fb453
dateOfTransformation   2015-02-06
dateOfCollection       2015-02-06
title                  A Patient from Argentina Infected with Rickettsia massiliae
Dateofacceptance       01/04/2010
Publisher              The American Society of Tropical Medicine and Hygiene
Pid                    oai:europepmc.org:2077077;PMC2844561
Language               English
Subject                Articles
BestLicense            Open Access
Interlink to other databases
Support researchers by answering interesting queries
The OpenAIRE vision:
• Data about scientific events → emergence of scientific topics
• Data about people's affiliations → impact of certain research
Use cases:
• Research managers → use new indicators for measuring quality
• Policy makers → get a quick overview of the findings and projects
• Researchers → find comprehensive citation lists and follow research movement between communities/organizations
• Reviewers → get a quick overview of the field covered by the paper or dataset under review
Challenges supported by LOD Services
Linked Open Data
(LOD)
RDF data model
Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets!
• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas
Expected values
• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with and added value to related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space
Exposing the OpenAIRE Information Space as linked data!
Towards OpenAIRE LOD Services
Phase 1: LOD Production
Phase 2: Interlinking OpenAIRE RDF Graph to LOD cloud
Steps:
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation
OA RDF graph
…
@prefix oad: <http://lod.openaire.eu/data/> .
@prefix oav: <http://lod.openaire.eu/vocab#> .
@prefix dbpedia-owl: <http://dbpedia.org/ontology/> .
@prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#> .
@prefix pext: <http://www.ontotext.com/proton-ontology/#> .
@prefix swrc: <http://swrc.ontoware.org/ontology#> .
oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
    foaf:firstName "P."^^xsd:string ; foaf:lastName "Jha"^^xsd:string ;
    foaf:name "Jha, P."^^xsd:string ; oav:isAuthorOf .
oad:755469c995c2cb6cb55c3483634b026 a foaf:Person ;
    oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
    oav:ranking "6"^^xsd:integer .
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person ;
    foaf:firstName "T."^^xsd:string ; foaf:lastName "Bere"^^xsd:string ;
    foaf:name "Bere, T."^^xsd:string .
…
[Figure: Phase 1, LOD Production. Diagram labels: OpenAIRE data, OA RDF, core entities, linking entities, specify vocabularies]
OpenAIRE data as RDF Graph
Entity type     Count
Organizations   68,526
Results*        17,414,766
Persons         62,958,315
Datasources     19,443
Projects        624,417
*including duplicates connected with sameAs
Total Number of Triples: 1,013,527,855
Distinct Entities: 98,256
Phase 2. Interlinking OA-RDF Graph to LOD cloud
Steps:
• Identify datasets to interlink with
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow
[Figure: the OA RDF graph becomes OA LOD by linking it to datasets in the Linked Open Data (LOD) cloud, e.g. DBLP, CiteSeer and CEUR; further dataset labels are truncated in the source]
http://beta.lod.openaire.eu/
RDF (Resource Description Framework)
• Resource : anything uniquely identifiable
• Description: description of resource via representing properties and relations
• Framework: web-based protocols and semantics
• RDF triples: List of statements
Subject (URI)              oad:publication1
Predicate (URI)            oav:hasAuthor
Object (URI or Literal)    "Juan Carlos García"
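A triple can be modelled as a plain three-element tuple. The sketch below is illustrative only: it expands the prefixed names of the example into full URIs, assuming the `oad:` and `oav:` namespaces declared on the earlier slides, and prints the statement as one N-Triples line. The helper names `expand` and `to_ntriples` are hypothetical and not part of any OpenAIRE tooling.

```python
# Namespaces as declared on the slides; treat them as assumptions.
PREFIXES = {
    "oad": "http://lod.openaire.eu/data/",
    "oav": "http://lod.openaire.eu/vocab#",
}


def expand(term):
    """Turn a prefixed name such as 'oad:publication1' into '<full URI>'."""
    prefix, local = term.split(":", 1)
    return "<" + PREFIXES[prefix] + local + ">"


def to_ntriples(subject, predicate, obj, literal=False):
    """Serialise one (subject, predicate, object) statement as N-Triples."""
    if literal:
        # Escape backslashes and double quotes inside the literal.
        escaped = obj.replace("\\", "\\\\").replace('"', '\\"')
        obj_text = '"' + escaped + '"'
    else:
        obj_text = expand(obj)
    return f"{expand(subject)} {expand(predicate)} {obj_text} ."


triple = ("oad:publication1", "oav:hasAuthor", "Juan Carlos García")
line = to_ntriples(*triple, literal=True)
print(line)
```

Subject and predicate are always URIs, so only the object needs the literal/URI switch.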
RDF version of example
PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>
:od_______908::… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection "2015-02-06"^^xsd:date .
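The mapping from a flat metadata record to triples can be sketched as a lookup table from field names to predicates. This is a minimal illustration, not the project's RDFization code: the three field-to-predicate pairs are taken from the example above, while the subject name `oad:result1` and the function `record_to_triples` are hypothetical.

```python
# Field-to-predicate pairs as they appear in the RDF version of the example.
FIELD_TO_PREDICATE = {
    "Publisher": "dcterms:publisher",
    "Subject": "oav:resultSubject",
    "dateOfCollection": "oav:dateOfCollection",
}


def record_to_triples(subject, record, mapping=FIELD_TO_PREDICATE):
    """Emit one (subject, predicate, literal) triple per mapped, non-empty field."""
    return [
        (subject, mapping[field], value)
        for field, value in record.items()
        if field in mapping and value
    ]


record = {
    "Publisher": "The American Society of Tropical Medicine and Hygiene",
    "Subject": "Articles",
    "dateOfCollection": "2015-02-06",
    "Language": "English",  # no predicate given in the example, so it is skipped
}

# "oad:result1" is a placeholder subject, not a real OpenAIRE identifier.
triples = record_to_triples("oad:result1", record)
for triple in triples:
    print(triple)
```

Fields without an entry in the mapping are dropped silently here; a production pipeline would more likely log them.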
Example of data about Linking entities
An entity of type Person_Result whose ranking property can have the value 1 to
indicate the first author.
od_______908::f39…1c4a    PersonResult    od_______908::fa3...b453
rdf:type foaf:Person;                     rdf:type cerif:ResultEntity
oav:rank 1.
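How a ranking on the linking entity recovers author order can be sketched with a few in-memory links. The identifiers and the dictionary layout below are invented for illustration; only the idea that rank 1 marks the first author comes from the slide.

```python
# Hypothetical Person_Result links; identifiers are made up for illustration.
links = [
    {"person": "person:A", "result": "result:1", "rank": 2},
    {"person": "person:B", "result": "result:1", "rank": 1},
    {"person": "person:C", "result": "result:2", "rank": 1},
]


def authors_in_order(links, result):
    """Return the persons linked to `result`, sorted by their rank."""
    matching = [link for link in links if link["result"] == result]
    return [link["person"] for link in sorted(matching, key=lambda l: l["rank"])]


ordered = authors_in_order(links, "result:1")
print(ordered[0])  # the person whose link has rank 1
```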
How to query RDF?
SPARQL (SPARQL Protocol and RDF Query Language)
• Query language for RDF-based data
• SPARQL endpoint: RDF-triple database on a server available on the Web
• Pattern matching language
• Protocol layer
• Query interface
How to query?
• SPARQL variables are bound to RDF terms e.g., ?title , ?author
• Inspired by SQL via SELECT statement
Example: SELECT ?title ?author
• Return as a table
?title                                                        ?author
A Patient from Argentina Infected with Rickettsia massiliae   Juan Carlos García
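The pattern-matching idea behind SELECT can be sketched in a few lines: each triple pattern either extends a set of variable bindings or rejects it, and the selected variables are projected into rows. This is a toy evaluator under simplifying assumptions (strings only, no FILTER, no OPTIONAL), not how a SPARQL engine is implemented; the predicate `dcterms:title` is assumed for the example.

```python
def match(pattern, triple, binding):
    """Try to extend `binding` so that `pattern` matches `triple`."""
    new = dict(binding)
    for p, t in zip(pattern, triple):
        if p.startswith("?"):
            # A variable must keep the value it was bound to earlier.
            if new.setdefault(p, t) != t:
                return None
        elif p != t:
            return None
    return new


def select(variables, patterns, triples):
    """Evaluate a list of triple patterns and project the given variables."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [
            extended
            for binding in bindings
            for triple in triples
            if (extended := match(pattern, triple, binding)) is not None
        ]
    return [tuple(b[v] for v in variables) for b in bindings]


triples = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]
rows = select(
    ["?title", "?author"],
    [("?pub", "dcterms:title", "?title"), ("?pub", "oav:hasAuthor", "?author")],
    triples,
)
for row in rows:
    print(row)
```

The shared variable `?pub` is what joins the two patterns, much like a join key in SQL.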
OpenAIRE as LOD
• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities
http://beta.lod.openaire.eu
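Online access through a SPARQL endpoint follows the SPARQL Protocol: the query text is sent as the `query` parameter of an HTTP request. The sketch below only builds the GET URL and does not send it, since the BETA endpoint may not be reachable; the endpoint address is the one given in the hands-on part of this session, and `build_request_url` is a hypothetical helper.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# BETA endpoint as given in the hands-on part; availability is not guaranteed.
ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_request_url(query, endpoint=ENDPOINT):
    """Return the GET URL that carries `query` in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


query = "SELECT ?s WHERE { ?s ?p ?o } LIMIT 5"
url = build_request_url(query)
print(url)

# Round-trip check: decoding the URL yields the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(decoded == query)
```

To run the query, the URL could be fetched with any HTTP client, typically with an `Accept` header asking for a SPARQL results format.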
Data conforming to LOD best practices
published in BETA, December 2015
Main entities, Linking entities
http://beta.lod.openaire.eu/
Sample query
SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
ORDER BY ?count
Number of projects per funding level
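What the aggregation in the sample query computes can be mirrored on a handful of rows: group by funding level, count distinct subjects per group, and sort by the count. The rows and funding-level values below are invented for illustration and do not come from the OpenAIRE data.

```python
from collections import defaultdict

# Toy (project, funding level) pairs; the values are invented for illustration.
rows = [
    ("project:1", "FP7"),
    ("project:2", "FP7"),
    ("project:2", "FP7"),  # repeated row, counted once because of DISTINCT
    ("project:3", "H2020"),
]


def count_distinct_by_level(rows):
    """Group projects by funding level and count distinct projects per group."""
    groups = defaultdict(set)
    for project, level in rows:
        groups[level].add(project)
    counts = [(len(projects), level) for level, projects in groups.items()]
    return sorted(counts)  # ascending by count, like ORDER BY ?count


print(count_distinct_by_level(rows))
```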
General architecture
[Figure: architecture diagram. Labelled components: OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, OA Vocabulary, OA Data Model, LOD Client, HTML Browser. Labelled endpoints: https://www.openaire.eu and http://beta.lod.openaire.eu. Labelled output formats: HTML, RDF.]
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop key (ID), value ([Properties]))
• Store in HDFS
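The pruning and key-value steps above can be sketched on two toy records. The field names and the set of properties to keep are placeholders, not the schema of any real dump, and the sketch runs in memory rather than on Hadoop or HDFS.

```python
# Hypothetical raw records; field names are placeholders, not a real dump schema.
records = [
    {"id": "rec1", "title": "Some Title", "year": "1996", "internal_note": "x"},
    {"id": "rec2", "title": "Another Title", "year": "", "internal_note": "y"},
]

KEEP = ("title", "year")  # assumed set of properties that matter for matching


def to_key_value(record, keep=KEEP):
    """Return (ID, [(property, value), ...]) with unused or empty fields pruned."""
    properties = [(name, record[name]) for name in keep if record.get(name)]
    return record["id"], properties


pairs = [to_key_value(r) for r in records]
for key, value in pairs:
    print(key, value)
```

Keying every record by its ID is what lets a later matching stage group candidates without rescanning the full dump.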
Sample interlinking result
The result of interlinking is a set of links between URIs from the source and
target datasets:
<http://lod.openaire...bde783> owl:sameAs <http://dblp.l3s.../BoissonnatN96>
<http://lod.openaire...4f8964> owl:sameAs <http://dblp.l3s.../Shrobe96>
<http://lod.openaire...27fea2> owl:sameAs <http://dblp.l3s.../X96c>
<http://lod.openaire...f433b9> owl:sameAs <http://dblp.l3s.../LiroyG96>
Note: the DBLP dump is not complete.
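How such a link set might arise can be sketched with a naive title comparison. This is a deliberately simplified stand-in, not the LIMES or Silk API: those tools evaluate configurable link specifications, whereas the sketch compares titles with Python's `difflib` and an assumed threshold of 0.9. All URIs and titles are invented.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Ratio in [0, 1] between two strings, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def link(source, target, threshold=0.9):
    """Yield (source URI, 'owl:sameAs', target URI) for sufficiently similar titles."""
    for s_uri, s_title in source.items():
        for t_uri, t_title in target.items():
            if similarity(s_title, t_title) >= threshold:
                yield (s_uri, "owl:sameAs", t_uri)


# Invented URIs and titles, for illustration only.
source = {"ex:oa/1": "Linked Data for Research Outputs"}
target = {
    "ex:dblp/a": "Linked data for research outputs",
    "ex:dblp/b": "A completely different paper",
}
links = list(link(source, target))
print(links)
```

The nested loop compares every pair, which is why real interlinking over millions of records needs blocking or a distributed setup.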
Ideas for LOD in Monitoring
Monitoring interlinking: when the target dataset grows from one version to the
next, we can expect the link set to grow as well.
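Such monitoring can be sketched as a comparison of two versions of a link set. The links below are invented, and `growth` is a hypothetical helper; a real monitoring service would read the link sets from the RDF store.

```python
def growth(old_links, new_links):
    """Compare two versions of a link set (sets of (source, target) pairs)."""
    added = new_links - old_links
    removed = old_links - new_links
    return {
        "old": len(old_links),
        "new": len(new_links),
        "added": len(added),
        "removed": len(removed),
    }


# Invented link sets for two versions of a target dataset.
v1 = {("oa:1", "dblp:a"), ("oa:2", "dblp:b")}
v2 = {("oa:1", "dblp:a"), ("oa:2", "dblp:b"), ("oa:3", "dblp:c")}
report = growth(v1, v2)
print(report)
```

A link set that shrinks or stays flat while the target dataset grows would be a signal worth investigating.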
Scientific events
Bootstrapping datasets for scientific events:
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object?)
• Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …
Hands on
http://beta.lod.openaire.eu/sparql
Example: What is the overall research output
of a given project?
oav:produces and UNION are not working:
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
SELECT ?x ?y
WHERE
{
  ?y a cerif:ResultEntity .
  { ?y oav:resultType 'dataset' }
  UNION
  { ?y oav:resultType 'publication' }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
} LIMIT 10
Example: Which organizations are more
active than others with respect to projects?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?o
WHERE
{
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
} LIMIT 10
Example: Which datasets have been published by a
specific person who is involved in a given
project?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE
{
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
} LIMIT 10
Example: List the full names of all authors
who have (co-)authored a publication in
project P.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE
{
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10

Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

  • 1. Sahar Vahdati Christoph Lange Giorgos Alexiou George Papastefanatos Making Use of the Linked Open Data Services for OpenAIRE: Querying Data about Research Results, Person, Projects and Organizations Digital Infrastructure for Research (DI4R) 28-30 September 2016 Krakau, Poland University of Bonn, Germany Athena Research Center
  • 2. Session outline • Introduction to OpenAIRE • Technical Concepts • Hands on Session
  • 3. Open Access Infrastructure for Research in Europe Need for digital research infrastructures for all kinds of research outputs, across disciplines and countries! •comprises a database of all EC FP7 and H2020 funded research projects, publications, datasets •manages scientific publications and associated scientific material •aggregates Open Access publications and links them to research data and funding bodies •supports the Open Access principles via national helpdesks and comprehensive guidelines http://www.openaire.eu
  • 4. OpenAIRE Services OpenAIRE focuses on: • Workflows and processes of scholarly communication rather than resources, • Research data and other research outputs rather than only publications, • The links between considered entities, • Relationship of European OA infrastructures with other regions of the world. enables search, discovery and monitoring of the publications and datasets resulting from: >100k research projects >17m publications >23k datasets >5k repositories.
  • 6. Example of data about Core Entities Entity type Result openaireID od_______908::fac3db85bbcb1f52ae07c5868d8fb453 dateOfTransformation 2015-02-06 dateOfCollection 2015-02-06 title A Patient from Argentina Infected with Rickettsia massiliae Dateofacceptance 01/04/2010 Publisher The American Society of Tropical Medicine and Hygiene Pid oai:europepmc.org:2077077;PMC2844561 Language English Subject Articles BestLicense Open Acces An entity of type Result
  • 7. Interlink to other databases Support researchers by answering interesting queries The OpenAIRE vision: • Data about scientific events  emergence of scientific topics • Data about people affiliation  impact of certain research
  • 8. Use cases: • Research managers  use new indicators for measuring the quality • Policy makers  get a quick overview of the findings and projects • Researchers  find comprehensive citations list, research movement between communities/organizations • Reviewers  get a quick overview of the field covered by the paper or dataset under review
  • 9. Challenges supported by LOD Services Linked Open Data (LOD) RDF data model Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets! • Diverse data formats • Various means to access/query data • Use of different identifiers • Heterogeneity of metadata schemas
  • 10. Expected values: exposing the OpenAIRE Information Space as linked data!
      • Open up a window to the Linked Open Data Web
      • Increase the technical interoperability of OpenAIRE
      • Increase the reusability of the OpenAIRE research metadata
      • Engage with additional user communities
      • Explore synergies with and added value to related open content initiatives
      • Provide links through LOD to similar infrastructures
      • Offer new services for OA data monitoring activities
      • Provide services to export the OpenAIRE objects as a LOD graph
      • Facilitate integration with other LOD graphs related to similar systems and infrastructures
      • Find patterns to enrich the OpenAIRE information space
  • 11. Towards OpenAIRE LOD Services
      Phase 1: LOD Production
      Phase 2: Interlinking the OpenAIRE RDF Graph to the LOD cloud
  • 12. Phase 1. LOD Production
    Steps:
      • Specify an RDF vocabulary
      • Specify terms and namespaces
      • Map the OA data model to an RDF data model
      • Map the OA data to a static RDF dump
      • Specify strategies to automate the RDF generation
    Excerpt of the OA RDF graph:
      …
      @prefix oad: <http://lod.openaire.eu/data/> .
      @prefix oav: <http://lod.openaire.eu/vocab#> .
      @prefix dbpedia-owl: <http://dbpedia.org/ontology/> .
      @prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#> .
      @prefix pext: <http://www.ontotext.com/proton-ontology/#> .
      @prefix swrc: <http://swrc.ontoware.org/ontology#> .
      oad:07553d8e646b69b868a9791da39a1802 a foaf:Person;
        foaf:firstName "P."^^xsd:string;
        foaf:lastName "Jha"^^xsd:string;
        foaf:name "Jha, P."^^xsd:string;
        oav:isAuthorOf .
      oad:755469c995c2cb6cb55c3483634b026 a foaf:Person;
        oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095;
        oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string;
        oav:ranking "6"^^xsd:integer.
      oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person;
        foaf:firstName "T."^^xsd:string;
        foaf:lastName "Bere"^^xsd:string;
        foaf:name "Bere, T."^^xsd:string.
      …
    Figure labels: OpenAIRE data (core entities, linking entities), OA RDF
  • 14. OpenAIRE data as RDF Graph
      Organizations: 68,526
      Results*: 17,414,766
      Persons: 62,958,315
      Datasources: 19,443
      Projects: 624,417
      * including duplicates connected with sameAs
    Total number of triples: 1,013,527,855
    Distinct entities: 98,256
  • 15. Phase 2. Interlinking OA-RDF Graph to LOD cloud
    Steps:
      • Identify datasets to be interlinked to
      • Select interlinking tools: LIMES, Silk
      • Test interlinking OA with DBLP and DBpedia
      • Evaluate resulting link sets
      • Specify strategy for interlinking in OA workflow
    Figure labels: OA LOD (http://beta.lod.openaire.eu/), Linked Open Data (LOD), target datasets DBLP, CiteSeer, CEUR, …
  • 16. RDF (Resource Description Framework)
      • Resource: anything uniquely identifiable
      • Description: description of a resource via its properties and relations
      • Framework: web-based protocols and semantics
      • RDF triples: a list of statements of the form Subject (URI), Predicate (URI), Object (URI or literal)
    Example: oad:publication1 oav:hasAuthor "Juan Carlos García"
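To make the triple structure concrete, here is a minimal Python sketch that stores statements as (subject, predicate, object) tuples. Only the first triple comes from the slide; the second triple and the helper name `objects_of` are illustrative assumptions, not part of the OpenAIRE data or any RDF library.

```python
from typing import List, Tuple

# A triple is a (subject, predicate, object) statement.
Triple = Tuple[str, str, str]

triples: List[Triple] = [
    # Statement taken from the slide.
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
    # Hypothetical second statement, added only for illustration.
    ("oad:publication1", "dcterms:publisher", "Example Publisher"),
]


def objects_of(data: List[Triple], subject: str, predicate: str) -> List[str]:
    """Return the objects of all statements with the given subject and predicate."""
    return [o for s, p, o in data if s == subject and p == predicate]


print(objects_of(triples, "oad:publication1", "oav:hasAuthor"))
```

A real application would use an RDF store rather than a Python list; the list is only meant to show that a graph is a set of such statements.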
  • 17. RDF version of example
      PREFIX dcterms: <http://purl.org/dc/terms/>
      …
      PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
      PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
      PREFIX prov: <http://www.w3.org/ns/prov#>
      :od_______908::… rdf:type cerif:ResultEntity;
        dcterms:description "The first confirmed case";
        dcterms:publisher "The American Society of Tropical Medicine and Hygiene";
        …
        oav:resultSubject "Articles";
        oav:dateOfCollection "2015-02-06".
  • 18. Example of data about Linking entities
    An entity of type Person_Result whose ranking property can have the value 1 to indicate the first author.
      od_______908::f39…1c4a: rdf:type foaf:Person; oav:rank 1.
      PersonResult link to:
      od_______908::fa3...b453: rdf:type cerif:ResultEntity
  • 19. How to query RDF? SPARQL (SPARQL Protocol and RDF Query Language)
      • Query language for RDF-based data
      • SPARQL endpoint: an RDF triple database on a server, available on the Web
      • Pattern matching language
      • Protocol layer
      • Query interface
  • 20. How to query?
      • SPARQL variables are bound to RDF terms, e.g., ?title, ?author
      • Inspired by SQL via the SELECT statement. Example: SELECT ?title ?author
      • Results are returned as a table:
          ?title = A Patient from Argentina Infected with Rickettsia massiliae
          ?author = Juan Carlos García
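The idea of binding variables by pattern matching can be sketched in a few lines of Python. This is a toy matcher under stated assumptions, not how a SPARQL engine is implemented: the function names `match` and `select` are made up, and the predicate `dcterms:title` is assumed for illustration (the slides do not show which predicate carries the title).

```python
def match(pattern, triple, binding):
    """Extend `binding` so that `pattern` matches `triple`; return None if impossible."""
    result = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable: bind it, or check it against its existing binding.
            if result.setdefault(term, value) != value:
                return None
        elif term != value:
            # A constant that does not match.
            return None
    return result


def select(variables, patterns, data):
    """Return one row per solution, projecting the requested variables."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [
            extended
            for b in bindings
            for t in data
            if (extended := match(pattern, t, b)) is not None
        ]
    return [tuple(b[v] for v in variables) for b in bindings]


data = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]

rows = select(
    ["?title", "?author"],
    [("?pub", "dcterms:title", "?title"), ("?pub", "oav:hasAuthor", "?author")],
    data,
)
print(rows)
```

The shared variable `?pub` is what joins the two patterns, which is the same mechanism a SELECT query with two triple patterns relies on.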
  • 21. OpenAIRE as LOD
      • OA LOD in BETA version
      • Triples per entity
      • Online data: SPARQL endpoint
      • Offline data: RDF dump
      • Entities and URIs (interactive browsing)
      • Dereferenceable URIs for all entities
    http://beta.lod.openaire.eu
  • 22. Steps:
      • Specify an RDF vocabulary
      • Specify terms and namespaces
      • Map the OA data model to an RDF data model
      • Map the OA data to a static RDF dump
      • Specify strategies to automate the RDF generation
    Data conforming to LOD best practices (main entities and linking entities) published in BETA, December 2015: http://beta.lod.openaire.eu/
  • 23. Sample query: number of projects per funding level
      SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
      FROM <test>
      FROM <relationsTest>
      WHERE {
        ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
           <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel
      }
      GROUP BY ?flevel
      ORDER BY ?count
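A query like the one on this slide is sent to an endpoint through the SPARQL Protocol, where a GET request carries the query text in the `query` parameter. The sketch below only builds that request URL and makes no network call. The endpoint path `/sparql` is an assumption (the slides name only the host), and `build_request_url` is a hypothetical helper.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumption: the "/sparql" path is hypothetical; the slides only give the host.
ENDPOINT = "http://beta.lod.openaire.eu/sparql"

# The sample query from the slide, counting projects per funding level.
QUERY = """\
SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel
}
GROUP BY ?flevel
ORDER BY ?count
"""


def build_request_url(endpoint: str, query: str) -> str:
    """Encode the query as the `query` parameter of a SPARQL Protocol GET request."""
    return endpoint + "?" + urlencode({"query": query})


url = build_request_url(ENDPOINT, QUERY)
print(url)
```

A client would typically request this URL with an `Accept` header such as `application/sparql-results+json` to receive the result table as JSON.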
  • 24. General architecture
    Figure components: OpenAIRE Metadata, OA Data Model, OA Vocabulary, RDFization, Interlinking, Deduplication & Inference, RDF Store, Apache Solr, LOD Client (http://beta.lod.openaire.eu), HTML browser (https://www.openaire.eu); outputs: HTML, RDF
  • 25. Interlinking OpenAIRE RDF Graph to LOD cloud
    Steps:
      • Identify datasets to be interlinked to
      • Select interlinking tools: LIMES, Silk
      • Test interlinking OA with DBLP and DBpedia
      • Evaluate resulting link sets
      • Specify strategy for interlinking in OA workflow
    Figure labels: OA LOD (http://beta.lod.openaire.eu/), Linked Open Data (LOD), target datasets DBLP, CiteSeer, CEUR, …
  • 26. OA LOD interlinking workflow: Preprocessing
      • Process all the dumps from candidate datasets
      • Prune useless metadata
      • Transform the metadata to key-value pairs (Hadoop key: ID, value: [properties])
      • Store in HDFS
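The pruning and key-value transformation can be sketched in plain Python, without Hadoop. This is only an illustration of the record shape (ID as key, list of properties as value); the field names, the `KEEP` set and the function `to_key_value` are hypothetical and do not describe the actual OpenAIRE job.

```python
# Hypothetical set of properties considered useful for interlinking.
KEEP = {"title", "author", "year"}


def to_key_value(record, id_field="id", keep=KEEP):
    """Turn one metadata record into (ID, [properties]), pruning unused or empty fields."""
    key = record[id_field]
    properties = sorted(
        (name, value) for name, value in record.items() if name in keep and value
    )
    return key, properties


# Made-up record; "crawl_date" is pruned because it is not in KEEP,
# and "year" is pruned because it is empty.
record = {
    "id": "example:record1",
    "title": "Some title",
    "author": "A. Author",
    "crawl_date": "2016-01-01",
    "year": "",
}

print(to_key_value(record))
```

In a Hadoop setting, each such pair would be emitted by a map step and written to HDFS; that part is omitted here.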
  • 27. Sample interlinking result
    The result of interlinking is a set of links between URIs from the source and target datasets (note: the DBLP dump is not complete):
      <http://lod.openaire...bde783> owl:sameAs <http://dblp.l3s.../BoissonnatN96>
      <http://lod.openaire...4f8964> owl:sameAs <http://dblp.l3s.../Shrobe96>
      <http://lod.openaire...27fea2> owl:sameAs <http://dblp.l3s.../X96c>
      <http://lod.openaire...f433b9> owl:sameAs <http://dblp.l3s.../LiroyG96>
  • 28. Ideas for LOD in Monitoring
    Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the link set to grow as well.
    Figure labels: OA LOD, Linked Open Data (LOD), target datasets DBLP, CiteSeer, CEUR, …
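One simple way to monitor this is to compare the link sets produced for two versions of the target dataset. The following is a minimal sketch under that assumption; the function name `linkset_growth` and the example URIs are made up for illustration.

```python
def linkset_growth(old_links, new_links):
    """Compare two link sets given as iterables of (source URI, target URI) pairs."""
    old, new = set(old_links), set(new_links)
    return {
        "added": len(new - old),      # links present only in the new version
        "removed": len(old - new),    # links that disappeared
        "net": len(new) - len(old),   # overall growth of the link set
    }


# Hypothetical owl:sameAs links for two versions of a target dataset.
links_v1 = [("oa:r1", "target:a"), ("oa:r2", "target:b")]
links_v2 = [("oa:r1", "target:a"), ("oa:r2", "target:b"), ("oa:r3", "target:c")]

print(linkset_growth(links_v1, links_v2))
```

A negative `net` value, or many removed links, would be a signal worth investigating, since the slide's expectation is that the link set grows with the target dataset.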
  • 29. Scientific events
      • Bootstrapping datasets for scientific events: CEUR-WS.org dataset, OpenResearch.org
      • Include events in the OA Data Model (Conference Object?)
      • Measure the quality of events:
          • Related funding and sponsoring
          • Continuity
          • Accepted project publications
          • Reputation of people
          • Location
          • Citation
          • …
  • 32. Example: What is the overall research output of a given project? (oav:produces and UNION are not working)
      PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
      PREFIX oav: <http://lod.openaire.eu/vocab/>
      PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
      SELECT ?x ?y WHERE {
        ?y a cerif:ResultEntity .
        { ?y oav:resultType 'dataset' } UNION { ?y oav:resultType 'publication' }
        ?x a cerif:Project .
        ?y cerif:linkToProject ?x
      } LIMIT 10
  • 33. Example: What organizations are more active than others w.r.t. projects?
      PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
      PREFIX oav: <http://lod.openaire.eu/vocab/>
      PREFIX foaf: <http://xmlns.com/foaf/0.1/>
      SELECT ?o WHERE {
        ?x oav:projectOrganization ?o .
        ?o a foaf:Organization .
        ?y oav:projectOrganization ?o2 .
        ?o2 a foaf:Organization .
        FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
      } LIMIT 10
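The query on this slide returns organizations that take part in at least two distinct projects. A ranking by activity can be sketched client-side by counting distinct projects per organization over (project, organization) pairs, such as those an `oav:projectOrganization` pattern would yield. The function name and the sample pairs below are assumptions for illustration only.

```python
from collections import defaultdict


def projects_per_organization(pairs):
    """Count distinct projects per organization, most active organizations first."""
    projects = defaultdict(set)
    for project, organization in pairs:
        projects[organization].add(project)
    # Sort by descending project count, then by name for a stable order.
    return sorted(
        ((org, len(ps)) for org, ps in projects.items()),
        key=lambda item: (-item[1], item[0]),
    )


# Hypothetical (project, organization) pairs; the duplicate pair is counted once.
pairs = [
    ("project:p1", "org:a"),
    ("project:p2", "org:a"),
    ("project:p2", "org:a"),
    ("project:p1", "org:b"),
]

print(projects_per_organization(pairs))
```

The same ranking could be expressed in SPARQL with COUNT and GROUP BY, as in the sample query about funding levels earlier in the slides.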
  • 34. Example: Which datasets have been published by a specific person who is involved in a given project?
      PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
      PREFIX oav: <http://lod.openaire.eu/vocab/>
      PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
      PREFIX dcterms: <http://purl.org/dc/terms/>
      PREFIX foaf: <http://xmlns.com/foaf/0.1/>
      SELECT ?y WHERE {
        ?p cerif:linksToPerson ?x .
        ?x a foaf:Person .
        ?x dcterms:creator ?y .
        ?y oav:resultType "dataset"
      } LIMIT 10
  • 35. Example: List the full names of all authors who have (co-)authored a publication in project P.
      PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
      PREFIX oav: <http://lod.openaire.eu/vocab/>
      PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
      PREFIX dcterms: <http://purl.org/dc/terms/>
      PREFIX foaf: <http://xmlns.com/foaf/0.1/>
      SELECT DISTINCT ?name WHERE {
        ?p cerif:linksToPerson ?x .
        ?x a foaf:Person .
        ?x foaf:name ?name .
        ?x dcterms:creator ?y .
        ?y oav:resultType "publication"
      } LIMIT 10

Editor's Notes

  • #2: UBONN together with ARC develops the LOD services for OA. CNR provides technical support for synchronizing content of the OpenAIRE Information Space with LOD services and vice versa
  • #8: The why!
  • #9: Reviewer: The OpenAIRE LOD itself has information about the subject of a paper or a dataset, which can be linked to subject classification schemes such as the ACM CCS. Furthermore, CiteSeer provides citation graphs of papers. We can thus offer peer reviewers a service that finds papers or datasets similar to the one under review. Researcher: A service similar to the one for peer reviewers explained above could be offered to authors. It could also show how researchers move in the community, e.g., to other organizations. Use case 7: Having access to the networks of a paper's authors and their organizations, and furthermore taking into account the events in which people participate, enables new indicators for measuring the quality and relevance of research that are not just based on counting citations.
  • #10: To be able to offer such services, you need to deal with diverse data formats… LOD helps solve such challenges easily…
  • #11: Explore synergies with and added value to related open content initiatives (e.g. in the Open Educational Resources) Find patterns to enrich the OpenAIRE information space by exploiting the enrichments inherited by third-party re-use of its LOD graph representation
  • #12: The how!
  • #14: Here, show how to map.
  • #17: First, let me tell you what the RDF data model is!
  • #18: The result of mapping
  • #23: Repeated just as a reminder
  • #29: (potential research paper, e.g., for the ESWC 2017 Linked Data Track; cf. http://2016.eswc-conferences.org/call-papers#3)