SlideShare a Scribd company logo
Achieving Time Effective Federated
Information from scalable RDF data using
SPARQL Queries

By
D.Teja Santosh, Assistant Professor
Computer Science and Engineering
GITAM University,
Rudraram, Hyderabad
01/20/14

NCACPR-14

1
Aim: To retrieve federated information from scalable RDF data using SPARQL
query as a “Global Service” in very less time.

Technologies Used:
RDF, SPARQL,OWL.

Novelty of the concept:
 Integrating heterogeneous data from different databases is shown using
a single RDF data file.
 Retrieving federated data using SPARQL query as a Global Web Service.

01/20/14

NCACPR-14

2
My LINKED IN Profile

01/20/14

NCACPR-14

3
My International Conf. Title

01/20/14

NCACPR-14

4
Web 2.0 Architecture

01/20/14

NCACPR-14

5
Issues in integrating two different web services

• Different platforms.
• Different languages/technologies.
• Individual XML data viewed as Tree that may lead to
inconsistent responses when integrated.
• Availability of web services.

01/20/14

NCACPR-14

6
Web 3.0 Architecture

01/20/14

NCACPR-14

7
How RDF overcomes this issue?
•

The RDF model is made up of triples: subject-predicate-object.

•

These triples are uniquely identified on the web through URI. [Like “PASSPORT
NUMBER” to uniquely identify a person across the real world].

•

This lets machines understand human knowledge statements. [Computer saying:
Oh!]

•

The RDF model is essentially the canonicalization of a (directed) graph, and so as
such has all the advantages (and generality) of structuring information using
graphs

•

Any number of author profile and corresponding conference title data (federation)
in combined format are linked and retrieved due to the query pointing to the
generic subject node(s).

01/20/14

NCACPR-14

8
RDF Data Graph

01/20/14

NCACPR-14

9
RDF Tabulator Screenshot

01/20/14

NCACPR-14

10
SPARQL

•

I call SPARQL as a test bed which makes us to have clear idea about the result
accuracy (as a Web 3.0 learner).

•

Queries RDF data. If your data is in RDF, then SPARQL can query it natively.

•

Implicit join syntax. SPARQL queries RDF graphs, which consist of various triples
expressing binary relations between resources. As all relationships are of a fixed
size and data lives in a single graph, SPARQL does not require explicit joins that
specify the relationship between differently structured data.

•

The SPARQL query above has a similar structure:
SELECT <variable list>
WHERE {<graph pattern> }

•

FROM is used as a Base URL of the RDF Triple Store.

01/20/14

NCACPR-14

11
SPARQL QUERY IN TWINKLE

FEDERATED INFORMATION OF LINKEDIN AUTHOR AND ICHCI INTL.
CONF. TITLE OF THE SAME PERSON DATA
01/20/14

NCACPR-14

12
SPARQL QUERY PROCESSING
•

SPARQL queries are executed against RDF datasets, consisting of RDF graphs.

•

A SPARQL endpoint accepts queries and returns results via HTTP.

•

SPARQL endpoints will query any Web-accessible RDF data.

•

The results of SPARQL queries can be returned and/or rendered in a variety of formats:
–
–
–
–
–

01/20/14

XML
JSON
RDF
HTML
CSV

NCACPR-14

13
ANALYSIS OF RDF DATA GRAPH FOR FEDERATED QUERY
I. SCALABLE TRIPLES VISUALIZATION

01/20/14

NCACPR-14

14
II. RESPONSE TIME OF SPARQL QUERY ON ‘n’ (VIRTUAL) OBJECTS

01/20/14

NCACPR-14

15
SELECTIVITY:
•
•
•
•

sel(t) = sel(s) * sel(p) * sel(o)
sel(s) = 1/R, R - No. of Resources.
sel(p) = Tp/T, T – Total No. of triples, Tp – Triples matching predicate p.
sel(o) = hc(p,oc)/Tp, where (p,oc) represents the class of the histogram for predicate p in which
object o falls.

INFERENCE:
When Selectivity parameter is used for analysis of SPARQL query, accessing the data using subject
is encouraged when federated response is required.

01/20/14

NCACPR-14

16
REFERENCES
[1] RDF and SOA by David Booth, Ph.D., HP Software.
[2] TechnicaLeeSpeaking: Software designs, implementations, solutions, and
musings by Lee Feigenbaum.
[3] Frank Manola, Eric Miller, W3C, RDF Primer.
[4] David Booth, W3C Fellow / Hewlett-Packard, Hugo Haas, W3C Francis
McCabe, Fujitsu Labs of America, Eric Newcomer (until October 2003),
Iona Michael Champion (until March 2003), Software AG, Chris Ferris
(until March 2003), IBM, David Orchard (until March 2003), BEA Systems,
Web Services Architecture.
[5] Eric Prud'hommeaux, W3C ,Andy Seaborne, Hewlett-Packard
Laboratories, Bristol, SPARQL Query Language for RDF,W3C
Recommendation 15 January 2008.
[6] Lesley Charles, November 23, 2009, SPARQL Query Optimization.
[7] A. Bernstein, M. Stocker, and C. Kiefer. SPARQL Query Optimization Using
Selectivity Estimation. InPoster Proceedings of the 6th International
Semantic Web Conference (ISWC), Lecture Notes in Computer
Science.Springer, 2007.
01/20/14

NCACPR-14

17

More Related Content

What's hot (20)

PPT
Scalable Data Analysis in R -- Lee Edlefsen
Revolution Analytics
 
PDF
The RDF Report Card: Beyond the Triple Count
Leigh Dodds
 
PDF
Managing RDF data with graph databases
Graph-TA
 
PPTX
LD4KD 2015 - Demos and tools
Vrije Universiteit Amsterdam
 
PPTX
Deriving an Emergent Relational Schema from RDF Data
Graph-TA
 
PPTX
RDF Graph Data Management in Oracle Database and NoSQL Platforms
Graph-TA
 
PDF
FAIRness through a novel combination of Web technologies
Research Data Alliance
 
PPT
A hint of_mint
Peter Sefton
 
PDF
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
National Institute of Informatics
 
PPT
Talis Platform: A Linked Data Engine
Leigh Dodds
 
PPTX
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
Stuart Chalk
 
PPTX
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Stuart Chalk
 
PDF
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
Jane Frazier
 
PDF
Semantic Web Technology
Rathachai Chawuthai
 
PPTX
The CIARD RINGValeri
CIARD Movement
 
PPTX
Unit 3
Piyush Rochwani
 
PPTX
Hai huang presentation
hai huang
 
PDF
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
Open Knowledge Belgium
 
PDF
useR! 2012 Talk
rtelmore
 
PPTX
247th ACS Meeting: The Eureka Research Workbench
Stuart Chalk
 
Scalable Data Analysis in R -- Lee Edlefsen
Revolution Analytics
 
The RDF Report Card: Beyond the Triple Count
Leigh Dodds
 
Managing RDF data with graph databases
Graph-TA
 
LD4KD 2015 - Demos and tools
Vrije Universiteit Amsterdam
 
Deriving an Emergent Relational Schema from RDF Data
Graph-TA
 
RDF Graph Data Management in Oracle Database and NoSQL Platforms
Graph-TA
 
FAIRness through a novel combination of Web technologies
Research Data Alliance
 
A hint of_mint
Peter Sefton
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
National Institute of Informatics
 
Talis Platform: A Linked Data Engine
Leigh Dodds
 
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
Stuart Chalk
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Stuart Chalk
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
Jane Frazier
 
Semantic Web Technology
Rathachai Chawuthai
 
The CIARD RINGValeri
CIARD Movement
 
Hai huang presentation
hai huang
 
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
Open Knowledge Belgium
 
useR! 2012 Talk
rtelmore
 
247th ACS Meeting: The Eureka Research Workbench
Stuart Chalk
 

Similar to Achieving time effective federated information from scalable rdf data using sparql queries (20)

PDF
Sparql semantic information retrieval by
IJNSA Journal
 
PDF
Translation of Relational and Non-Relational Databases into RDF with xR2RML
Franck Michel
 
PPTX
Transient and persistent RDF views over relational databases in the context o...
Nikolaos Konstantinou
 
PDF
SPARQL: SEMANTIC INFORMATION RETRIEVAL BY EMBEDDING PREPOSITIONS
IJNSA Journal
 
PDF
semanticweb
Kevin Hutt
 
PPT
Sparql
Serge Garlatti
 
PDF
RDF and Java
Constantin Stan
 
PDF
SemFacet paper
DBOnto
 
PDF
Sem facet paper
DBOnto
 
PPT
Analysis on semantic web layer cake entities
తేజ దండిభట్ల
 
PPT
Re-using Media on the Web: Media fragment re-mixing and playout
MediaMixerCommunity
 
PPTX
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
Ontotext
 
PDF
final_copy_camera_ready_paper (7)
Ankit Rathi
 
PDF
Modern PHP RDF toolkits: a comparative study
Marius Butuc
 
PDF
Comparative study on the processing of RDF in PHP
MSGUNC
 
PDF
Web Spa
Constantin Stan
 
PDF
Strata NYC 2015 - Supercharging R with Apache Spark
Databricks
 
PDF
Matching and merging anonymous terms from web sources
IJwest
 
PDF
Short Report Bridges performance gap between Relational and RDF
Akram Abbasi
 
PDF
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Fabrizio Orlandi
 
Sparql semantic information retrieval by
IJNSA Journal
 
Translation of Relational and Non-Relational Databases into RDF with xR2RML
Franck Michel
 
Transient and persistent RDF views over relational databases in the context o...
Nikolaos Konstantinou
 
SPARQL: SEMANTIC INFORMATION RETRIEVAL BY EMBEDDING PREPOSITIONS
IJNSA Journal
 
semanticweb
Kevin Hutt
 
RDF and Java
Constantin Stan
 
SemFacet paper
DBOnto
 
Sem facet paper
DBOnto
 
Analysis on semantic web layer cake entities
తేజ దండిభట్ల
 
Re-using Media on the Web: Media fragment re-mixing and playout
MediaMixerCommunity
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
Ontotext
 
final_copy_camera_ready_paper (7)
Ankit Rathi
 
Modern PHP RDF toolkits: a comparative study
Marius Butuc
 
Comparative study on the processing of RDF in PHP
MSGUNC
 
Strata NYC 2015 - Supercharging R with Apache Spark
Databricks
 
Matching and merging anonymous terms from web sources
IJwest
 
Short Report Bridges performance gap between Relational and RDF
Akram Abbasi
 
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Fabrizio Orlandi
 
Ad

Recently uploaded (20)

PDF
Governor Josh Stein letter to NC delegation of U.S. House
Mebane Rash
 
PDF
Aprendendo Arquitetura Framework Salesforce - Dia 03
Mauricio Alexandre Silva
 
PPTX
Universal immunization Programme (UIP).pptx
Vishal Chanalia
 
PPTX
Cultivation practice of Litchi in Nepal.pptx
UmeshTimilsina1
 
PDF
Council of Chalcedon Re-Examined
Smiling Lungs
 
PPTX
Difference between write and update in odoo 18
Celine George
 
PPTX
TRANSLATIONAL AND ROTATIONAL MOTION.pptx
KIPAIZAGABAWA1
 
PDF
Exploring the Different Types of Experimental Research
Thelma Villaflores
 
PPTX
DIGITAL CITIZENSHIP TOPIC TLE 8 MATATAG CURRICULUM
ROBERTAUGUSTINEFRANC
 
PDF
Knee Extensor Mechanism Injuries - Orthopedic Radiologic Imaging
Sean M. Fox
 
PPTX
Identifying elements in the story. Arrange the events in the story
geraldineamahido2
 
PDF
Stokey: A Jewish Village by Rachel Kolsky
History of Stoke Newington
 
PDF
Week 2 - Irish Natural Heritage Powerpoint.pdf
swainealan
 
PPTX
PPT-Q1-WEEK-3-SCIENCE-ERevised Matatag Grade 3.pptx
reijhongidayawan02
 
PDF
Characteristics, Strengths and Weaknesses of Quantitative Research.pdf
Thelma Villaflores
 
PPTX
Nitrogen rule, ring rule, mc lafferty.pptx
nbisen2001
 
PPTX
HUMAN RESOURCE MANAGEMENT: RECRUITMENT, SELECTION, PLACEMENT, DEPLOYMENT, TRA...
PRADEEP ABOTHU
 
PPTX
Introduction to Indian Writing in English
Trushali Dodiya
 
PDF
Women's Health: Essential Tips for Every Stage.pdf
Iftikhar Ahmed
 
PPTX
How to Send Email From Odoo 18 Website - Odoo Slides
Celine George
 
Governor Josh Stein letter to NC delegation of U.S. House
Mebane Rash
 
Aprendendo Arquitetura Framework Salesforce - Dia 03
Mauricio Alexandre Silva
 
Universal immunization Programme (UIP).pptx
Vishal Chanalia
 
Cultivation practice of Litchi in Nepal.pptx
UmeshTimilsina1
 
Council of Chalcedon Re-Examined
Smiling Lungs
 
Difference between write and update in odoo 18
Celine George
 
TRANSLATIONAL AND ROTATIONAL MOTION.pptx
KIPAIZAGABAWA1
 
Exploring the Different Types of Experimental Research
Thelma Villaflores
 
DIGITAL CITIZENSHIP TOPIC TLE 8 MATATAG CURRICULUM
ROBERTAUGUSTINEFRANC
 
Knee Extensor Mechanism Injuries - Orthopedic Radiologic Imaging
Sean M. Fox
 
Identifying elements in the story. Arrange the events in the story
geraldineamahido2
 
Stokey: A Jewish Village by Rachel Kolsky
History of Stoke Newington
 
Week 2 - Irish Natural Heritage Powerpoint.pdf
swainealan
 
PPT-Q1-WEEK-3-SCIENCE-ERevised Matatag Grade 3.pptx
reijhongidayawan02
 
Characteristics, Strengths and Weaknesses of Quantitative Research.pdf
Thelma Villaflores
 
Nitrogen rule, ring rule, mc lafferty.pptx
nbisen2001
 
HUMAN RESOURCE MANAGEMENT: RECRUITMENT, SELECTION, PLACEMENT, DEPLOYMENT, TRA...
PRADEEP ABOTHU
 
Introduction to Indian Writing in English
Trushali Dodiya
 
Women's Health: Essential Tips for Every Stage.pdf
Iftikhar Ahmed
 
How to Send Email From Odoo 18 Website - Odoo Slides
Celine George
 
Ad

Achieving time effective federated information from scalable rdf data using sparql queries

  • 1. Achieving Time Effective Federated Information from scalable RDF data using SPARQL Queries By D.Teja Santosh, Assistant Professor Computer Science and Engineering GITAM University, Rudraram, Hyderabad 01/20/14 NCACPR-14 1
  • 2. Aim: To retrieve federated information from scalable RDF data using SPARQL query as a “Global Service” in very less time. Technologies Used: RDF, SPARQL,OWL. Novelty of the concept:  Integrating heterogeneous data from different databases is shown using a single RDF data file.  Retrieving federated data using SPARQL query as a Global Web Service. 01/20/14 NCACPR-14 2
  • 3. My LINKED IN Profile 01/20/14 NCACPR-14 3
  • 4. My International Conf. Title 01/20/14 NCACPR-14 4
  • 6. Issues in integrating two different web services • Different platforms. • Different languages/technologies. • Individual XML data viewed as Tree that may lead to inconsistent responses when integrated. • Availability of web services. 01/20/14 NCACPR-14 6
  • 8. How RDF overcomes this issue? • The RDF model is made up of triples: subject-predicate-object. • These triples are uniquely identified on the web through URI. [Like “PASSPORT NUMBER” to uniquely identify a person across the real world]. • This lets machines understand human knowledge statements. [Computer saying: Oh!] • The RDF model is essentially the canonicalization of a (directed) graph, and so as such has all the advantages (and generality) of structuring information using graphs • Any number of author profile and corresponding conference title data (federation) in combined format are linked and retrieved due to the query pointing to the generic subject node(s). 01/20/14 NCACPR-14 8
  • 11. SPARQL • I call SPARQL as a test bed which makes us to have clear idea about the result accuracy (as a Web 3.0 learner). • Queries RDF data. If your data is in RDF, then SPARQL can query it natively. • Implicit join syntax. SPARQL queries RDF graphs, which consist of various triples expressing binary relations between resources. As all relationships are of a fixed size and data lives in a single graph, SPARQL does not require explicit joins that specify the relationship between differently structured data. • The SPARQL query above has a similar structure: SELECT <variable list> WHERE {<graph pattern> } • FROM is used as a Base URL of the RDF Triple Store. 01/20/14 NCACPR-14 11
  • 12. SPARQL QUERY IN TWINKLE FEDERATED INFORMATION OF LINKEDIN AUTHOR AND ICHCI INTL. CONF. TITLE OF THE SAME PERSON DATA 01/20/14 NCACPR-14 12
  • 13. SPARQL QUERY PROCESSING • SPARQL queries are executed against RDF datasets, consisting of RDF graphs. • A SPARQL endpoint accepts queries and returns results via HTTP. • SPARQL endpoints will query any Web-accessible RDF data. • The results of SPARQL queries can be returned and/or rendered in a variety of formats: – – – – – 01/20/14 XML JSON RDF HTML CSV NCACPR-14 13
  • 14. ANALYSIS OF RDF DATA GRAPH FOR FEDERATED QUERY I. SCALABLE TRIPLES VISUALIZATION 01/20/14 NCACPR-14 14
  • 15. II. RESPONSE TIME OF SPARQL QUERY ON ‘n’ (VIRTUAL) OBJECTS 01/20/14 NCACPR-14 15
  • 16. SELECTIVITY: • • • • sel(t) = sel(s) * sel(p) * sel(o) sel(s) = 1/R, R - No. of Resources. sel(p) = Tp/T, T – Total No. of triples, Tp – Triples matching predicate p. sel(o) = hc(p,oc)/Tp, where (p,oc) represents the class of the histogram for predicate p in which object o falls. INFERENCE: When Selectivity parameter is used for analysis of SPARQL query, accessing the data using subject is encouraged when federated response is required. 01/20/14 NCACPR-14 16
  • 17. REFERENCES [1] RDF and SOA by David Booth, Ph.D., HP Software. [2] TechnicaLeeSpeaking: Software designs, implementations, solutions, and musings by Lee Feigenbaum. [3] Frank Manola, Eric Miller, W3C, RDF Primer. [4] David Booth, W3C Fellow / Hewlett-Packard, Hugo Haas, W3C Francis McCabe, Fujitsu Labs of America, Eric Newcomer (until October 2003), Iona Michael Champion (until March 2003), Software AG, Chris Ferris (until March 2003), IBM, David Orchard (until March 2003), BEA Systems, Web Services Architecture. [5] Eric Prud'hommeaux, W3C ,Andy Seaborne, Hewlett-Packard Laboratories, Bristol, SPARQL Query Language for RDF,W3C Recommendation 15 January 2008. [6] Lesley Charles, November 23, 2009, SPARQL Query Optimization. [7] A. Bernstein, M. Stocker, and C. Kiefer. SPARQL Query Optimization Using Selectivity Estimation. InPoster Proceedings of the 6th International Semantic Web Conference (ISWC), Lecture Notes in Computer Science.Springer, 2007. 01/20/14 NCACPR-14 17