SlideShare a Scribd company logo
Ravish Bhagdev



The Invisible Web of

Linked Data
WWW     Web of      Our Role
      Linked Data
World Wide Web



                 HTTP, URL, HTML
WWW as we know

                                   A global repository of
           Web Page X              interconnected documents
   Text Text Text Text Text Text
   Link Text Describing Page Y     Documents linked by hyperlinks
   Text Text Text Text Text Text
   Text Text Text Text Text Text   – Implicit meaning assigned to links
                                     with a few words or pictures

                                   – Anyone can link a new page to any
                                     other available page
          Web Page Y
                                   – Links can also be made to specific
  Text Text Text Text Text Text      sections of a page (anchor tags)
  Text Text Text Text Text Text
  Text Text Text Text Text Text
  Text Text Text Text Text Text
WWW as we know
Search Engines and power of links


                      •   Use links between web-pages to rank
                          results

                      •   Each search performed results in
                          generation of a new page with more
                          hyperlinks

                      •   Click-through relevance information

                      •   Redundancy of information

                      •   Focus on the most useful and relevant page

                      •   Freshness

                      •   Links are the key
Web of Linked Data



            Linking more than just pages
Structured vs Unstructured Data
      Structured           Simply defined as
                              Data that can be parsed
   XML        DB              and processed by
                              machines to automate
                              operations like matching,
   CSV       RDF              classification, querying etc.

     Unstructured
                              Data that cannot be parsed
  HTML       Doc              in this manner without loss
                              of information or
                              requirement of input from
   PDF       PPT              external agents
Semi-Structured Data

                       Most however is semi-
   XML       DB        structured
                         – HTML (HEAD, TITLE, BODY etc)
   CSV      RDF          – Email (Header vs Body)
                         – DBs with free text fields
                         – Forms with descriptive fields
                         – CSVs, Spreadsheets
  HTML       Doc         – Some more structured than
                           others
                         – Not possible to express
   PDF       PPT           everything in structured form
Web: Giant Data Shredder




                           Web Page
     DBs                    (HTML)
Creating Links at Level of Entities

                                      Requires Semantic Markup
                                        – Uniquely Identify Entities in a
               Name:
              <String>                    Document (URIs)
             John Smith
                                        – Make Relations Between
                                          Entities Explicit
                                        – Shared Vocabulary
              Person:
              <Person>                    (schema.org, FOAF, SKOS,
             Person3456                   GoodRelations etc.)
Residence:                  Age:
<Country>                 <Integer>     – Reuse existing vocabularies
   UK                        24           instead of inventing new ones
                                        – Both with-in the same
                                          document and with other
                                          Entities in other Documents
Integrated Data Across Applications



   Where should I go on vacation?
                                                                         How do I get the best fare?
      what
                                          What is it like there?
                                                                                how
                                                                                           Travel
     Travel                                where
                                                                                           Services
     Interests
     (FB)           who              Places
                                     to go                                     Where should I stay?
              People who             (lonely planet)           what
              have been there
              (foursquare)                                 What do I need to know about it?
                                                   Photos, blogs, news stories
How Search Engines Use Linked Data
How Search Engines Use Linked Data
Our Role



 Why I think we should engage in Linked Data Initivatives
We Build Web Apps
» That publish content

» We want the content to be

    » Visible to next generation apps

    » Ranked High by Search Engines

    » Presented clearly and unambiguously
      (think price comparision websites)

    » All the big players are doing it

    » Including most governments
The Visible Web

Is Dominated by Marketing

   – The web is seen through the lense of Search Engines and Social
     Networks

   – Not every relevant page can appear on first page of search results

   – Paid, Targetted advertising

   – Affiliate Programs and Contracts

   – Popularity overrides quality of matches

   – What is popular is decided by sites that are already popular

   – Visible web is more and more biased
Emerging marketing tactics, circa 2010
Creating Linked Data (How?)

» Implicitly

» Integrating Linked Data Standards with
  Publishing tools (CMS!), apps, gadgets,
                                                            Entity P
                                             • Entity A                • Entity X
  social networks etc.
                                             • Entity B   • Entity Q   • Entity Y
» Minimize the effort while maximizing                    • Entity R
  the return
                                               Entity C                  Entity Z

» Every time you add a new friend on FB
  or follow someone on twitter, you create
  linked data
Linked Data: How it is changing the way data is published and accessed on web
Linking of information creates
 Insights
                                  Enterprise Search & KM
                                  Linked Data is even more relevant in the
                                  context of enterprise search because
Raw Data: symbols and chars       these data don’t even have simple
                                  hyperlinks
Informaton: Data in usable form

Knowledge: Information Enriched   Popularity of a document is not always
with Semantics                    the most important factor
Wisdom: Understanding,
Hindsight, Experience             Large companies struggle to make use of
                                  knowledge and information across
                                  departments and silos

                                  Timly linking of data across these silos
                                  can have a big impact
?             Questions?
                 @RavBhagdev




Theres more: Information Extraction, Social Search, Knowledge Capture etc.
What’s Your Message?
Thanks!

More Related Content

PPT
The Semantic Web
PPT
MyLifeBits van Microsoft
PPT
Semantic Web
PPTX
Social Networks and the Semantic Web: a retrospective of the past 10 years
PPTX
Social Semantic Web on Facebook Open Graph protocol and Twitter Annotations
PPTX
Semantic Web, e-commerce
PPTX
Social Features of SharePoint 2013: Enhancing Productivity
PDF
Web 3.0 Intro
The Semantic Web
MyLifeBits van Microsoft
Semantic Web
Social Networks and the Semantic Web: a retrospective of the past 10 years
Social Semantic Web on Facebook Open Graph protocol and Twitter Annotations
Semantic Web, e-commerce
Social Features of SharePoint 2013: Enhancing Productivity
Web 3.0 Intro

Viewers also liked (6)

PPT
Percentages amounts
DOC
Buy online e cigs 2
PPT
What Is The Capstone Program
DOC
TehtäVäMoniste
PDF
Nyc map town
PPTX
Whathaveyoulearntabouttechnologyfromthe final
Percentages amounts
Buy online e cigs 2
What Is The Capstone Program
TehtäVäMoniste
Nyc map town
Whathaveyoulearntabouttechnologyfromthe final
Ad

Similar to Linked Data: How it is changing the way data is published and accessed on web (20)

PDF
Linked data and the future of scientific publishing
PDF
What is the Semantic Web
PPTX
The Semantic Web #1 - Overview
PDF
Introduction to Semantic Web
PDF
Sharing data on the web (2013)
PPTX
Social Semantic Web (Social Activity and Facebook)
PDF
WordLift 2.0 presented on the Semantic Web Meetup in Rome
PDF
Питер Мика "Making the web searchable"
PPTX
Linked data in the digital humanities skills workshop for realising the oppo...
PDF
WordLift 2.0 (presentation for the IKS annual review in Saarbrücken)
PPTX
NCompass Live: Linked Data and Libraries: What? Why? How?
PDF
Web 3.0: The Upcoming Revolution
PDF
Is linked data something for me?
PDF
Applying Semantic Extensions And New Services To Drupal Sem Tech June 2010
PPT
E learning getting started with online learning reduced for uploading
PDF
Bernadette Hyland SemTech 2011 West - Linked Data Cookbook
PPTX
Linked data for Libraries, Archives, Museums
PDF
Content Used to be King: The Semantic Web in Education
PDF
Linked Data on the Web
PDF
GoodRelations Tutorial Part 1
Linked data and the future of scientific publishing
What is the Semantic Web
The Semantic Web #1 - Overview
Introduction to Semantic Web
Sharing data on the web (2013)
Social Semantic Web (Social Activity and Facebook)
WordLift 2.0 presented on the Semantic Web Meetup in Rome
Питер Мика "Making the web searchable"
Linked data in the digital humanities skills workshop for realising the oppo...
WordLift 2.0 (presentation for the IKS annual review in Saarbrücken)
NCompass Live: Linked Data and Libraries: What? Why? How?
Web 3.0: The Upcoming Revolution
Is linked data something for me?
Applying Semantic Extensions And New Services To Drupal Sem Tech June 2010
E learning getting started with online learning reduced for uploading
Bernadette Hyland SemTech 2011 West - Linked Data Cookbook
Linked data for Libraries, Archives, Museums
Content Used to be King: The Semantic Web in Education
Linked Data on the Web
GoodRelations Tutorial Part 1
Ad

Recently uploaded (20)

PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PPTX
Spectroscopy.pptx food analysis technology
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Cloud computing and distributed systems.
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
cuic standard and advanced reporting.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
madgavkar20181017ppt McKinsey Presentation.pdf
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
Spectroscopy.pptx food analysis technology
Reach Out and Touch Someone: Haptics and Empathic Computing
The AUB Centre for AI in Media Proposal.docx
GamePlan Trading System Review: Professional Trader's Honest Take
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Machine learning based COVID-19 study performance prediction
Big Data Technologies - Introduction.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Understanding_Digital_Forensics_Presentation.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Cloud computing and distributed systems.
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Chapter 3 Spatial Domain Image Processing.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
cuic standard and advanced reporting.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
madgavkar20181017ppt McKinsey Presentation.pdf
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....

Linked Data: How it is changing the way data is published and accessed on web

  • 1. Ravish Bhagdev The Invisible Web of Linked Data
  • 2. WWW Web of Our Role Linked Data
  • 3. World Wide Web HTTP, URL, HTML
  • 4. WWW as we know A global repository of Web Page X interconnected documents Text Text Text Text Text Text Link Text Describing Page Y Documents linked by hyperlinks Text Text Text Text Text Text Text Text Text Text Text Text – Implicit meaning assigned to links with a few words or pictures – Anyone can link a new page to any other available page Web Page Y – Links can also be made to specific Text Text Text Text Text Text sections of a page (anchor tags) Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text
  • 5. WWW as we know
  • 6. Search Engines and power of links • Use links between web-pages to rank results • Each search performed results in generation of a new page with more hyperlinks • Click-through relevance information • Redundancy of information • Focus on the most useful and relevant page • Freshness • Links are the key
  • 7. Web of Linked Data Linking more than just pages
  • 8. Structured vs Unstructured Data Structured Simply defined as Data that can be parsed XML DB and processed by machines to automate operations like matching, CSV RDF classification, querying etc. Unstructured Data that cannot be parsed HTML Doc in this manner without loss of information or requirement of input from PDF PPT external agents
  • 9. Semi-Structured Data Most however is semi- XML DB structured – HTML (HEAD, TITLE, BODY etc) CSV RDF – Email (Header vs Body) – DBs with free text fields – Forms with descriptive fields – CSVs, Spreadsheets HTML Doc – Some more structured than others – Not possible to express PDF PPT everything in structured form
  • 10. Web: Giant Data Shredder Web Page DBs (HTML)
  • 11. Creating Links at Level of Entities Requires Semantic Markup – Uniquely Identify Entities in a Name: <String> Document (URIs) John Smith – Make Relations Between Entities Explicit – Shared Vocabulary Person: <Person> (schema.org, FOAF, SKOS, Person3456 GoodRelations etc.) Residence: Age: <Country> <Integer> – Reuse existing vocabularies UK 24 instead of inventing new ones – Both with-in the same document and with other Entities in other Documents
  • 12. Integrated Data Across Applications  Where should I go on vacation?  How do I get the best fare? what  What is it like there? how Travel Travel where Services Interests (FB) who Places to go  Where should I stay? People who (lonely planet) what have been there (foursquare)  What do I need to know about it? Photos, blogs, news stories
  • 13. How Search Engines Use Linked Data
  • 14. How Search Engines Use Linked Data
  • 15. Our Role Why I think we should engage in Linked Data Initivatives
  • 16. We Build Web Apps » That publish content » We want the content to be » Visible to next generation apps » Ranked High by Search Engines » Presented clearly and unambiguously (think price comparision websites) » All the big players are doing it » Including most governments
  • 17. The Visible Web Is Dominated by Marketing – The web is seen through the lense of Search Engines and Social Networks – Not every relevant page can appear on first page of search results – Paid, Targetted advertising – Affiliate Programs and Contracts – Popularity overrides quality of matches – What is popular is decided by sites that are already popular – Visible web is more and more biased
  • 19. Creating Linked Data (How?) » Implicitly » Integrating Linked Data Standards with Publishing tools (CMS!), apps, gadgets, Entity P • Entity A • Entity X social networks etc. • Entity B • Entity Q • Entity Y » Minimize the effort while maximizing • Entity R the return Entity C Entity Z » Every time you add a new friend on FB or follow someone on twitter, you create linked data
  • 21. Linking of information creates Insights Enterprise Search & KM Linked Data is even more relevant in the context of enterprise search because Raw Data: symbols and chars these data don’t even have simple hyperlinks Informaton: Data in usable form Knowledge: Information Enriched Popularity of a document is not always with Semantics the most important factor Wisdom: Understanding, Hindsight, Experience Large companies struggle to make use of knowledge and information across departments and silos Timly linking of data across these silos can have a big impact
  • 22. ? Questions? @RavBhagdev Theres more: Information Extraction, Social Search, Knowledge Capture etc.