SlideShare a Scribd company logo
Ravish Bhagdev



The Invisible Web of

Linked Data
WWW     Web of      Our Role
      Linked Data
World Wide Web



                 HTTP, URL, HTML
WWW as we know

                                   A global repository of
           Web Page X              interconnected documents
   Text Text Text Text Text Text
   Link Text Describing Page Y     Documents linked by hyperlinks
   Text Text Text Text Text Text
   Text Text Text Text Text Text   – Implicit meaning assigned to links
                                     with a few words or pictures

                                   – Anyone can link a new page to any
                                     other available page
          Web Page Y
                                   – Links can also be made to specific
  Text Text Text Text Text Text      sections of a page (anchor tags)
  Text Text Text Text Text Text
  Text Text Text Text Text Text
  Text Text Text Text Text Text
WWW as we know
Search Engines and power of links


                      •   Use links between web-pages to rank
                          results

                      •   Each search performed results in
                          generation of a new page with more
                          hyperlinks

                      •   Click-through relevance information

                      •   Redundancy of information

                      •   Focus on the most useful and relevant page

                      •   Freshness

                      •   Links are the key
Web of Linked Data



            Linking more than just pages
Structured vs Unstructured Data
      Structured           Simply defined as
                              Data that can be parsed
   XML        DB              and processed by
                              machines to automate
                              operations like matching,
   CSV       RDF              classification, querying etc.

     Unstructured
                              Data that cannot be parsed
  HTML       Doc              in this manner without loss
                              of information or
                              requirement of input from
   PDF       PPT              external agents
Semi-Structured Data

                       Most however is semi-
   XML       DB        structured
                         – HTML (HEAD, TITLE, BODY etc)
   CSV      RDF          – Email (Header vs Body)
                         – DBs with free text fields
                         – Forms with descriptive fields
                         – CSVs, Spreadsheets
  HTML       Doc         – Some more structured than
                           others
                         – Not possible to express
   PDF       PPT           everything in structured form
Web: Giant Data Shredder




                           Web Page
     DBs                    (HTML)
Creating Links at Level of Entities

                                      Requires Semantic Markup
                                        – Uniquely Identify Entities in a
               Name:
              <String>                    Document (URIs)
             John Smith
                                        – Make Relations Between
                                          Entities Explicit
                                        – Shared Vocabulary
              Person:
              <Person>                    (schema.org, FOAF, SKOS,
             Person3456                   GoodRelations etc.)
Residence:                  Age:
<Country>                 <Integer>     – Reuse existing vocabularies
   UK                        24           instead of inventing new ones
                                        – Both with-in the same
                                          document and with other
                                          Entities in other Documents
Integrated Data Across Applications



   Where should I go on vacation?
                                                                         How do I get the best fare?
      what
                                          What is it like there?
                                                                                how
                                                                                           Travel
     Travel                                where
                                                                                           Services
     Interests
     (FB)           who              Places
                                     to go                                     Where should I stay?
              People who             (lonely planet)           what
              have been there
              (foursquare)                                 What do I need to know about it?
                                                   Photos, blogs, news stories
How Search Engines Use Linked Data
How Search Engines Use Linked Data
Our Role



 Why I think we should engage in Linked Data Initivatives
We Build Web Apps
» That publish content

» We want the content to be

    » Visible to next generation apps

    » Ranked High by Search Engines

    » Presented clearly and unambiguously
      (think price comparision websites)

    » All the big players are doing it

    » Including most governments
The Visible Web

Is Dominated by Marketing

   – The web is seen through the lense of Search Engines and Social
     Networks

   – Not every relevant page can appear on first page of search results

   – Paid, Targetted advertising

   – Affiliate Programs and Contracts

   – Popularity overrides quality of matches

   – What is popular is decided by sites that are already popular

   – Visible web is more and more biased
Emerging marketing tactics, circa 2010
Creating Linked Data (How?)

» Implicitly

» Integrating Linked Data Standards with
  Publishing tools (CMS!), apps, gadgets,
                                                            Entity P
                                             • Entity A                • Entity X
  social networks etc.
                                             • Entity B   • Entity Q   • Entity Y
» Minimize the effort while maximizing                    • Entity R
  the return
                                               Entity C                  Entity Z

» Every time you add a new friend on FB
  or follow someone on twitter, you create
  linked data
Linked Data: How it is changing the way data is published and accessed on web
Linking of information creates
 Insights
                                  Enterprise Search & KM
                                  Linked Data is even more relevant in the
                                  context of enterprise search because
Raw Data: symbols and chars       these data don’t even have simple
                                  hyperlinks
Informaton: Data in usable form

Knowledge: Information Enriched   Popularity of a document is not always
with Semantics                    the most important factor
Wisdom: Understanding,
Hindsight, Experience             Large companies struggle to make use of
                                  knowledge and information across
                                  departments and silos

                                  Timly linking of data across these silos
                                  can have a big impact
?             Questions?
                 @RavBhagdev




Theres more: Information Extraction, Social Search, Knowledge Capture etc.
What’s Your Message?
Thanks!

More Related Content

Viewers also liked (6)

PPT
Percentages amounts
Oliver Bowles
 
DOC
Buy online e cigs 2
milika8666
 
PPT
What Is The Capstone Program
Lonisha Howell
 
DOC
TehtäVäMoniste
guest70fecb
 
PDF
Nyc map town
Jeff Ryu
 
PPTX
Whathaveyoulearntabouttechnologyfromthe final
Emily236
 
Percentages amounts
Oliver Bowles
 
Buy online e cigs 2
milika8666
 
What Is The Capstone Program
Lonisha Howell
 
TehtäVäMoniste
guest70fecb
 
Nyc map town
Jeff Ryu
 
Whathaveyoulearntabouttechnologyfromthe final
Emily236
 

Similar to Linked Data: How it is changing the way data is published and accessed on web (20)

PPTX
Search engines
Anshuman Tyagi
 
PDF
Semantic Mapping and LOD prez
Carol Chiodo
 
PDF
Питер Мика "Making the web searchable"
Yandex
 
PPTX
(Keynote) Peter Mika - “Making the Web Searchable”
icwe2015
 
PPTX
Making the Web Searchable - Keynote ICWE 2015
Peter Mika
 
PPT
Publishing data on the Semantic Web
Peter Mika
 
PPTX
Smart data and branding
Larry Smith
 
PPTX
Introduction to Linked Data 1/5
Juan Sequeda
 
PPTX
Semantic mark-up with schema.org: helping search engines understand the Web
Peter Mika
 
PPT
Spivack Blogtalk 2008
Blogtalk 2008
 
PPTX
What is the Semantic Web
Juan Sequeda
 
PDF
Foaf Openid Milan
Dan Brickley
 
PPT
Koreacomm - Does Web 3.0 exist?
Jonathan Allen
 
PPTX
It19 20140721 linked data personal perspective
Janifer Gatenby
 
PPTX
Semantic Search keynote at CORIA 2015
Peter Mika
 
PPTX
Optimizing Your Practice for Online Visibility - CAMFT Presentation
Yo! Yo! SEO
 
PPT
DM110 - Week 10 - Semantic Web / Web 3.0
John Breslin
 
PPT
Nova Spivack - Semantic Web Talk
syawal
 
PPTX
Linked Data Integration and semantic web
Diego Pessoa
 
PPTX
Data Lakehouse Symposium | Day 2
Databricks
 
Search engines
Anshuman Tyagi
 
Semantic Mapping and LOD prez
Carol Chiodo
 
Питер Мика "Making the web searchable"
Yandex
 
(Keynote) Peter Mika - “Making the Web Searchable”
icwe2015
 
Making the Web Searchable - Keynote ICWE 2015
Peter Mika
 
Publishing data on the Semantic Web
Peter Mika
 
Smart data and branding
Larry Smith
 
Introduction to Linked Data 1/5
Juan Sequeda
 
Semantic mark-up with schema.org: helping search engines understand the Web
Peter Mika
 
Spivack Blogtalk 2008
Blogtalk 2008
 
What is the Semantic Web
Juan Sequeda
 
Foaf Openid Milan
Dan Brickley
 
Koreacomm - Does Web 3.0 exist?
Jonathan Allen
 
It19 20140721 linked data personal perspective
Janifer Gatenby
 
Semantic Search keynote at CORIA 2015
Peter Mika
 
Optimizing Your Practice for Online Visibility - CAMFT Presentation
Yo! Yo! SEO
 
DM110 - Week 10 - Semantic Web / Web 3.0
John Breslin
 
Nova Spivack - Semantic Web Talk
syawal
 
Linked Data Integration and semantic web
Diego Pessoa
 
Data Lakehouse Symposium | Day 2
Databricks
 
Ad

Recently uploaded (20)

PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PPTX
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
PDF
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Digital Circuits, important subject in CS
contactparinay1
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
Ad

Linked Data: How it is changing the way data is published and accessed on web

  • 1. Ravish Bhagdev The Invisible Web of Linked Data
  • 2. WWW Web of Our Role Linked Data
  • 3. World Wide Web HTTP, URL, HTML
  • 4. WWW as we know A global repository of Web Page X interconnected documents Text Text Text Text Text Text Link Text Describing Page Y Documents linked by hyperlinks Text Text Text Text Text Text Text Text Text Text Text Text – Implicit meaning assigned to links with a few words or pictures – Anyone can link a new page to any other available page Web Page Y – Links can also be made to specific Text Text Text Text Text Text sections of a page (anchor tags) Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text Text
  • 5. WWW as we know
  • 6. Search Engines and power of links • Use links between web-pages to rank results • Each search performed results in generation of a new page with more hyperlinks • Click-through relevance information • Redundancy of information • Focus on the most useful and relevant page • Freshness • Links are the key
  • 7. Web of Linked Data Linking more than just pages
  • 8. Structured vs Unstructured Data Structured Simply defined as Data that can be parsed XML DB and processed by machines to automate operations like matching, CSV RDF classification, querying etc. Unstructured Data that cannot be parsed HTML Doc in this manner without loss of information or requirement of input from PDF PPT external agents
  • 9. Semi-Structured Data Most however is semi- XML DB structured – HTML (HEAD, TITLE, BODY etc) CSV RDF – Email (Header vs Body) – DBs with free text fields – Forms with descriptive fields – CSVs, Spreadsheets HTML Doc – Some more structured than others – Not possible to express PDF PPT everything in structured form
  • 10. Web: Giant Data Shredder Web Page DBs (HTML)
  • 11. Creating Links at Level of Entities Requires Semantic Markup – Uniquely Identify Entities in a Name: <String> Document (URIs) John Smith – Make Relations Between Entities Explicit – Shared Vocabulary Person: <Person> (schema.org, FOAF, SKOS, Person3456 GoodRelations etc.) Residence: Age: <Country> <Integer> – Reuse existing vocabularies UK 24 instead of inventing new ones – Both with-in the same document and with other Entities in other Documents
  • 12. Integrated Data Across Applications  Where should I go on vacation?  How do I get the best fare? what  What is it like there? how Travel Travel where Services Interests (FB) who Places to go  Where should I stay? People who (lonely planet) what have been there (foursquare)  What do I need to know about it? Photos, blogs, news stories
  • 13. How Search Engines Use Linked Data
  • 14. How Search Engines Use Linked Data
  • 15. Our Role Why I think we should engage in Linked Data Initivatives
  • 16. We Build Web Apps » That publish content » We want the content to be » Visible to next generation apps » Ranked High by Search Engines » Presented clearly and unambiguously (think price comparision websites) » All the big players are doing it » Including most governments
  • 17. The Visible Web Is Dominated by Marketing – The web is seen through the lense of Search Engines and Social Networks – Not every relevant page can appear on first page of search results – Paid, Targetted advertising – Affiliate Programs and Contracts – Popularity overrides quality of matches – What is popular is decided by sites that are already popular – Visible web is more and more biased
  • 19. Creating Linked Data (How?) » Implicitly » Integrating Linked Data Standards with Publishing tools (CMS!), apps, gadgets, Entity P • Entity A • Entity X social networks etc. • Entity B • Entity Q • Entity Y » Minimize the effort while maximizing • Entity R the return Entity C Entity Z » Every time you add a new friend on FB or follow someone on twitter, you create linked data
  • 21. Linking of information creates Insights Enterprise Search & KM Linked Data is even more relevant in the context of enterprise search because Raw Data: symbols and chars these data don’t even have simple hyperlinks Informaton: Data in usable form Knowledge: Information Enriched Popularity of a document is not always with Semantics the most important factor Wisdom: Understanding, Hindsight, Experience Large companies struggle to make use of knowledge and information across departments and silos Timly linking of data across these silos can have a big impact
  • 22. ? Questions? @RavBhagdev Theres more: Information Extraction, Social Search, Knowledge Capture etc.