Aggregation Using Linked Data – LOCAH Project Experiences. 23rd June 2011, OAI7, Geneva, Switzerland. Adrian Stevenson, LOCAH Project Manager, UKOLN.
LOCAH Project: Linked Open Copac and Archives Hub. Funded by the #JiscEXPO 2/10 ‘Expose’ call. One-year project, started August 2010. Partners & Consultants: UKOLN – Adrian Stevenson, Julian Cheal; Mimas – Jane Stevenson, Bethan Ruddock, Yogesh Patel; Eduserv – Pete Johnston; Talis – Leigh Dodds, Tim Hodson; OCLC – Ralph LeVan, Thom Hickey; Ed Summers. http://blogs.ukoln.ac.uk/locah/ Tag: #locah
Archives Hub and Copac: UK National Data Services based at Mimas. Archives Hub is an aggregation of archival descriptions from archive repositories across the UK: http://archiveshub.ac.uk. Copac provides access to the merged catalogues of libraries throughout the UK, including all national libraries: http://copac.ac.uk
What is LOCAH Doing? Part 1: Exposing Archives Hub & Copac data as Linked Data. Part 2: Creating a prototype visualisation. Part 3: Reporting on opportunities and barriers.
We’re Aggregating. If something is identified, it can be linked to. We take items from one dataset and link them to items from other datasets: BBC, VIAF, DBpedia, Archives Hub, Copac, GeoNames.
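The idea of linking identified items across datasets can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the datasets are modelled as plain sets of (subject, predicate, object) tuples, every `example.org` URI is a made-up placeholder, and `owl:sameAs` is assumed as the linking property (the slides do not say which property LOCAH uses).

```python
# Illustrative sketch only: datasets are sets of (subject, predicate, object)
# tuples, and all example.org URIs are hypothetical placeholders.
SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"  # assumed linking property

hub = {
    ("http://example.org/hub/person/1", "name", "Example Person"),
    ("http://example.org/hub/person/1", SAME_AS, "http://example.org/viaf/42"),
}
viaf = {
    ("http://example.org/viaf/42", "birthYear", "1900"),
}


def describe(uri, *datasets):
    """Collect (predicate, object) facts about `uri`, following sameAs links."""
    seen, todo, facts = set(), [uri], set()
    while todo:
        current = todo.pop()
        if current in seen:
            continue  # avoid looping on circular sameAs links
        seen.add(current)
        for triples in datasets:
            for s, p, o in triples:
                if s != current:
                    continue
                if p == SAME_AS:
                    todo.append(o)  # the same thing, identified elsewhere
                else:
                    facts.add((p, o))
    return facts


merged = describe("http://example.org/hub/person/1", hub, viaf)
```

Because both datasets identify the person with a URI, the facts held in each can be gathered into one description without any dataset-specific code.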
Enhancing our data. Already have some links: Time - reference.data.gov.uk URIs; Location - UK Postcodes URIs and Ordnance Survey URIs; Names - Virtual International Authority File (VIAF), which matches and links widely-used authority files: http://viaf.org/; Names - DBpedia. Also looking at: Subjects - Library of Congress Subject Headings and DBpedia.
http://data.archiveshub.ac.uk/
‘Aggregates’ property points to http://www.openarchives.org/ore/terms/aggregates
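As a rough sketch of what an ‘aggregates’ statement looks like as data, the Python below builds a few triples using the ORE property URI from the slide and serialises them as N-Triples. The `example.org` subject and object URIs are hypothetical; which Archives Hub entities actually carry this property is not stated in the slides, so nothing here should be read as the project's real data model.

```python
# Sketch under assumptions: only the property URI comes from the slide;
# the aggregation and item URIs are invented placeholders.
ORE_AGGREGATES = "http://www.openarchives.org/ore/terms/aggregates"

triples = [
    ("http://example.org/aggregation/1", ORE_AGGREGATES, "http://example.org/item/a"),
    ("http://example.org/aggregation/1", ORE_AGGREGATES, "http://example.org/item/b"),
    ("http://example.org/aggregation/2", ORE_AGGREGATES, "http://example.org/item/c"),
]


def aggregated_by(aggregation, triples):
    """Return the sorted URIs that `aggregation` is stated to aggregate."""
    return sorted(
        o for s, p, o in triples if s == aggregation and p == ORE_AGGREGATES
    )


def to_ntriples(triples):
    """Serialise URI-only triples as N-Triples, one statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


members = aggregated_by("http://example.org/aggregation/1", triples)
serialised = to_ntriples(triples)
```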
 
Visualisation Prototype. Using Timemap – Google Maps and SIMILE: http://code.google.com/p/timemap/. Early stages with this. Will give location and ‘extent’ of archive. Will link through to Archives Hub.
 
BBC Music
APIs, Mashups and Linked Data. Mashups work against a fixed set of data sources; are hand crafted by humans; don’t integrate well. Linked Data promises an unbound global data space; easy dataset integration; generic ‘mesh-up’ tools.
Aggregation / Integration Challenges
 
 
 
Sustainability. Can you rely on data sources long-term? Ed Summers at the Library of Congress created http://lcsh.info, a Linked Data interface for LOC subject headings. People started using it.
Library of Congress Subject Headings
Scalability. Will the Web of Data scale? Example by Bradley Allen, Elsevier, at LOD LAM Summit, SF, USA.
Data Modelling: Complexity. Archival description is hierarchical and multi-level. Dirty data. Licensing: ‘Ownership’ of data. Hard to track attribution. CC0 for Archives Hub and Copac data.
Linked Data the Way for Aggregation? Enables ‘straightforward’ aggregation of a wide variety of data sources. New channels into your data services. Researchers are more likely to discover sources. ‘Hidden’ collections of repositories become part of the Web.
Questions for Discussion. Will using vocabularies and ontologies always be too difficult? Or will the tools appear, an ‘MS Access for Linked Data’? Will the Web of Data scale?
Questions if you’ve bought in: What constitutes data worth linking to? How to find datasets suitable for interlinking? How to make my dataset worth linking to? How to encourage others to link to my data? What is the added value of links? How to determine the quality of a link?
Attribution and CC License. Sections of this presentation adapted from materials created by other members of the LOCAH Project. This presentation is available under Creative Commons Non Commercial-Share Alike: http://creativecommons.org/licenses/by-nc/2.0/uk/


Editor's Notes

  • #4: Copac is a union catalogue. Both are successful JISC services that have been running for many years. LOCAH is a research project; it remains to be seen whether it will go into service with a Linked Data interface.
  • #6: In hypertext web sites it is considered generally rather bad etiquette not to link to related external material. The value of your own information is very much a function of what it links to, as well as the inherent value of the information within the web page. So it is also in the Semantic Web. Remember, this is about machines linking – machines need identifiers; humans generally know when something is a place or when it is a person. BBC + DBpedia + GeoNames + Archives Hub + Copac + VIAF = the Web as an exploratory space
  • #11: This is also an example of aggregating data
  • #12: More aggregation
  • #14: Data can be integrated from many different sources
  • #21: Ex
  • #22: “Lower level” units are interpreted in the context of the higher levels of description, and are arguably “incomplete” without that contextual data. Relations are asserted, e.g. member-of/component-of, but there is no requirement or expectation that data consumers will follow the links describing the relations.
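
The point about lower-level units needing their higher-level context can be illustrated with a small Python sketch. It assumes a hypothetical three-level description in which each unit carries a `member_of` link to its parent; the unit names, titles and the `member_of` key are invented for illustration and are not taken from the Archives Hub data.

```python
# Hypothetical multi-level description: each unit points at its parent
# through an assumed "member_of" link (None marks the top level).
units = {
    "collection": {"title": "Papers of an Example Family", "member_of": None},
    "series": {"title": "Correspondence", "member_of": "collection"},
    "item": {"title": "Letter, 1901", "member_of": "series"},
}


def context_path(unit_id, units):
    """Follow member_of links upwards and return titles from top level down."""
    path, seen = [], set()
    current = unit_id
    while current is not None:
        if current in seen:
            raise ValueError(f"circular member_of link at {current!r}")
        seen.add(current)
        path.append(units[current]["title"])
        current = units[current]["member_of"]
    return list(reversed(path))


item_context = context_path("item", units)
```

A consumer that reads only the item sees "Letter, 1901"; one that follows the links recovers the collection and series that give the letter its meaning.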