UKOLN is supported  by: Re-usable metadata, re-usable content Paul Walk Technical Manager [email_address] A centre of expertise in digital information management www.ukoln.ac.uk
harvesting, searching, syndicating options for metadata and content: the lines can be blurred search engines also harvest! your  metadata  may be my  content metadata content harvestable searchable ✓ ✓ syndicable ✓
being harvestable (1) Open Archives Initiative OAI-PMH repositories OAI-ORE aggregators: Intute Institutional Repository Search currently harvesting eprints metadata records from 88 institutions planning to explore the harvesting of metadata for: images learning objects other media..... MLA’s Discover Service your content is of interest to other domains
being harvestable (2) what is your metadata record actually going to point to? more than one item of content? a ‘jumping off’ page? is this consistent? what metadata format are you going to use? is it commonly supported? are you using it correctly? (you’d be surprised.....) where/how is your metadata going to be used? this is necessarily out of your control!
being searchable (1) exposing your content to search engines search engine optimisation (SEO) make it easy for the search engines have content people want make it eminently  linkable Google is your friend! SiteMaps - describe your content in ways Google can understand OAI-PMH interface can be treated as a SiteMap
being searchable (2) Z39.50 from the library domain allows the target to participate in a cross search very mature, very widely deployed not a web protocol SRU web-ified Z39.50 ReSTful Common Query Language (CQL) SRW as above, but for heavier SOA/Web Services use OpenSearch piggyback on RSS/Atom
being searchable (3) search portals community portals institutional portals/VLEs
be syndicable, enable re-use by 3rd parties consider RSS (and the Atom syndication format) in some ways the lingua franca of Web 2.0 machine and human friendly surprsing how much content lends itself to this structure RSS2.0 can also ‘enclose’ binary data syndicating podcasts “ the coolest use of your data will be thought of by someone else” be  mashup  friendly: addressable content cool URLs simple formats aspire to APIs that need no documentation!
human and machine interfaces (1) they’re completely different....right? well, not necessarily RSS! OAI-PMH with a CSS stylesheet referenced from the XML
human and machine interfaces (2) ‘ screen-scraping’ is back in fashion plain old semantic HTML (POSH) linked-data (the semantic web with a small ‘s’) the web of data is imminent!
future design: taking a REST from service provision the  resource -oriented-architecture ReST: resources with cool URLs 4 HTTP verbs: get, put, post & delete CRUD for the Web (create, retrieve, update, delete) make everything addressable with URLs be cool! make the URLs persistent make them human-parsable e.g. http://www.myserver.com/gallery/collections/pictures/image_0001.jpg is better than: http://www.myserver.com/gallery.php?collection_id=7&item_id=0001
my suggestions using  web  protocols make content  addressable  - and persistently so reduce barriers to third-parties developing other (competing!?) UIs are our UIs really just ‘gateways’ to information (implying that there is a wall around that information) making the machine APIs the heart of our services a good design principle is to use the machine API as  the  API used by our own user-interfaces we just can’t know for sure all the ways in which our information services might be used
acknowledgements in preparation for this presentation, I blogged about giving this presentation and asked my readers: “ Aside from the obvious stuff like OAI-PMH, Google, RSS, what should I be talking about? Persistent identifiers? Cool URLs? Any other suggestions?” 6 responses - all containing great suggestions which I have incorporated into this presentation, from the following people: Jim Downing, Owen Stephens, Ian Ibbotson, Pete Johnston, Mike Ellis thanks!! you can read all of the comments, and find links/addresses for these people on my blog at: http://blog.paulwalk.net/2008/02/11/making-digitised-content-available-for-searching-and-harvesting/
comments Ian Ibbotson said: It’s very hard to engineer a consistent search user interface when half the metadata refers to the actual digital artefact, and half to a front page. It’s useful to have both links, as you can then negotiate with providers if they feel you need to go through a front page for stats and marketing.... Pete Johnstone said: a shift away from the “repository” towards the “collection” or “collections” (which I think is the consequence of a more “resource-oriented view”) Owen Stephens said: Integration of resources into the wider web - e.g. LoC experiment with Flickr to expose content. Many projects in this area create a new silo of material that is hidden from the wider web [...] reusable metadata as well as objects. Jim Downing said: ....making the content reusable (not a hard sell in eLearning?). Recent use of RDF and Atom in a cultural setting:  Asemantics BBC aggregator Mike Ellis said: ....RSS, and possibly “programmable” RSS (for example, surfacing search results by adding query parameters to the feed address, etc)....
questions?

More Related Content

PPTX
A comparative study between commercial and open source discovery tools
PPTX
Web browser
PPTX
Search Engine
PDF
Internet and search engine
PPT
Hypertext and hypermedia
PPT
PPTX
Search engine ppt
PPT
Hypertext presentation
A comparative study between commercial and open source discovery tools
Web browser
Search Engine
Internet and search engine
Hypertext and hypermedia
Search engine ppt
Hypertext presentation

What's hot (20)

PPTX
Components of a search engine
PPTX
Lesson hypertext and intertext
PDF
L017447590
PDF
PPTX
Semantic Web
PPTX
Hypertext
PPTX
Avtar's ppt
PDF
Semantic web technology
PDF
Website Migration Planning
PPTX
Semantic web
PPT
Technical skills in multimedia for odl learners
PPTX
Bridging the gap from Wikipedia to scholarly resources: a simple JavaScript s...
PPTX
Bridging the Gap from Wikipedia to Scholarly Resources
PPTX
Semantic web
PDF
Singley "Building Privacy Infrastructure - An Academic Library’s Perspective"
PPTX
Discovery bookmarklet - Metro Science Librarians SIG
PDF
Search engine and web crawler
PPT
Internet Tutorial 03
 
DOCX
Semantic web Document
 
PDF
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
Components of a search engine
Lesson hypertext and intertext
L017447590
Semantic Web
Hypertext
Avtar's ppt
Semantic web technology
Website Migration Planning
Semantic web
Technical skills in multimedia for odl learners
Bridging the gap from Wikipedia to scholarly resources: a simple JavaScript s...
Bridging the Gap from Wikipedia to Scholarly Resources
Semantic web
Singley "Building Privacy Infrastructure - An Academic Library’s Perspective"
Discovery bookmarklet - Metro Science Librarians SIG
Search engine and web crawler
Internet Tutorial 03
 
Semantic web Document
 
An Intelligent Meta Search Engine for Efficient Web Document Retrieval

Similar to Re-usable metadata, re-usable content (20)

PPT
Open for Business Open Archives, OpenURL, RSS and the Dublin Core
PPT
How to Find a Needle in the Haystack
PPT
Audio in a social Web of linked data
PPT
5 steps to becoming a JISC IE content provider
PPT
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
PPT
Resource Discovery Landscape
PPTX
Introduction to APIs and Linked Data
PPT
Open, social and linked - what do current Web trends tell us about the future...
PPTX
How do I aggregate oers
PPT
Does metadata matter?
PDF
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
PPT
Digital library and MLE integration - where are we now and where do we want t...
PPT
Technical overview of the JISC Information Environment
PPTX
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
PPT
From bit-streams-to-life-streams-ajai-narendran-srishti-bangalore-stff-2011
PPT
Metadata april 8 2013
PPT
The JISC Information Environment and collection description
PDF
Content Used to be King: The Semantic Web in Education
PPT
Test presentation
PPT
Social networks and collaborative tool: connecting information in the Googlez...
Open for Business Open Archives, OpenURL, RSS and the Dublin Core
How to Find a Needle in the Haystack
Audio in a social Web of linked data
5 steps to becoming a JISC IE content provider
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Resource Discovery Landscape
Introduction to APIs and Linked Data
Open, social and linked - what do current Web trends tell us about the future...
How do I aggregate oers
Does metadata matter?
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
Digital library and MLE integration - where are we now and where do we want t...
Technical overview of the JISC Information Environment
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
From bit-streams-to-life-streams-ajai-narendran-srishti-bangalore-stff-2011
Metadata april 8 2013
The JISC Information Environment and collection description
Content Used to be King: The Semantic Web in Education
Test presentation
Social networks and collaborative tool: connecting information in the Googlez...

More from Paul Walk (20)

PPTX
COAR Notify - presentation to PRC Meeting Lyon Notify
PDF
Should Repositories Participate in the Fediverse?
PPTX
Introduction to the COAR Notify project
PPTX
Documenting metadata application profiles and vocabularies
PPTX
Next generation repositories
PDF
What does the next generation repository look like?
PPTX
COAR Next Generation Repositories Working Group
PPTX
Static Site Generators: what they are and when they are useful
PPTX
RIOXX: a Modern Metadata Application Profile
PDF
Implementing RIOXX
PPTX
Exploiting the value of Dublin Core through pragmatic development
PPTX
Rioxx 2 repository fringe
PPTX
The Strategic Developer: a new role for Higher Education?
PDF
Local, technical innovation in an outsourced world
PDF
Working with Developers
PPT
It's their cloud, not yours
PDF
Technical Challenges in Resource Discovery
PDF
Responsive Innovation in a Local Context
KEY
The Changing Role of the Developer in HE
KEY
Supporting Developers, Supporting Research
COAR Notify - presentation to PRC Meeting Lyon Notify
Should Repositories Participate in the Fediverse?
Introduction to the COAR Notify project
Documenting metadata application profiles and vocabularies
Next generation repositories
What does the next generation repository look like?
COAR Next Generation Repositories Working Group
Static Site Generators: what they are and when they are useful
RIOXX: a Modern Metadata Application Profile
Implementing RIOXX
Exploiting the value of Dublin Core through pragmatic development
Rioxx 2 repository fringe
The Strategic Developer: a new role for Higher Education?
Local, technical innovation in an outsourced world
Working with Developers
It's their cloud, not yours
Technical Challenges in Resource Discovery
Responsive Innovation in a Local Context
The Changing Role of the Developer in HE
Supporting Developers, Supporting Research

Recently uploaded (20)

PDF
GSA-Past-Papers-2010-2024-2.pdf CSS examination
PDF
English 2nd semesteNotesh biology biopsy results from the other day and I jus...
PDF
V02-Session-4-Leadership-Through-Assessment-MLB.pdf
PPTX
MMW-CHAPTER-1-final.pptx major Elementary Education
PDF
HSE 2022-2023.pdf الصحه والسلامه هندسه نفط
PPTX
INTRODUCTION TO PHILOSOPHY FULL SEM - COMPLETE.pptxINTRODUCTION TO PHILOSOPHY...
PPTX
ENGlishGrade8_Quarter2_WEEK1_LESSON1.pptx
PPTX
MALARIA - educational ppt for students..
PDF
The 10 Most Inspiring Education Leaders to Follow in 2025.pdf
DOCX
HELMET DETECTION AND BIOMETRIC BASED VEHICLESECURITY USING MACHINE LEARNING.docx
PPTX
Chapter-4-Rizal-Higher-Education-1-2_081545.pptx
PDF
HSE and their team are going through the hazards of the issues with learning ...
PDF
LATAM’s Top EdTech Innovators Transforming Learning in 2025.pdf
PDF
WHAT NURSES SAY_ COMMUNICATION BEHAVIORS ASSOCIATED WITH THE COMP.pdf
PPTX
FILIPINO 8 Q2 WEEK 1(DAY 1).power point presentation
PPTX
Unit1_Kumod_deeplearning.pptx DEEP LEARNING
PPTX
Environmental Sciences and Sustainability Chapter 2
PDF
Design and Evaluation of a Inonotus obliquus-AgNP-Maltodextrin Delivery Syste...
PDF
Physical pharmaceutics two in b pharmacy
PDF
Teacher's Day Quiz 2025
GSA-Past-Papers-2010-2024-2.pdf CSS examination
English 2nd semesteNotesh biology biopsy results from the other day and I jus...
V02-Session-4-Leadership-Through-Assessment-MLB.pdf
MMW-CHAPTER-1-final.pptx major Elementary Education
HSE 2022-2023.pdf الصحه والسلامه هندسه نفط
INTRODUCTION TO PHILOSOPHY FULL SEM - COMPLETE.pptxINTRODUCTION TO PHILOSOPHY...
ENGlishGrade8_Quarter2_WEEK1_LESSON1.pptx
MALARIA - educational ppt for students..
The 10 Most Inspiring Education Leaders to Follow in 2025.pdf
HELMET DETECTION AND BIOMETRIC BASED VEHICLESECURITY USING MACHINE LEARNING.docx
Chapter-4-Rizal-Higher-Education-1-2_081545.pptx
HSE and their team are going through the hazards of the issues with learning ...
LATAM’s Top EdTech Innovators Transforming Learning in 2025.pdf
WHAT NURSES SAY_ COMMUNICATION BEHAVIORS ASSOCIATED WITH THE COMP.pdf
FILIPINO 8 Q2 WEEK 1(DAY 1).power point presentation
Unit1_Kumod_deeplearning.pptx DEEP LEARNING
Environmental Sciences and Sustainability Chapter 2
Design and Evaluation of a Inonotus obliquus-AgNP-Maltodextrin Delivery Syste...
Physical pharmaceutics two in b pharmacy
Teacher's Day Quiz 2025

Re-usable metadata, re-usable content

  • 1. UKOLN is supported by: Re-usable metadata, re-usable content Paul Walk Technical Manager [email_address] A centre of expertise in digital information management www.ukoln.ac.uk
  • 2. harvesting, searching, syndicating options for metadata and content: the lines can be blurred search engines also harvest! your metadata may be my content metadata content harvestable searchable ✓ ✓ syndicable ✓
  • 3. being harvestable (1) Open Archives Initiative OAI-PMH repositories OAI-ORE aggregators: Intute Institutional Repository Search currently harvesting eprints metadata records from 88 institutions planning to explore the harvesting of metadata for: images learning objects other media..... MLA’s Discover Service your content is of interest to other domains
  • 4. being harvestable (2) what is your metadata record actually going to point to? more than one item of content? a ‘jumping off’ page? is this consistent? what metadata format are you going to use? is it commonly supported? are you using it correctly? (you’d be surprised.....) where/how is your metadata going to be used? this is necessarily out of your control!
  • 5. being searchable (1) exposing your content to search engines search engine optimisation (SEO) make it easy for the search engines have content people want make it eminently linkable Google is your friend! SiteMaps - describe your content in ways Google can understand OAI-PMH interface can be treated as a SiteMap
  • 6. being searchable (2) Z39.50 from the library domain allows the target to participate in a cross search very mature, very widely deployed not a web protocol SRU web-ified Z39.50 ReSTful Common Query Language (CQL) SRW as above, but for heavier SOA/Web Services use OpenSearch piggyback on RSS/Atom
  • 7. being searchable (3) search portals community portals institutional portals/VLEs
  • 8. be syndicable, enable re-use by 3rd parties consider RSS (and the Atom syndication format) in some ways the lingua franca of Web 2.0 machine and human friendly surprsing how much content lends itself to this structure RSS2.0 can also ‘enclose’ binary data syndicating podcasts “ the coolest use of your data will be thought of by someone else” be mashup friendly: addressable content cool URLs simple formats aspire to APIs that need no documentation!
  • 9. human and machine interfaces (1) they’re completely different....right? well, not necessarily RSS! OAI-PMH with a CSS stylesheet referenced from the XML
  • 10. human and machine interfaces (2) ‘ screen-scraping’ is back in fashion plain old semantic HTML (POSH) linked-data (the semantic web with a small ‘s’) the web of data is imminent!
  • 11. future design: taking a REST from service provision the resource -oriented-architecture ReST: resources with cool URLs 4 HTTP verbs: get, put, post & delete CRUD for the Web (create, retrieve, update, delete) make everything addressable with URLs be cool! make the URLs persistent make them human-parsable e.g. http://www.myserver.com/gallery/collections/pictures/image_0001.jpg is better than: http://www.myserver.com/gallery.php?collection_id=7&item_id=0001
  • 12. my suggestions using web protocols make content addressable - and persistently so reduce barriers to third-parties developing other (competing!?) UIs are our UIs really just ‘gateways’ to information (implying that there is a wall around that information) making the machine APIs the heart of our services a good design principle is to use the machine API as the API used by our own user-interfaces we just can’t know for sure all the ways in which our information services might be used
  • 13. acknowledgements in preparation for this presentation, I blogged about giving this presentation and asked my readers: “ Aside from the obvious stuff like OAI-PMH, Google, RSS, what should I be talking about? Persistent identifiers? Cool URLs? Any other suggestions?” 6 responses - all containing great suggestions which I have incorporated into this presentation, from the following people: Jim Downing, Owen Stephens, Ian Ibbotson, Pete Johnston, Mike Ellis thanks!! you can read all of the comments, and find links/addresses for these people on my blog at: http://blog.paulwalk.net/2008/02/11/making-digitised-content-available-for-searching-and-harvesting/
  • 14. comments Ian Ibbotson said: It’s very hard to engineer a consistent search user interface when half the metadata refers to the actual digital artefact, and half to a front page. It’s useful to have both links, as you can then negotiate with providers if they feel you need to go through a front page for stats and marketing.... Pete Johnstone said: a shift away from the “repository” towards the “collection” or “collections” (which I think is the consequence of a more “resource-oriented view”) Owen Stephens said: Integration of resources into the wider web - e.g. LoC experiment with Flickr to expose content. Many projects in this area create a new silo of material that is hidden from the wider web [...] reusable metadata as well as objects. Jim Downing said: ....making the content reusable (not a hard sell in eLearning?). Recent use of RDF and Atom in a cultural setting: Asemantics BBC aggregator Mike Ellis said: ....RSS, and possibly “programmable” RSS (for example, surfacing search results by adding query parameters to the feed address, etc)....