Repository Deposit Service Description. OR 2007: the 2nd International Conference on Open Repositories, San Antonio, Texas, USA, 23-26 Jan 2007. Presenter: Julie Allinson, UKOLN, University of Bath. Co-authors: Rachel Heery (UKOLN), Martin Morrey (Intrallect), Christopher Gutteridge (Southampton), and Jim Downing (Cambridge). www.bath.ac.uk. UKOLN, a centre of expertise in digital information management: www.ukoln.ac.uk.
Overview: background and context; requirements for depositing content in repositories; defining a lightweight deposit service; developing the service; proof-of-concept implementation (update).
Context: Higher (and Further) Education in the United Kingdom. JISC is the Joint Information Systems Committee. JISC has made considerable investment in UK repositories R&D over the last 5 years, and this is continuing: FAIR Programme (2002-2005); Exchange for Learning (X4L) Programme (2002-2005); Digital Repositories Programme (2005-2007); JISC Capital Programme, Repositories and Preservation strand (2006-2009).
Deposit API: The Deposit API activity was brought together to find a lightweight solution to assist in populating repositories within the timescales of JISC programmes. It comprised a group of repository software developers from Eprints.org, DSpace, Fedora, Intrallect and others, facilitated by the JISC Repositories Research Team, to address the need for a common deposit standard.
Background motivation: In general, developers are not creating repository systems and software from scratch. Repositories must interface with each other, with users, and with other applications within institutions and the wider information landscape: VLEs, authoring tools, packaging tools, name authority services, classification services and research systems. There is no common deposit API or protocol.
Pain points: There is no standardised way of transferring existing collections of digital objects and/or metadata from a filesystem or legacy database into a repository; no standard interface for tagging, packaging or authoring tools to upload catalogued objects into a repository; no standard interface for transferring digital objects between repositories; no way of initiating a contribution workflow from outside a repository system; and no way of including deposit into a repository as part of a service-oriented architecture. For harvesting there is OAI-PMH, which has had a major impact. There is no equivalent mechanism for deposit.
Why is deposit so important? Without it, there is nothing in our repositories. Ensuring the emerging network of repositories is well populated with content is a PRIORITY. Encouraging deposit is one of the most difficult cultural issues for repositories. Technology needs to support culture change and advocacy through ease of use, multiple deposit and auto-deposit, NOT closed or proprietary mechanisms.
Repository: stores, manages and makes available content and metadata. Interfaces: Deposit, Delete, Search, Harvest, Obtain. (From Andy Powell, A 'service oriented' view of the JISC Information Environment: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/soa/jisc-ie-soa.pdf) Similarly, the ORE initiative identifies put (deposit), obtain and harvest services.
Deposit – abstract service definition. A Deposit interface provides an interface through which content and metadata can be deposited, and initiates the ingest process for local storage. (Summarised from Andy Powell, A 'service oriented' view of the JISC Information Environment: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/soa/jisc-ie-soa.pdf) Put: a put service supports the request for ingest of one or more surrogates into a repository, thereby allowing the addition of digital objects to the repositories’ collection. (From An interoperable fabric for scholarly value chains: http://www.dlib.org/dlib/october06/vandesompel/10vandesompel.html)
A note on terminology: Add, Deposit, Put, Submit, Post and Ingest are largely synonymous, with subtle differences, often related to community of use. Ingest: defined by OAIS; deposit, put, add etc. may be part of an ingest process, along with other functions, which may include both automated and manual procedures including format checking, editorial control, quality assurance mechanisms, etc.; these are out of scope for this activity. Deposit: used by the e-Framework; the terminology of choice for this work. Put: used by ORE. Post: used for blogs.
User requirements / scenarios: (1) Author deposits using a desktop authoring system to a mediated multiple deposit service. (2) A user submits an IMS-compliant learning object to a National Repository using a client application. (3) Deposit into multiple repositories. (4) Transfer between intermediate hosts. (5) Repositories share improved metadata. (6) Experimental data output from a spectrometer is 'saved as' a file, and a file containing metadata on operational parameters is also generated. A data capture service is invoked and the files pertaining to the experiment are deposited, along with the necessary metadata, in the laboratory repository. See more at http://www.ukoln.ac.uk/repositories/digirep/
Scenario 1: Author deposits using a desktop authoring system to a mediated multiple deposit service. Author A deposits via an easy-deposit desktop application into the institutional repository's mediated deposit queue. Librarian L completes the deposit through the repository interface. Librarian L invokes deposit of a surrogate into arxiv.org. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram labels: Deposit, id.)
Scenario 2: A user submits an IMS-compliant learning object to a National Repository using a client application. A user wishes to submit an IMS-compliant content package to a repository using a client application. The user can choose from a list of ‘groups/collections’ to which they are allowed to deposit in this centralised national LO repository. They are not required to use the repository interface, but can deposit via a decentralised client. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram labels: Put, id.)
Scenario 3: Deposit in multiple repositories. A depositor is required to submit to a Research Council repository, but they also wish to deposit into their institutional repository and a relevant subject repository. The depositor can choose one or more repositories to deposit into. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram label: Deposit.)
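As an illustration of Scenario 3, the following is a minimal sketch of a client fanning one object out to several depositor-selected repositories. It is not taken from the slides: the function name `deposit_to_many`, the target names and the result dictionaries are all hypothetical, and each target is modelled as a plain callable rather than a real repository client.

```python
from typing import Callable, Dict

# Hypothetical shape: a deposit function takes the object's bytes and
# returns an identifier, or raises an exception if the deposit fails.
DepositFn = Callable[[bytes], str]


def deposit_to_many(targets: Dict[str, DepositFn], payload: bytes) -> Dict[str, dict]:
    """Send the same payload to every selected target and collect per-target results."""
    results = {}
    for name, deposit in targets.items():
        try:
            results[name] = {"status": "success", "identifier": deposit(payload)}
        except Exception as exc:
            # A failure at one repository should not block the others.
            results[name] = {"status": "failure", "reason": str(exc)}
    return results
```

Reporting a result per target, rather than a single overall outcome, lets the depositor see which of the chosen repositories accepted the object.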
Scenario 4: Transfer between intermediate hosts. A repository may transfer objects to other repositories, or services, e.g. a preservation service. Subsequent repositories may also transfer objects. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram label: Deposit.)
Scenario 5: Repositories share improved metadata (put both ways). Repository A deposits an object in another repository. Repository B improves the metadata and deposits the object back into repository A. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram labels: Deposit, id.)
Scenario 6: Laboratory auto-deposit. Experimental data output from laboratory machines is deposited, along with the necessary metadata, in the laboratory repository in an automated process. A metadata record is also deposited into the Institutional Repository. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram label: Deposit.)
Some functional requirements. A Deposit service should: be generic enough to support a wide range of heterogeneous repositories (scholarly publications, data, learning objects, images, etc.); accept submission of different digital object types in a consistent way (data and/or metadata in the form of complex objects or content packages); support different workflows for deposit, e.g. user to multiple repositories via intermediate client; user to repository, repository to additional repositories; user-triggered and machine-triggered deposit; accept large-scale deposits (scientific datasets); support statuses, e.g. deposit to different states of a workflow; support collections and changes in policy and permissions; support differences in repository policy; support non-instantaneous processes, e.g. deposit pending mediation; support validation reports and integrity checks; support anonymous deposit; support more complex, authenticated deposit; support acceptance and handling of incomplete records; support rejection of records (reasons for rejection are out of scope); support human-selected targets for deposit; support different deposit requests.
Defining a lightweight deposit service. Define the abstract service scope: information models and APIs must be developed in a manner neutral to implementation binding. Examine existing protocols and specifications: could they be used to implement the defined abstract service? Evaluate and decide whether a new protocol or API is required. Abstract service: a discrete piece of technical functionality required to fulfil a specific requirement or set of requirements; synonymous with a ‘service genre’ in the JISC DEST e-Framework.
Existing standards: WebDAV (http://www.webdav.org/); JSR 170 (http://www.jcp.org/en/jsr/detail?id=170); JSR 283 (http://www.jcp.org/en/jsr/detail?id=283); SRW Update (http://www.loc.gov/standards/sru/); Flickr Deposit API (http://www.flickr.com/services/api/); Fedora Deposit API (http://www.fedora.info/definitions/1/0/api/); OKI OSID (http://www.okiproject.org/); ECL (http://ecl.iat.sfu.ca/); ATOM Publishing Protocol (http://www.ietf.org/html-charters/atompub-charter.html).
Deposit – two components. Deposit: a service offered by a repository, allowing remote users (machines or people) to upload data. Data in: a deposit request with optional parameters (e.g. digital object ‘semantics’, metadata formats). Data out: status (success, failure, pending), receipt confirmation and digital object identifier. Explain: a service offered by a repository, allowing remote users (machines or people) to inspect the repository for policy and/or other data. Data in: an introspection request (“explain”). Data out: an introspection response (“repository policy info”).
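The two components can be pictured as a pair of request/response shapes. The sketch below is only an illustration under stated assumptions: the class and field names (`DepositRequest`, `DepositResponse`, `ExplainResponse`, `InMemoryRepository`, `metadata_format`, `accepted_formats`) are hypothetical and do not come from the draft specification. Only the three statuses and the data in / data out split are taken from the slide.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional


class DepositStatus(Enum):
    SUCCESS = "success"
    FAILURE = "failure"
    PENDING = "pending"  # e.g. a deposit awaiting mediation


@dataclass
class DepositRequest:
    payload: bytes
    # Optional parameters, e.g. object 'semantics' or metadata format.
    parameters: dict = field(default_factory=dict)


@dataclass
class DepositResponse:
    status: DepositStatus
    receipt: Optional[str] = None
    identifier: Optional[str] = None


@dataclass
class ExplainResponse:
    # "Repository policy info"; the keys used here are assumptions.
    policy: dict = field(default_factory=dict)


class InMemoryRepository:
    """Toy stand-in for a repository offering Deposit and Explain."""

    def __init__(self, accepted_formats):
        self._accepted = set(accepted_formats)
        self._objects = {}

    def explain(self) -> ExplainResponse:
        return ExplainResponse(policy={"accepted_formats": sorted(self._accepted)})

    def deposit(self, request: DepositRequest) -> DepositResponse:
        fmt = request.parameters.get("metadata_format")
        if fmt is not None and fmt not in self._accepted:
            return DepositResponse(status=DepositStatus.FAILURE)
        identifier = f"obj-{len(self._objects) + 1}"
        self._objects[identifier] = request.payload
        return DepositResponse(
            status=DepositStatus.SUCCESS,
            receipt=f"received {len(request.payload)} bytes",
            identifier=identifier,
        )
```

A client would typically call `explain()` first to learn what the repository accepts, then shape its `DepositRequest` accordingly.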
Draft XML serialisations: http://www.ukoln.ac.uk/repositories/digirep
Deposit service specification. To recap: following the scope defined earlier, the repository developers came up with two services, deposit and explain, and a draft XML serialisation for each. The service will work by the client issuing XML commands over HTTP to the repository Deposit service; the service responds with formatted XML messages. Other approaches may also be considered, e.g. SOAP. A layered approach was taken, with the specification of two levels of compliance: Level 0 compliance requires a set of mandatory elements; Level 1 offers a set of additional optional elements that may or may not be used.
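A rough sketch of what "XML commands over HTTP" with two compliance levels might look like from the client side follows. The draft serialisation itself is not reproduced in these slides, so every element name here (`deposit`, `target`, `content`, `metadataFormat`, `onBehalfOf`, `depositResponse`, `status`, `identifier`) and the example URL are assumptions made purely for illustration. The request is built but deliberately not sent.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Hypothetical element sets; the real Level 0 / Level 1 lists are not given here.
LEVEL0_REQUIRED = ("target", "content")
LEVEL1_OPTIONAL = ("metadataFormat", "onBehalfOf")


def build_deposit_xml(fields: dict) -> bytes:
    """Serialise a flat dict of fields as children of a <deposit> element."""
    root = ET.Element("deposit")
    for name, value in fields.items():
        ET.SubElement(root, name).text = value
    return ET.tostring(root, encoding="utf-8")


def compliance_level(xml_bytes: bytes) -> int:
    """Return 0 or 1; raise ValueError if a mandatory element is missing."""
    present = {child.tag for child in ET.fromstring(xml_bytes)}
    missing = [name for name in LEVEL0_REQUIRED if name not in present]
    if missing:
        raise ValueError(f"missing mandatory elements: {missing}")
    return 1 if present & set(LEVEL1_OPTIONAL) else 0


def make_http_request(url: str, xml_bytes: bytes) -> urllib.request.Request:
    """Prepare (but do not send) an HTTP POST carrying the XML command."""
    return urllib.request.Request(
        url,
        data=xml_bytes,
        headers={"Content-Type": "application/xml"},
        method="POST",
    )


def parse_deposit_response(xml_bytes: bytes) -> dict:
    """Pull status and identifier out of a (hypothetical) XML response."""
    root = ET.fromstring(xml_bytes)
    return {
        "status": root.findtext("status"),
        "identifier": root.findtext("identifier"),
    }
```

Passing the prepared request to `urllib.request.urlopen` would perform the actual call. The layered check mirrors the idea that a Level 0 client sends only the mandatory elements, while a Level 1 client may add optional ones.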
Some issues. Boundaries between deposit and ingest: what has already happened at the point of deposit, regarding metadata and identifiers? How far does the deposit service need to validate what is being deposited, and can it reject deposit requests? Data integrity: is there a requirement to get back (export) the exact object that was deposited? Multiple data types, metadata formats and content packages: how far should the deposit service check its ability to accept what is being deposited? Can look-up of policy rules be done as a request to a service registry? How far is look-up of policy rules automated? Authorisation and authentication: how will the deposit service check the authority of the person/machine doing the ‘putting’? How will it interface with auth services?
Next steps: Finish it and test it! At the moment, the deposit web service is still embryonic. To take it forward, a funding proposal has been submitted to: finalise the original deposit API work; test it against different repository software (Eprints, DSpace, Fedora, Intrallect intraLibrary); build a client implementation; iteratively revise and re-test; disseminate and embed into the repositories community.
Final thoughts … This work is aligned with the vision of the JISC-DEST e-Framework and the SOA approach in general; with the JISC Information Environment commitment to interoperability and the use of web services to facilitate interaction between repositories and other services; and with the objectives of the Object Re-use and Exchange Initiative and the definition of a ‘put’ interface. Thank you …

Repository Deposit Service Description

  • 1. Repository Deposit Service Description OR 2007 : the 2nd International Conference on Open Repositories San Antonio, Texas, USA, 23-26 Jan 2007 Presenter: Julie Allinson, UKOLN, University of Bath Co-authors: Rachel Heery (UKOLN), Martin Morrey (Intrallect), Christopher Gutteridge (Southampton), and Jim Downing (Cambridge) www.bath.ac.uk a centre of expertise in digital information management www.ukoln.ac.uk UKOLN is supported by:
  • 2. Overview Background and context Requirements for depositing content in repositories Defining a lightweight deposit service Developing the service Proof-of-concept implementation - update
  • 3. Context Higher (and Further) Education in the United Kingdom JISC – the Joint Information Systems Committee JISC – considerable investment in UK repositories R&D over the last 5 years, and continuing FAIR Programme (2002-2005) Exchange for Learning (X4L) Programme (2002-2005) Digital Repositories Programme (2005-2007) JISC Capital Programme Repositories and Preservation strand (2006-2009)
  • 4. Deposit API Deposit API activity was brought together to find lightweight solution to assist populating repositories within timescales of JISC programmes It comprised a group of repository software developers from Eprints.org, DSpace, Fedora, Intrallect and others facilitated by the JISC Repositories Research Team to address the need for a common Deposit standard
  • 5. Background motivation In general, developers are not creating repository systems and software from scratch repositories must interface with each other, with users and with other applications within institutions and the wider information landscape VLEs, authoring tools, packaging tools, name authority services, classification services and research systems There is no common deposit API or protocol
  • 6. Pain points no standardised way of transferring existing collections of digital objects and/or metadata from a filesystem or legacy database into a repository no standard interface for tagging, packaging or authoring tools to upload catalogued objects into a repository no standard interface for transferring digital objects between repositories no way of initiating a contribution workflow from outside a repository system no way of including deposit into a repository a part of service orientated architecture for harvesting there is OAI-PMH – this has had a major impact There is no equivalent mechanism for deposit
  • 7. Why is deposit so important? Without it, there is nothing in our repositories Ensuring the emerging network of repositories is well populated with content is a PRIORITY Encouraging deposit is one of the most difficult cultural issues for repositories Technology needs to support culture change and advocacy, through ease of use multiple deposit auto-deposit NOT closed or proprietary mechanisms
  • 8. Repository Stores, manages and makes available content and metadata Deposit interface Delete interface Search interface Harvest interface Obtain interface From Andy Powell, A 'service oriented' view of the JISC Information Environment: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/soa/jisc-ie-soa.pdf Similarly, the ORE initiative identifies put (deposit), obtain and harvest services
  • 9. Deposit – abstract service definition A Deposit interface: Provides an interface through which content and metadata can be deposited and initiates ingest process for local storage. Summarised from Andy Powell, A 'service oriented' view of the JISC Information Environment: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/soa/jisc-ie-soa.pdf Put: A put service supports the request for ingest of one or more surrogates into a repository, thereby allowing the addition of digital objects to the repositories’ collection From An interoperable fabric for scholarly value chains: http://www.dlib.org/dlib/october06/vandesompel/10vandesompel.html
  • 10. A note on terminology Add Deposit Put Submit Post Ingest deposit, put, add etc. may be part of an ingest process, along with other functions may include both automated and manual procedures including format checking, editorial control, quality assurance mechanisms, etc. defined by OAIS these are out of scope for this activity used by the e-Framework terminology of choice for this work used by ORE used for blogs Largely synonymous, with subtle differences, often related to community of use
  • 11. User requirements / scenarios Author deposits using a desktop authoring system to a mediated multiple deposit service A user submits an IMS-compliant learning object to a National Repository using a client application Deposit into multiple repositories Transfer between intermediate hosts Repositories share improved metadata Experimental data output from spectrometer is 'saved as' a file and a file containing metadata on operational parameters is also generated. A data capture service is invoked and the files pertaining to the experiment are deposited, along with the necessary metadata, in the laboratory repository. See more at http://www.ukoln.ac.uk/repositories/digirep/
  • 12. Scenario 1 : Author deposits using a desktop authoring system to a mediated multiple deposit service A lightweight deposit web service can facilitate this transfer of object(s) Librarian L completes the deposit through the repository interface id Librarian L invokes deposit of a surrogate into arxiv.org Deposit id Author A deposits via an easy-deposit desktop application into the institutional repository's mediated deposit queue
  • 13. Scenario 2 : A user submits an IMS-compliant learning object to a National Repository using a client application A lightweight deposit web service can facilitate this transfer of object(s) A user wishes to submit an IMS-compliant content package to a repository using a client application id Put The user can choose from a list of ‘groups/collections’ to which they are allowed to deposit, in this centralised national LO repository. They are not required to use the repository interface, but can deposit via a decentralised client. id
  • 14. Scenario 3 : Deposit in multiple repositories A lightweight deposit web service can facilitate this transfer of object(s) Deposit The depositor can choose one or more repositories to deposit into A depositor is required to submit to a Research Council repository, but they also wish to deposit into their institutional repository and a relevant subject repository
  • 15. Scenario 4: Transfer between intermediate hosts. A repository may transfer objects to other repositories, or services, e.g. a preservation service. Subsequent repositories may also transfer objects. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram label: Deposit.)
  • 16. Scenario 5: Repositories share improved metadata (put both ways). Repository A deposits an object in another repository. Repository B improves the metadata and deposits the object back into repository A. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram labels: Deposit, id.)
  • 17. Scenario 6: Laboratory auto-deposit. Experimental data output from laboratory machines is deposited, along with the necessary metadata, in the laboratory repository in an automated process. A metadata record is also deposited into the Institutional Repository. A lightweight deposit web service can facilitate this transfer of object(s). (Diagram label: Deposit.)
  • 18. Some functional requirements. A Deposit service should: be generic enough to support a wide range of heterogeneous repositories (scholarly publications, data, learning objects, images, etc.); accept submission of different digital object types in a consistent way (data and/or metadata in the form of complex objects or content packages); support different workflows for deposit, e.g. user to multiple repositories via an intermediate client, user to repository, repository to additional repositories, user-triggered and machine-triggered deposit; accept large-scale deposits (scientific datasets); support statuses, e.g. deposit to different states of a workflow; support collections and changes in policy and permissions; support differences in repository policy; support non-instantaneous processes, e.g. deposit pending mediation; support validation reports and integrity checks; support anonymous deposit; support more complex, authenticated deposit; support acceptance and handling of incomplete records; support rejection of records (reasons for rejection are out of scope); support human-selected targets for deposit; support different deposit requests.
  • 19. Defining a lightweight deposit service. Define the abstract service: its scope, information models and APIs, developed in a manner neutral to implementation binding. Examine existing protocols and specifications: could they be used to implement the defined abstract service? Evaluate and decide whether a new protocol or API is required. Abstract service: a discrete piece of technical functionality required to fulfil a specific requirement or set of requirements; synonymous with a ‘service genre’ in the JISC DEST e-Framework.
  • 20. Deposit – abstract service definition. A Deposit interface: provides an interface through which content and metadata can be deposited and initiates ingest process for local storage. (Summarised from Andy Powell, A 'service oriented' view of the JISC Information Environment: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/soa/jisc-ie-soa.pdf) Put: a put service supports the request for ingest of one or more surrogates into a repository, thereby allowing the addition of digital objects to the repositories’ collection. (From An interoperable fabric for scholarly value chains: http://www.dlib.org/dlib/october06/vandesompel/10vandesompel.html)
  • 21. Existing standards: WebDAV (http://www.webdav.org/); JSR 170 (http://www.jcp.org/en/jsr/detail?id=170); JSR 283 (http://www.jcp.org/en/jsr/detail?id=283); SRW Update (http://www.loc.gov/standards/sru/); Flickr Deposit API (http://www.flickr.com/services/api/); Fedora Deposit API (http://www.fedora.info/definitions/1/0/api/); OKI OSID (http://www.okiproject.org/); ECL (http://ecl.iat.sfu.ca/); ATOM Publishing Protocol (http://www.ietf.org/html-charters/atompub-charter.html)
  • 22. Deposit – two components. Deposit: service offered by a repository, allowing remote users (machines or people) to upload data. Data in: deposit request with optional parameters (e.g. digital object ‘semantics’, metadata formats, ...). Data out: status (success, failure, pending), receipt confirmation and digital object identifier. Explain: service offered by a repository, allowing remote users (machines or people) to inspect the repository for policy and/or other data. Data in: introspection request (“explain”). Data out: introspection response (“repository policy info”).
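The two components above can be sketched as a small data model. This is only an illustrative sketch, not the draft specification: the class names, field names, the `InMemoryRepository` helper and the policy keys are all assumptions made for the example; only the data-in/data-out shapes (status of success, failure or pending, a receipt, an identifier, and a policy response) come from the slide.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional


class DepositStatus(Enum):
    """The three statuses named on the slide."""
    SUCCESS = "success"
    FAILURE = "failure"
    PENDING = "pending"


@dataclass
class DepositRequest:
    """Data in: the object plus optional parameters (hypothetical field names)."""
    payload: bytes
    semantics: Optional[str] = None        # e.g. a digital object type
    metadata_format: Optional[str] = None  # e.g. a metadata schema label


@dataclass
class DepositResponse:
    """Data out: status, receipt confirmation and object identifier."""
    status: DepositStatus
    receipt: Optional[str] = None
    identifier: Optional[str] = None


@dataclass
class ExplainResponse:
    """Data out of Explain: repository policy info (keys are assumptions)."""
    policy: dict = field(default_factory=dict)


class InMemoryRepository:
    """Toy repository offering both services; stands in for real software."""

    def __init__(self, accepted_formats, mediated=False):
        self.accepted_formats = set(accepted_formats)
        self.mediated = mediated
        self._objects = {}

    def explain(self) -> ExplainResponse:
        # Introspection: report policy so a client can check before depositing.
        return ExplainResponse(policy={
            "accepted_formats": sorted(self.accepted_formats),
            "mediated": self.mediated,
        })

    def deposit(self, request: DepositRequest) -> DepositResponse:
        # Reject a declared metadata format the repository does not accept.
        fmt = request.metadata_format
        if fmt is not None and fmt not in self.accepted_formats:
            return DepositResponse(status=DepositStatus.FAILURE)
        identifier = f"obj-{len(self._objects) + 1}"
        self._objects[identifier] = request.payload
        # A mediated queue means the deposit is accepted but not yet complete.
        status = DepositStatus.PENDING if self.mediated else DepositStatus.SUCCESS
        return DepositResponse(status, receipt=f"receipt-{identifier}",
                               identifier=identifier)
```

A client would typically call `explain()` first, then `deposit()`; a mediated repository answers with a pending status, as in Scenario 1.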
  • 23. Draft XML serialisations: http://www.ukoln.ac.uk/repositories/digirep
  • 24. Deposit service specification. To recap: following the scope defined earlier, the repository developers came up with two services, deposit and explain, and a draft XML serialisation for each. The service will work by the client issuing XML commands over HTTP to the repository Deposit service; the service responds with formatted XML messages. Other approaches may also be considered, e.g. SOAP. A layered approach was taken, with the specification of two levels of compliance: Level 0 compliance requires a set of mandatory elements; Level 1 offers a set of additional optional elements that may or may not be used.
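The XML-over-HTTP exchange and the two compliance levels described above might look roughly like the following. This is a sketch under stated assumptions: the draft serialisation is not reproduced in this deck, so the element names (`deposit`, `target`, `content`, `metadataFormat`, `checksum`, `status`) and the split between Level 0 and Level 1 are hypothetical. The request is only constructed, not sent.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Hypothetical element names; the real draft serialisation may differ.
LEVEL0_ELEMENTS = ("target", "content")           # mandatory (Level 0)
LEVEL1_ELEMENTS = ("metadataFormat", "checksum")  # optional (Level 1)


def build_deposit_xml(fields: dict) -> bytes:
    """Serialise a deposit command, enforcing the Level 0 mandatory set."""
    missing = [name for name in LEVEL0_ELEMENTS if name not in fields]
    if missing:
        raise ValueError(f"missing Level 0 elements: {missing}")
    root = ET.Element("deposit")
    # Level 1 elements are emitted only when supplied; unknown keys are ignored.
    for name in LEVEL0_ELEMENTS + LEVEL1_ELEMENTS:
        if name in fields:
            ET.SubElement(root, name).text = fields[name]
    return ET.tostring(root, encoding="utf-8")


def build_http_request(url: str, fields: dict) -> urllib.request.Request:
    """Wrap the XML command in an HTTP POST (constructed only, not sent)."""
    return urllib.request.Request(
        url,
        data=build_deposit_xml(fields),
        headers={"Content-Type": "application/xml"},
        method="POST",
    )


def parse_status(response_xml: bytes) -> str:
    """Read the status out of a (hypothetical) XML response message."""
    return ET.fromstring(response_xml).findtext("status", default="unknown")
```

Keeping the mandatory set small is what makes the Level 0 client easy to write; a repository that understands Level 1 can use the extra elements, while one that does not can ignore them.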
  • 25. Some issues. Boundaries between deposit and ingest: what has already happened at the point of deposit, regarding metadata and identifiers? How far does the deposit service need to validate what is being deposited, and can it reject deposit requests? Data integrity: is there a requirement to get back (export) the exact object that was deposited? Multiple data types, metadata formats and content packages: how far should the deposit service check its ability to accept what is being deposited? Can look-up of policy rules be done as a request to a service registry? How far is look-up of policy rules automated? Authorisation and authentication: how will the deposit service check the authority of the person/machine doing the ‘putting’? How will it interface with auth services?
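On the data integrity question above, one possible approach (an assumption for illustration, not something the draft service specifies) is for the client to record a digest at deposit time and compare it against the digest of whatever the repository later exports. The function names here are hypothetical.

```python
import hashlib


def digest(data: bytes) -> str:
    """SHA-256 hex digest of an object's bytes."""
    return hashlib.sha256(data).hexdigest()


def is_exact_copy(deposited: bytes, exported: bytes) -> bool:
    """True only if the exported object is byte-for-byte what was deposited."""
    return digest(deposited) == digest(exported)
```

A check like this only answers the byte-identity question; a repository that normalises or repackages content on ingest would fail it by design, which is part of why the issue is raised.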
  • 26. Next steps: finish it and test it! At the moment, the deposit web service is still embryonic. To take it forward, a funding proposal has been submitted to: finalise the original deposit API work; test it against different repository software (Eprints, DSpace, Fedora, Intrallect intraLibrary); build a client implementation; iteratively revise and re-test; disseminate and embed into the repositories community.
  • 27. Final thoughts … This work is aligned with the vision of the JISC-DEST e-Framework and the SOA approach in general; also with the JISC Information Environment commitment to interoperability and the use of web services to facilitate interaction between repositories and other services; and with the objectives of the Object Re-use and Exchange Initiative and the definition of a ‘put’ interface. Thank you …