Chiara Latronico,Europeana Cloud - Ingestion Clinic, The European Library
europeana
cloud
Ingestion Clinic
Chiara Latronico

Operations Officer, The European Library

Marian Lefferts

Executive Manager, CERL – WP4 Leader
Europeana Cloud Ingestion Clinic: 19-21 June, 3 July, 2013
Agenda





Ingestion process step by step
Ingestion plan broken down per provider
Rights documentation (Europeana Pro)
Other topics:






Thumbnails
De-duplication
Sets and subsets
Catalogue records vs digital objects
Collection descriptions

 Providers experience and questions
The European Library Portal
The European Library: Ingestion
Workflow
Preparation Work
Content ingestion questionnaire
Ingestion plan
Sample records to ingest
Datasets ready for harvesting
Create case in CRM: case # to provider

Step by Step
Harvest metadata
Enhance metadata
Index in acceptance portal
Communicate with data provider
Live index = live portal
Deliver to Europeana
Enhance and publish in Europeana
The European Library: System
Architecture
Harvesting: Repox
Harvesting: Repox
Ingestion: UIM Loading
Ingestion: UIM Validation
Ingestion: UIM Validation
Ingestion: UIM Validation
UIM Validation: Record in Portal
UIM Validation: Record in Portal
UIM Validation: Records in Portal
Validation: Acceptance Portal
XSLT to Internal Object Model
Ingestion: UIM OAI-EnrichmentAcceptance
Ingestion: UIM OAI-EnrichmentAcceptance
Validation: Acceptance Portal
Dataset in Acceptance

 Create an account on
http://www.theeuropeanlibrary.org/
 Use credentials to sign in to acceptance
http://www.tel.ulcc.ac.uk/acceptance/
 Validate data using tabs
 Default
 Dublin Core
 (Soon) EDM
Validation: Acceptance Portal
Acceptance Portal:
Communication

 When a dataset is in acceptance
 Communication with data provider
 Fixing dataset if needed
 More commination until provider
gives approval to publish
 Data provider accepts dataset
 Dataset ready for The European Library
live index
Ingestion: UIM Index to Publish
Live Index: Live Portal

 When a provider accepts dataset
 Dataset ready for live index
 Dataset indexed into the live portal
 It takes from 1 day to 1 week for a
dataset to be searchable in The
European Library live portal
(this is variable and changes due to circumstances)
Dataset Live in Europeana
When a provider accepts a dataset
 Dataset delivered to Europeana
 Dataset searchable in Europeana by
following quarter
 Dataset published live in Europeana
 E-mail to provider with link to dataset into
Europeana portal
SugarCRM: eCloud Ingestion Plan

eCloud Ingestion Plan Report
eCloud Ingestion Plan: Hangout # 1
19 th June
1. National Library of Technology (NTK), Prague
Three datasets scheduled for Q2 2014
Delivery to The European Library: April 2014
In Europeana by Q3 2014
2. ULB
Five datasets scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q1 2014
3. DIALNET
Two datasets scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014
eCloud Ingestion Plan: Hangout # 1
19 th June
4. Tilburg University
One dataset scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014
5. OAPEN
Two datasets scheduled for Q2 2013
Delivery to The European Library: May 2013
In Europeana by Q3 2013
eCloud Ingestion Plan: Hangout # 2
(21 st June)
1. University of Edinburgh
Ten datasets scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q1 2014
2. DANS
Three datasets scheduled for Q3 2013
Delivery to The European Library: July 2013
In Europeana by Q4 2014
3. UNIBI
One dataset scheduled for Q3 2013
Delivery to The European Library: July 2013
In Europeana by Q4 2014
eCloud Ingestion Plan: Hangout # 2
(21 st June)
4. VU University
Nine datasets scheduled for Q3 2013
Delivery to The European Library: July 2013
In Europeana by Q4 2013
one dataset scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014
one dataset scheduled for Q3 2014
Delivery to The European Library: July 2014
In Europeana by Q4 2014
5. Wales
One dataset scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014
eCloud Ingestion Plan: Hangout # 3
(3 rd July)
1. Bavarian State Library
One dataset scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q4 2013
2. Debrecen University Library
Three datasets scheduled for Q1 2015
Delivery to The European Library: January 2015
In Europeana by Q2 2015
eCloud Ingestion Plan: Hangout # 3
(3 rd July)
3. HAZU
Twenty-eight sub-sets scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q1 2014
One sub-set scheduled for Q2 2014
Delivery to The European Library: April 2014
In Europeana by Q3 2014
eCloud Ingestion Plan: Number of
Records
Records promised = Records delivered
Number of records promised needs to be the same of the
number of records delivered to The European Library
If a data provider cannot deliver the record promised
 The Collections Team needs to be informed soon
If a data provider has more records to deliver
 It’s good news and we will be happy to ingest more
Deliverable D4.1 (containing the ingestion schedule) is available
on Basecamp and can be accessed by everyone
Europeana Pro Website
Europeana Pro is the Europeana Professional website
http://pro.europeana.eu/
Here is possible to find
Information about projects
News
Discussions
Technical documentation
 For data provider to make metadata
 Europeana rights information
Europeana Rights on Europeana Pro
Europeana Rights...
Define the rights to the digital object
A definition is mandatory for each record
Can be inserted into the metadata
Can be sent via email (if the same statement is
appliccable for each record)
There are 12 rights statements to choose from
 2 Public Domain
 6 Creative Commons Licenses
 4 Europeana Rights Reserved Statements
Europeana Rights on Europeana Pro website
http://pro.europeana.eu/web/guest/available-rights-statements
Other Topics
Thumbnails
Are not mandatory but they enrich the collection
Can be inserted into the metadata
A pattern to thumbnails can be sent via email
Other Topics
De-duplication
If two or more of your datasets share the same records,
the data provider needs to
Inform the Collections Team
Help us to identify a pattern to de-duplicate records
Or give us a list of identifiers to work with
The European Library portal clusters similar records
But Europrana does not accept duplications
Other Topics
Sub-sets
If a dataset is made up of
several sub-sets, the data
provider needs to
Inform the Collections Team
Because tables and Ingestion
Plan might need to be updated
Other Topics
Catalogue records and digital objects
A catalogue record (bibliographic info) is recommended for each
record
A link to a digital object is mandatory for each record
Link to digital objects need be inserted into the metadata
Europeana does not accept records without links to digital
objects
Other Topics
Example of record with no catalogue records or digital objects
Other Topics
Collection descriptions
A data provider could enrich a dataset by sending us a
collection description
It would appear on the collection level page in The European
Library portal
It would improve retrieval of a dataset on Google search
It supports data analysis for Content Ingestion Strategy
A few examples
Picture Archives and Graphics Collection, Austrian National
Library
Alba amicorum from the Koninklijke Bibliotheek, National
Library of the Netherlands
Digital Periodicals and Newspapers, National Library of
Spain
Other Topics
Providers experience
Comments about time table?
Special issues regarding your own datasets?
Assistance in preparing the data?
Issues with number of records?
Questions?
Thank you!
For every questions or feedback contact
collections@theeuropeanlibrary.org
Chiara Latronico
Chiara.Latronico@kb.nl
www.theeuropeanlibrary.org

More Related Content

PPT
Short Presentation on Europeana Cloud at Europeana AGM 2013
PPT
Introduction to eCloud
PPT
Europeana Cloud - Alastair Dunning - November 2013
PPT
BHL-Europe_metadata_harmonisation_TDWG_20111018_kollerw_hrainer
PPT
Europeana Cloud - Essential Facts (low resolution version)
PPT
BASE : a powerful search engine for Open Access documents
PPTX
SUMMA: A Common API for Linked Data Entity Summaries
PDF
OpenML Tutorial ECMLPKDD 2015
Short Presentation on Europeana Cloud at Europeana AGM 2013
Introduction to eCloud
Europeana Cloud - Alastair Dunning - November 2013
BHL-Europe_metadata_harmonisation_TDWG_20111018_kollerw_hrainer
Europeana Cloud - Essential Facts (low resolution version)
BASE : a powerful search engine for Open Access documents
SUMMA: A Common API for Linked Data Entity Summaries
OpenML Tutorial ECMLPKDD 2015

Similar to Chiara Latronico,Europeana Cloud - Ingestion Clinic, The European Library (20)

PPT
Chiara Latronico, Europeana Cloud - Ingestion and Aggregation Workshop, The E...
PPT
Europeana Cloud - Ingestion and Aggregation Workshop
PPT
Chiara latronico, Europeana Collections 1914-1918 - Ingestion and Aggregation...
PDF
Europeana aggregation workflow
PDF
Aggregation workflow
PDF
Ingestion workflows. Presentation at the Europeana Aggregator Forum 2015
PPTX
europeana agm 2015, 4/11, europeana cloud - alastair dunning & pavel kats
PPT
Aggregation Workflow at Europeana Aggregator Forum
PPT
AGM 2013 - Strategic Plan working groups outcomes
PPTX
Results of aggregator needs europeana cloud
PPT
Europeana Cloud: The Essential Facts
PPT
Europeana Newspapers Aggregation Plan
PPTX
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
PDF
Eudat research data management services | www.eudat.eu |
PPT
Olaf Janssen on the principles of large-scale digital libraries and their app...
PPT
Europeana Cloud Aggregator Forum 2014
PPT
Europeana @ NISO Bibliographic Roadmap Meeting
PPT
Europeana and Researchers
PPT
Ecloud copenhagen-130625074823-phpapp01
PPT
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
Chiara Latronico, Europeana Cloud - Ingestion and Aggregation Workshop, The E...
Europeana Cloud - Ingestion and Aggregation Workshop
Chiara latronico, Europeana Collections 1914-1918 - Ingestion and Aggregation...
Europeana aggregation workflow
Aggregation workflow
Ingestion workflows. Presentation at the Europeana Aggregator Forum 2015
europeana agm 2015, 4/11, europeana cloud - alastair dunning & pavel kats
Aggregation Workflow at Europeana Aggregator Forum
AGM 2013 - Strategic Plan working groups outcomes
Results of aggregator needs europeana cloud
Europeana Cloud: The Essential Facts
Europeana Newspapers Aggregation Plan
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
Eudat research data management services | www.eudat.eu |
Olaf Janssen on the principles of large-scale digital libraries and their app...
Europeana Cloud Aggregator Forum 2014
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana and Researchers
Ecloud copenhagen-130625074823-phpapp01
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud

More from The European Library (20)

PPT
Linking Collections Through Linked Open Data
PPT
Linked Data and cultural heritage data: an overview of the approaches from Eu...
PPT
Freire model api
PPT
The european library ukb nienke 13 feb 2014
PPT
Aubéry Escande - Europeana Newspapers - A new tool for researchers
PDF
Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
PPT
Europeana Newspapers (Project Details and Aggregation Workflow)
PDF
Europeana Newspapers Aggregation and Indexing Plan
PPT
Alastair Dunning, Open data at The European library, TEL
PPT
Alastair Dunning, Europeana Newspapers, The European Library
PPT
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
PPT
Alastair Dunning, Introduction to Europeana Cloud, The European Library
PPT
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
PPT
Dunning welsh-newspapers-130314110640-phpapp01
PPT
Dunning seedi-2013-130517083015-phpapp02
PDF
Alastair Dunning, Breaking the waves, The European Library
PPT
Alastair Dunning, Challenges and Solutions in Creating a European Historic Ne...
PPT
Alastair Dunning, Future Directions for The European Library
PPT
Word Occurrence Based Extraction of Work Contributors from Statements of Resp...
PPT
Joining The European Library, Adam Sofronijevic, University of Belgrade
Linking Collections Through Linked Open Data
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Freire model api
The european library ukb nienke 13 feb 2014
Aubéry Escande - Europeana Newspapers - A new tool for researchers
Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
Europeana Newspapers (Project Details and Aggregation Workflow)
Europeana Newspapers Aggregation and Indexing Plan
Alastair Dunning, Open data at The European library, TEL
Alastair Dunning, Europeana Newspapers, The European Library
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, Introduction to Europeana Cloud, The European Library
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Dunning welsh-newspapers-130314110640-phpapp01
Dunning seedi-2013-130517083015-phpapp02
Alastair Dunning, Breaking the waves, The European Library
Alastair Dunning, Challenges and Solutions in Creating a European Historic Ne...
Alastair Dunning, Future Directions for The European Library
Word Occurrence Based Extraction of Work Contributors from Statements of Resp...
Joining The European Library, Adam Sofronijevic, University of Belgrade

Recently uploaded (20)

PDF
Connector Corner: Transform Unstructured Documents with Agentic Automation
PDF
NewMind AI Weekly Chronicles – August ’25 Week IV
PPTX
Information-Technology-in-Human-Society.pptx
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PPTX
Build automations faster and more reliably with UiPath ScreenPlay
PDF
EIS-Webinar-Regulated-Industries-2025-08.pdf
PDF
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
PDF
CEH Module 2 Footprinting CEH V13, concepts
PPTX
Presentation - Principles of Instructional Design.pptx
PDF
giants, standing on the shoulders of - by Daniel Stenberg
PDF
Streamline Vulnerability Management From Minimal Images to SBOMs
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PPTX
How to Convert Tickets Into Sales Opportunity in Odoo 18
PDF
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
PDF
Altius execution marketplace concept.pdf
PDF
Build Real-Time ML Apps with Python, Feast & NoSQL
PDF
Transform-Quality-Engineering-with-AI-A-60-Day-Blueprint-for-Digital-Success.pdf
PDF
Identification of potential depression in social media posts
PDF
Advancing precision in air quality forecasting through machine learning integ...
PPTX
AQUEEL MUSHTAQUE FAKIH COMPUTER CENTER .
Connector Corner: Transform Unstructured Documents with Agentic Automation
NewMind AI Weekly Chronicles – August ’25 Week IV
Information-Technology-in-Human-Society.pptx
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
Build automations faster and more reliably with UiPath ScreenPlay
EIS-Webinar-Regulated-Industries-2025-08.pdf
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
CEH Module 2 Footprinting CEH V13, concepts
Presentation - Principles of Instructional Design.pptx
giants, standing on the shoulders of - by Daniel Stenberg
Streamline Vulnerability Management From Minimal Images to SBOMs
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
How to Convert Tickets Into Sales Opportunity in Odoo 18
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
Altius execution marketplace concept.pdf
Build Real-Time ML Apps with Python, Feast & NoSQL
Transform-Quality-Engineering-with-AI-A-60-Day-Blueprint-for-Digital-Success.pdf
Identification of potential depression in social media posts
Advancing precision in air quality forecasting through machine learning integ...
AQUEEL MUSHTAQUE FAKIH COMPUTER CENTER .

Chiara Latronico,Europeana Cloud - Ingestion Clinic, The European Library

  • 2. europeana cloud Ingestion Clinic Chiara Latronico Operations Officer, The European Library Marian Lefferts Executive Manager, CERL – WP4 Leader Europeana Cloud Ingestion Clinic: 19-21 June, 3 July, 2013
  • 3. Agenda     Ingestion process step by step Ingestion plan broken down per provider Rights documentation (Europeana Pro) Other topics:      Thumbnails De-duplication Sets and subsets Catalogue records vs digital objects Collection descriptions  Providers experience and questions
  • 5. The European Library: Ingestion Workflow Preparation Work Content ingestion questionnaire Ingestion plan Sample records to ingest Datasets ready for harvesting Create case in CRM: case # to provider Step by Step Harvest metadata Enhance metadata Index in acceptance portal Communicate with data provider Live index = live portal Deliver to Europeana Enhance and publish in Europeana
  • 6. The European Library: System Architecture
  • 17. XSLT to Internal Object Model
  • 21. Dataset in Acceptance  Create an account on http://www.theeuropeanlibrary.org/  Use credentials to sign in to acceptance http://www.tel.ulcc.ac.uk/acceptance/  Validate data using tabs  Default  Dublin Core  (Soon) EDM
  • 23. Acceptance Portal: Communication  When a dataset is in acceptance  Communication with data provider  Fixing dataset if needed  More commination until provider gives approval to publish  Data provider accepts dataset  Dataset ready for The European Library live index
  • 24. Ingestion: UIM Index to Publish
  • 25. Live Index: Live Portal  When a provider accepts dataset  Dataset ready for live index  Dataset indexed into the live portal  It takes from 1 day to 1 week for a dataset to be searchable in The European Library live portal (this is variable and changes due to circumstances)
  • 26. Dataset Live in Europeana When a provider accepts a dataset  Dataset delivered to Europeana  Dataset searchable in Europeana by following quarter  Dataset published live in Europeana  E-mail to provider with link to dataset into Europeana portal
  • 27. SugarCRM: eCloud Ingestion Plan eCloud Ingestion Plan Report
  • 28. eCloud Ingestion Plan: Hangout # 1 19 th June 1. National Library of Technology (NTK), Prague Three datasets scheduled for Q2 2014 Delivery to The European Library: April 2014 In Europeana by Q3 2014 2. ULB Five datasets scheduled for Q4 2013 Delivery to The European Library: October 2013 In Europeana by Q1 2014 3. DIALNET Two datasets scheduled for Q1 2014 Delivery to The European Library: January 2014 In Europeana by Q2 2014
  • 29. eCloud Ingestion Plan: Hangout # 1 19 th June 4. Tilburg University One dataset scheduled for Q1 2014 Delivery to The European Library: January 2014 In Europeana by Q2 2014 5. OAPEN Two datasets scheduled for Q2 2013 Delivery to The European Library: May 2013 In Europeana by Q3 2013
  • 30. eCloud Ingestion Plan: Hangout # 2 (21 st June) 1. University of Edinburgh Ten datasets scheduled for Q4 2013 Delivery to The European Library: October 2013 In Europeana by Q1 2014 2. DANS Three datasets scheduled for Q3 2013 Delivery to The European Library: July 2013 In Europeana by Q4 2014 3. UNIBI One dataset scheduled for Q3 2013 Delivery to The European Library: July 2013 In Europeana by Q4 2014
  • 31. eCloud Ingestion Plan: Hangout # 2 (21 st June) 4. VU University Nine datasets scheduled for Q3 2013 Delivery to The European Library: July 2013 In Europeana by Q4 2013 one dataset scheduled for Q1 2014 Delivery to The European Library: January 2014 In Europeana by Q2 2014 one dataset scheduled for Q3 2014 Delivery to The European Library: July 2014 In Europeana by Q4 2014 5. Wales One dataset scheduled for Q1 2014 Delivery to The European Library: January 2014 In Europeana by Q2 2014
  • 32. eCloud Ingestion Plan: Hangout # 3 (3 rd July) 1. Bavarian State Library One dataset scheduled for Q4 2013 Delivery to The European Library: October 2013 In Europeana by Q4 2013 2. Debrecen University Library Three datasets scheduled for Q1 2015 Delivery to The European Library: January 2015 In Europeana by Q2 2015
  • 33. eCloud Ingestion Plan: Hangout # 3 (3 rd July) 3. HAZU Twenty-eight sub-sets scheduled for Q4 2013 Delivery to The European Library: October 2013 In Europeana by Q1 2014 One sub-set scheduled for Q2 2014 Delivery to The European Library: April 2014 In Europeana by Q3 2014
  • 34. eCloud Ingestion Plan: Number of Records Records promised = Records delivered Number of records promised needs to be the same of the number of records delivered to The European Library If a data provider cannot deliver the record promised  The Collections Team needs to be informed soon If a data provider has more records to deliver  It’s good news and we will be happy to ingest more Deliverable D4.1 (containing the ingestion schedule) is available on Basecamp and can be accessed by everyone
  • 35. Europeana Pro Website Europeana Pro is the Europeana Professional website http://pro.europeana.eu/ Here is possible to find Information about projects News Discussions Technical documentation  For data provider to make metadata  Europeana rights information
  • 36. Europeana Rights on Europeana Pro Europeana Rights... Define the rights to the digital object A definition is mandatory for each record Can be inserted into the metadata Can be sent via email (if the same statement is appliccable for each record) There are 12 rights statements to choose from  2 Public Domain  6 Creative Commons Licenses  4 Europeana Rights Reserved Statements Europeana Rights on Europeana Pro website http://pro.europeana.eu/web/guest/available-rights-statements
  • 37. Other Topics Thumbnails Are not mandatory but they enrich the collection Can be inserted into the metadata A pattern to thumbnails can be sent via email
  • 38. Other Topics De-duplication If two or more of your datasets share the same records, the data provider needs to Inform the Collections Team Help us to identify a pattern to de-duplicate records Or give us a list of identifiers to work with The European Library portal clusters similar records But Europrana does not accept duplications
  • 39. Other Topics Sub-sets If a dataset is made up of several sub-sets, the data provider needs to Inform the Collections Team Because tables and Ingestion Plan might need to be updated
  • 40. Other Topics Catalogue records and digital objects A catalogue record (bibliographic info) is recommended for each record A link to a digital object is mandatory for each record Link to digital objects need be inserted into the metadata Europeana does not accept records without links to digital objects
  • 41. Other Topics Example of record with no catalogue records or digital objects
  • 42. Other Topics Collection descriptions A data provider could enrich a dataset by sending us a collection description It would appear on the collection level page in The European Library portal It would improve retrieval of a dataset on Google search It supports data analysis for Content Ingestion Strategy A few examples Picture Archives and Graphics Collection, Austrian National Library Alba amicorum from the Koninklijke Bibliotheek, National Library of the Netherlands Digital Periodicals and Newspapers, National Library of Spain
  • 43. Other Topics Providers experience Comments about time table? Special issues regarding your own datasets? Assistance in preparing the data? Issues with number of records? Questions?
  • 44. Thank you! For every questions or feedback contact [email protected] Chiara Latronico [email protected]

Editor's Notes

  • #23: DC – shows the default but in XML.