Semantic Web, Linked Data: the Europeana case(s)
Antoine Isaac
Europeana
Vogin-ip-lezing, March 20, 2014
Europeana.eu, Europe’s cultural heritage portal
Text
Image
Video
Sound
3D
30M objects from 2,200 galleries, museums, archives and libraries
A broad, heterogeneous network
Audiovisual collections
National Aggregators
Regional Aggregators
Archives
Thematic collections
Libraries
Musées Lausannois
Culture.fr
The European Library
APEX
European Film Gateway
Europeana Fashion
2,300 galleries, museums, archives and libraries
A distribution channel
What Europeana gets (and makes available)
Descriptive metadata
Link to digital objects online
Making metadata work better
• Creating a new Europeana Data Model
http://pro.europeana.eu/edm-documentation
Prior to EDM: flat records
dc:contributor, dc:creator, dc:date, dc:format, dc:identifier, dc:language, dc:publisher, dc:relation,
dc:source, dcterms:alternative, dcterms:extent, dcterms:temporal, dcterms:medium,
dcterms:created, dcterms:provenance, dcterms:issued, dcterms:conformsTo, dcterms:hasFormat,
dcterms:isFormatOf, dcterms:hasVersion, dcterms:isVersionOf, dcterms:hasPart, dcterms:isPartOf,
dcterms:isReferencedBy, dcterms:references, dcterms:isReplacedBy, dcterms:replaces,
dcterms:isRequiredBy, dcterms:requires, dcterms:tableOfContents,
europeana:type, europeana:dataProvider, europeana:provider, europeana:isShownAt,
europeana:isShownBy, europeana:object, europeana:rights
• No links, e.g. between objects and context entities (persons, places)
• Mixing data on the real object and the digital content
• A lot of mapping quality problems
EDM: an example
More granular metadata
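A minimal Python sketch of what "more granular" can mean, under stated assumptions. The record values, the example.org URIs and the rule deciding which property goes where are illustrative only and are not Europeana's actual mapping. The class names edm:ProvidedCHO, edm:WebResource and ore:Aggregation and the property edm:aggregatedCHO come from EDM; the other property names are those listed on the previous slide.

```python
# Hypothetical flat record: all statements hang off one undifferentiated record,
# so data about the real object and about its digital copy are mixed together.
flat_record = {
    "dc:creator": "Pieter de Hooch",
    "dc:format": "image/jpeg",
    "europeana:isShownBy": "http://example.org/images/123.jpg",
    "europeana:dataProvider": "Rijksmuseum",
}

# Illustrative assumption: which flat properties describe the digital file
# and which describe the provider's aggregation. Everything else is treated
# as describing the cultural heritage object itself.
DIGITAL = {"dc:format"}
AGGREGATION = {"europeana:isShownBy", "europeana:dataProvider", "europeana:rights"}


def split_record(record, object_uri):
    """Split one flat record into three linked, typed resources."""
    cho = {"@id": object_uri, "@type": "edm:ProvidedCHO"}
    web = {"@id": record.get("europeana:isShownBy"), "@type": "edm:WebResource"}
    agg = {
        "@id": object_uri + "#aggregation",
        "@type": "ore:Aggregation",
        "edm:aggregatedCHO": object_uri,
    }
    for prop, value in record.items():
        if prop in DIGITAL:
            web[prop] = value
        elif prop in AGGREGATION:
            agg[prop] = value
        else:
            cho[prop] = value
    return cho, web, agg


cho, web, agg = split_record(flat_record, "http://example.org/object/123")
```

After the split, the creator is stated about the object, the file format about the web resource, and the data provider about the aggregation, which addresses the "mixing data" problem named on the previous slide.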
Harvesting thesauri as linked data
Contextual resources – multilingual & semantic linked data for Concepts
<skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2251">
  <skos:prefLabel xml:lang="">Harpsichord</skos:prefLabel>
  <skos:prefLabel xml:lang="de">Cembalo</skos:prefLabel>
  <skos:prefLabel xml:lang="sv">Cembalo</skos:prefLabel>
  <skos:prefLabel xml:lang="fr">Clavecin</skos:prefLabel>
  <skos:prefLabel xml:lang="it">Clavicembalo</skos:prefLabel>
  <skos:prefLabel xml:lang="en">Harpsichord</skos:prefLabel>
  <skos:prefLabel xml:lang="nl">Klavecimbel</skos:prefLabel>
  <skos:broader>
    <skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2239"/>
  </skos:broader>
</skos:Concept>
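A short Python sketch of how a client might use such a concept to display a label in the user's language. It assumes the fragment is wrapped in an rdf:RDF element with the standard RDF and SKOS namespace declarations, which the slide omits; the helper name pref_label and the trimmed label set are illustrative.

```python
import xml.etree.ElementTree as ET

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}
# ElementTree exposes xml:lang under the fixed XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"
RDF_ABOUT = "{%s}about" % NS["rdf"]

# The concept from the slide (a subset of its labels), wrapped in rdf:RDF
# so that the prefixes are declared. The wrapper is an assumption.
CONCEPT_XML = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2251">
    <skos:prefLabel xml:lang="">Harpsichord</skos:prefLabel>
    <skos:prefLabel xml:lang="de">Cembalo</skos:prefLabel>
    <skos:prefLabel xml:lang="fr">Clavecin</skos:prefLabel>
    <skos:prefLabel xml:lang="nl">Klavecimbel</skos:prefLabel>
    <skos:broader>
      <skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2239"/>
    </skos:broader>
  </skos:Concept>
</rdf:RDF>
"""


def pref_label(concept, lang, fallback=""):
    """Return the preferred label for `lang`, else the label tagged `fallback`."""
    labels = {
        el.get(XML_LANG, ""): el.text
        for el in concept.findall("skos:prefLabel", NS)
    }
    return labels.get(lang, labels.get(fallback))


root = ET.fromstring(CONCEPT_XML)
concept = root.find("skos:Concept", NS)
broader_uri = concept.find("skos:broader/skos:Concept", NS).get(RDF_ABOUT)
```

Here pref_label(concept, "nl") picks the Dutch label, while a language with no label falls back to the untagged one; broader_uri gives the link a client could follow to the more general concept.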
Contextual Resources – Places
Collaborative, soft standardization
• EDM is a cross-community development, involving library, archive and museum experts, plus academic partners
• A data model that re-uses several existing Semantic Web-based models
Different semantic grains
• Adopts the Semantic Web principle of specializing classes and properties
• Enables extensions, “application profiles”, based on needs and best practices from specific sectors or domains
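The benefit of specialization can be sketched in a few lines of Python: a consumer that only understands a general property can still use data expressed with a more specific one by walking up the sub-property chain. The hierarchy below (ex:librettist, ex:writer) is hypothetical and stands in for whatever a sector-specific profile would declare; it is not taken from any published schema.

```python
# Hypothetical sub-property declarations, in the spirit of rdfs:subPropertyOf.
# Each specialized property points to the more general property it refines.
SUB_PROPERTY_OF = {
    "ex:librettist": "ex:writer",
    "ex:writer": "dc:creator",
}


def generalize(prop, known):
    """Walk up the sub-property chain until a property in `known` is found.

    Returns None when no ancestor of `prop` is understood by the consumer.
    """
    seen = set()
    while prop is not None and prop not in seen:
        if prop in known:
            return prop
        seen.add(prop)  # guards against accidental cycles in the declarations
        prop = SUB_PROPERTY_OF.get(prop)
    return None


# A generic portal that only knows Dublin Core still finds a usable property.
generic_view = generalize("ex:librettist", known={"dc:creator", "dc:date"})
# A domain application that knows the profile keeps the finer grain.
domain_view = generalize("ex:librettist", known={"ex:writer", "dc:creator"})
```

The same data thus serves both grains: generic_view resolves to dc:creator and domain_view to ex:writer.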
Ready for metadata enrichment
• Already re-using third-party sources
  • GEMET, GeoNames, DBpedia
• Enrichment by providers or Europeana
  • In collaboration!
  • Example: Getty vocabularies http://www.getty.edu/research/tools/vocabularies/lod/
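As a rough illustration of enrichment, the Python sketch below links a free-text place value to a vocabulary URI by normalized label matching. The tiny in-memory vocabulary and its example.org URIs are hypothetical placeholders for a third-party source; a production enrichment process would need disambiguation that this sketch does not attempt.

```python
# Hypothetical mini-vocabulary: URI -> labels in several languages.
PLACES = {
    "http://example.org/place/amsterdam": ["Amsterdam"],
    "http://example.org/place/vienna": ["Vienna", "Wien", "Vienne"],
}

# Index every label in normalized form so lookups are case-insensitive.
LABEL_INDEX = {
    label.casefold(): uri for uri, labels in PLACES.items() for label in labels
}


def enrich(record, field="dc:coverage"):
    """Return a copy of `record` with a link added when the field value matches."""
    enriched = dict(record)
    value = record.get(field, "").strip().casefold()
    uri = LABEL_INDEX.get(value)
    if uri is not None:
        # The literal is kept; the link is added next to it.
        enriched[field + "#link"] = uri
    return enriched


result = enrich({"dc:title": "Stadtansicht", "dc:coverage": " wien "})
```

Keeping the original literal alongside the added link lets a provider or Europeana review or retract an enrichment without losing the source data.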
Facilitating re-use
Multiple Channels
• Search API: results in JSON, JSON-LD, RDF/XML
• Semantic mark-up on portal: schema.org, OpenGraph
• Linked Open Data prototype: content negotiation, RDF dumps, SPARQL endpoint
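Content negotiation means that the same URI returns a different representation depending on the Accept header the client sends. The Python sketch below only builds such a request with the standard library and does not send it. The item URI is a made-up placeholder, and whether a given server honours a particular media type is an assumption to verify against the live service.

```python
from urllib.request import Request

# Hypothetical item URI, used only to illustrate the shape of the request.
ITEM_URI = "http://data.europeana.eu/item/EXAMPLE/ID"


def rdf_request(uri, media_type="application/rdf+xml"):
    """Build (but do not send) a GET request asking for an RDF representation."""
    return Request(uri, headers={"Accept": media_type})


req = rdf_request(ITEM_URI)
html_req = rdf_request(ITEM_URI, media_type="text/html")
```

Passing req to urllib.request.urlopen would perform the request; a browser sending text/html to the same URI would be served a human-readable page instead.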
The legal side of data
Content - digital objects on the site of the provider
Metadata - descriptive object information
Different options
CC
Europeana and partners provide open metadata
Benefiting from existing R&D efforts
EuropeanaTech community
pro.europeana.eu/europeana-tech
Innovation - Searching
http://eculture.cs.vu.nl/europeana/
Innovation - Annotation
Pundit @ DM2E project http://dm2e.eu
Advocating LOD http://vimeo.com/36752317
Conclusions
• Big opportunities and challenges for Europeana and its partners
• Not implementing the full Semantic Web technical stack at once already brings benefits
• Seeing where the general Linked Data vision can change things
Thank you
#AllezCulture !
Antoine Isaac
antoine.isaac@europeana.eu
@EuropeanaTech
Useful links
• Europeana portal: europeana.eu
• EuropeanaTech community: pro.europeana.eu/europeana-tech
• Europeana Data Model documentation: pro.europeana.eu/edm-documentation
• Europeana Twitter: @EuropeanaEU
• EuropeanaTech Twitter: @EuropeanaTech


Editor's Notes

  • #3 Les Miserables: Victor Hugo’s handwritten manuscripts: http://www.europeana.eu/portal/record/9200103/5372912AF66AB529E188218BC1F747E75EB1A18F.html BnF, public domain
    Matisse ‘53 in the form of a double helix’ http://www.europeana.eu/portal/record/9200104/F8D60AB9136C8A59B59DF1CFEC278A6CABA8B0C6.html The Wellcome Library (CC-BY-NC-ND)
    ‘söprűtánc’ – Hungarian traditional dance http://www.europeana.eu/portal/record/08901/E1A7B01BE4AED87FD239672F4F3941F52262D6B2.html Hungarian Academy of Sciences Institute for Musicology, public domain
    ‘Neurologico reggae’ Music album http://www.europeana.eu/portal/record/08901/ADC241BCBF8470988DBA6EEAFCF13F14D88E5534.html DISMARC – EuropeanaConnect Paid Access
    ‘Castle of Kavala’ 3D exploration of a Greek castle http://www.europeana.eu/portal/record/2020703/05607B24D15BD516EE2B765F74CDA39C7427F7FB.html Cultural and Educational Technology Institute - Research Centre Athen CARARE CC-BY-NC-ND
  • #4 All partners send us descriptions of their assets, which we aggregate in a single service
  • #6 Example used is: http://preview.europeana.eu/portal/record/90402/174D436CF5C61F8AA999090C98DA48B9C7024087.html Een vrouw met een kind in een kelderkamer by Pieter de Hooch, Rijksmuseum, public domain
  • #9 View the object at: http://www.europeana.eu/portal/record/09102/_CM_0161930.html
  • #16 Dm2e:writer definition at http://onto.dm2e.eu/schemas/dm2e/1.0/#writer
  • #19 Data is available as ‘data dumps’ for Linked Open Data initiatives from data.europeana.eu. Europeana's move to CC0 is a step change in open data access. Releasing data from across the memory organisations of every EU country sets an important new international precedent, a decisive move away from the world of closed and controlled data. Note that previews can only be used in accordance with the rights information displayed next to them. HISPANA and Partage Plus both use the Europeana API to include Europeana search results on their own websites.