DSpace standard Data model
and DSpace-CRIS
DCAT Discussion: exploring convergence
Andrea Bollini (CTIO)
<andrea.bollini@4science.it>
• DSpace
• focus on digital object providing more or less support for the
Portland Common Data Model
• Communities & collections match the PCDM collection scope
& definition
• Data model is pre-defined
• DSpace-CRIS
• allows to manage additional entities to support rich context
around the digital object
• Out-of-box it provides support for the research domain (CRIS)
• Data model is open and configurable: other domains can be
managed without change the database or write code
High level comparison
DSpace-CRIS in a nutshell
2009 - The University of Hong Kong decides to enhance its Institutional Repository
Publications
People
Organisations
Projects
..more
Better
interaction,
reputation,
dissemination
Documentation and software available on the DSpace Wiki:
Asia | US | Canada | Brasil | Europe | Australia | Africa
More than 80 DSpace-CRIS installations running all around the World
2012 - DSpace-CRIS is released as open source to the community
Today
https://wiki.duraspace.org/display/DSPACECRIS/DSpace-CRIS+Home
Data model
Publications
Dataset
DSpace
Publications
Dataset
Researcher Pages
Projects
OrgUnits
2nd Level Dynamic
Objects
DSpace-CRIS
Approach: dynamic, flexible, not hardcoded
Attributes: only a few predefined
Entities: only a few predefined
Uis: all aspects can be defined
Main characteristics of the DSpace-CRIS data model
Main DSpace-CRIS Entities
Managing people:
Tracking author’s identifiers, name variants, career;
Synchronized with ORCID;
Manageable out-of-box by end users
Organisation & projects: Defined as separated entities;
Allow future extensions, specific workflows and editing rules
Other entities:
A single entity characterized by a profile/type (ie: Journal, Prize, Event, etc.;
Each profile has its own set of properties (independently defined)
www.cineca.itAn integrated view of the research,
other domains supported!
The data model configuration
doesn’t require coding!
Tabs
Boxes
Fields
How to collect the additional data
• automatically synchronized with external sources,
out-of-box using spreadsheets
• manually created during publication / dataset
submission
• Rich researcher profiles can be automatically imported from the
ORCID registry
• Rich information can be collected as part of the item submission
and moved to the right entity when confirmed
• new records can be confirmed / integrated by the administrator
• created as DSpace items and automatically converted
into the right entity upon validation
• managed via UI by administrators
• Search & browse all the entites
• Global search and specialized section for any entity
• Aggregated statistics: by all entity (author, organisation,
project, etc.)
• Full ORCID integration (push and pull information)
• Network visualization
• Alert and report about potential duplicate, merge tool for
administrators
• Metrics framework: allows to store the reference period,
the date of collection exposing the information to be shown
in the list or used to sort. Ranking, count and aggregation
performed automatically
• Automatic scanning of external datasource (WoS, Scopus,
etc.)
Additional DSpace-CRIS benefits
ORCID Integration
Login via ORCID iD
also for external
co-authors (if liked)
Lookup the registry
Transfer information from the repository
to the ORCID Registry and viceversa
Connect local profile with the ORCID registry
Create new record - facilitate the creation
of new ORCID records for researchers
Screens
from a recent public installation with basic layout
customization at The Hong Kong Polytechnic University
http://ira.lib.polyu.edu.hk/http://ira.lib.polyu.edu.hk (link)
Run the latest released
version 5.5
Manage people,
organisation,
publications patents,
thesis
Integrated with
bibliometric database
Run by the library,
integrated with HR,
commercial database,
etc.
Global search across the whole site with
highlighting of found terms
Narrowing the result by a specific
entity type provide further faceting
options
Dynamic components are contextualized
(site level, researcher outputs, researchers,
etc.)
At the site level the recent submissions are
all journal articles
Explore Theses: contextualised recent
submissions and top viewed
At the site level the most viewed item is a
Thesis, the second one a Journal article
Explore Research Output: contextualised
top list, specific browse & search indexes
Explore Researchers: different browse &
search indexes; different components
Beside the metadata the system
shows all the available metrics
5
Rank as percentile compared to
the other objects in the
repository
Grown tracked locally
Metric value (citation number)
linked to an external detail page
when provided
Link to Google Scholar citations &
altmetric badge
Link to Google Scholar citations &
altmetric badge
Links to more detailed record for
authors, journals and any related
entities
Data are organized in
several pages (tabs)
Each page is composed
by several boxes
Each box includes one or more fields.
Different datatypes are supported to
allow storing of photo, text, number,
identifiers, link to other entities
External identifiers can be used to directly address the profile page
without knowing the internal CRIS identifier
/cris/rp/details.html?lt=orcid&lv=0000-0002-2312-0318
Any type of identifier is allowed (including the ones created locally
by the institution – e.g.: staff #)
/cris/rp/details.html?lt=<ext_id>&lv=<ext_value>
This permits an easy and direct integration with other
institutional sites (staff directory, etc.)
All the objects (publications, projects, etc.) linked to a
researcher can be automatically included in her profile using
dynamic components
The metric framework can be applied to any entity – out-of-
the-box support of Scopus, Web of Science, PubMed and
internal usage statistics for publications and researchers
It is possible to explore the collaboration network of a
researcher based on your own criteria (co-authoring
publications, coinvestigation projects, co-interest, etc.)
Once a co-researcher is selected it is possible to see the
details of the collaboration
Once a co-researcher is selected it is possible to see the
details of the collaboration
It is also possible to navigate through the hierarchy of
organizations
All the objects (publications, projects, etc.) linked to the
organization can be automatically included
…including indirect reference (via researchers) and their
metrics
Internal usage statistics are available at all levels of hierarchy,
including the whole repository
Internal usage statistics are available at all levels of hierarchy,
including the whole repository
Internal usage statistics are available to all levels of hierarchy,
including the whole repository
Andrea Bollini (CTIO)
<andrea.bollini@4science.it>
mobile: +39 333 934 1808
skype: a.bollini
linkedin: andreabollini
orcid: 0000-0002-9029-1854
Thanks for your attention

DSpace standard Data model and DSpace-CRIS

  • 1.
    DSpace standard Datamodel and DSpace-CRIS DCAT Discussion: exploring convergence Andrea Bollini (CTIO) <[email protected]>
  • 2.
    • DSpace • focuson digital object providing more or less support for the Portland Common Data Model • Communities & collections match the PCDM collection scope & definition • Data model is pre-defined • DSpace-CRIS • allows to manage additional entities to support rich context around the digital object • Out-of-box it provides support for the research domain (CRIS) • Data model is open and configurable: other domains can be managed without change the database or write code High level comparison
  • 3.
    DSpace-CRIS in anutshell 2009 - The University of Hong Kong decides to enhance its Institutional Repository Publications People Organisations Projects ..more Better interaction, reputation, dissemination Documentation and software available on the DSpace Wiki: Asia | US | Canada | Brasil | Europe | Australia | Africa More than 80 DSpace-CRIS installations running all around the World 2012 - DSpace-CRIS is released as open source to the community Today https://wiki.duraspace.org/display/DSPACECRIS/DSpace-CRIS+Home
  • 4.
    Data model Publications Dataset DSpace Publications Dataset Researcher Pages Projects OrgUnits 2ndLevel Dynamic Objects DSpace-CRIS Approach: dynamic, flexible, not hardcoded Attributes: only a few predefined Entities: only a few predefined Uis: all aspects can be defined Main characteristics of the DSpace-CRIS data model
  • 5.
    Main DSpace-CRIS Entities Managingpeople: Tracking author’s identifiers, name variants, career; Synchronized with ORCID; Manageable out-of-box by end users Organisation & projects: Defined as separated entities; Allow future extensions, specific workflows and editing rules Other entities: A single entity characterized by a profile/type (ie: Journal, Prize, Event, etc.; Each profile has its own set of properties (independently defined)
  • 6.
    www.cineca.itAn integrated viewof the research, other domains supported!
  • 7.
    The data modelconfiguration doesn’t require coding!
  • 11.
  • 12.
    How to collectthe additional data • automatically synchronized with external sources, out-of-box using spreadsheets • manually created during publication / dataset submission • Rich researcher profiles can be automatically imported from the ORCID registry • Rich information can be collected as part of the item submission and moved to the right entity when confirmed • new records can be confirmed / integrated by the administrator • created as DSpace items and automatically converted into the right entity upon validation • managed via UI by administrators
  • 13.
    • Search &browse all the entites • Global search and specialized section for any entity • Aggregated statistics: by all entity (author, organisation, project, etc.) • Full ORCID integration (push and pull information) • Network visualization • Alert and report about potential duplicate, merge tool for administrators • Metrics framework: allows to store the reference period, the date of collection exposing the information to be shown in the list or used to sort. Ranking, count and aggregation performed automatically • Automatic scanning of external datasource (WoS, Scopus, etc.) Additional DSpace-CRIS benefits
  • 14.
    ORCID Integration Login viaORCID iD also for external co-authors (if liked) Lookup the registry Transfer information from the repository to the ORCID Registry and viceversa Connect local profile with the ORCID registry Create new record - facilitate the creation of new ORCID records for researchers
  • 17.
    Screens from a recentpublic installation with basic layout customization at The Hong Kong Polytechnic University
  • 18.
    http://ira.lib.polyu.edu.hk/http://ira.lib.polyu.edu.hk (link) Run thelatest released version 5.5 Manage people, organisation, publications patents, thesis Integrated with bibliometric database Run by the library, integrated with HR, commercial database, etc.
  • 19.
    Global search acrossthe whole site with highlighting of found terms
  • 20.
    Narrowing the resultby a specific entity type provide further faceting options
  • 21.
    Dynamic components arecontextualized (site level, researcher outputs, researchers, etc.)
  • 22.
    At the sitelevel the recent submissions are all journal articles
  • 23.
    Explore Theses: contextualisedrecent submissions and top viewed
  • 24.
    At the sitelevel the most viewed item is a Thesis, the second one a Journal article
  • 25.
    Explore Research Output:contextualised top list, specific browse & search indexes
  • 26.
    Explore Researchers: differentbrowse & search indexes; different components
  • 27.
    Beside the metadatathe system shows all the available metrics
  • 28.
    5 Rank as percentilecompared to the other objects in the repository Grown tracked locally Metric value (citation number) linked to an external detail page when provided
  • 30.
    Link to GoogleScholar citations & altmetric badge
  • 31.
    Link to GoogleScholar citations & altmetric badge
  • 32.
    Links to moredetailed record for authors, journals and any related entities
  • 33.
    Data are organizedin several pages (tabs)
  • 34.
    Each page iscomposed by several boxes
  • 35.
    Each box includesone or more fields. Different datatypes are supported to allow storing of photo, text, number, identifiers, link to other entities
  • 36.
    External identifiers canbe used to directly address the profile page without knowing the internal CRIS identifier /cris/rp/details.html?lt=orcid&lv=0000-0002-2312-0318 Any type of identifier is allowed (including the ones created locally by the institution – e.g.: staff #) /cris/rp/details.html?lt=<ext_id>&lv=<ext_value> This permits an easy and direct integration with other institutional sites (staff directory, etc.)
  • 37.
    All the objects(publications, projects, etc.) linked to a researcher can be automatically included in her profile using dynamic components
  • 38.
    The metric frameworkcan be applied to any entity – out-of- the-box support of Scopus, Web of Science, PubMed and internal usage statistics for publications and researchers
  • 39.
    It is possibleto explore the collaboration network of a researcher based on your own criteria (co-authoring publications, coinvestigation projects, co-interest, etc.)
  • 40.
    Once a co-researcheris selected it is possible to see the details of the collaboration
  • 41.
    Once a co-researcheris selected it is possible to see the details of the collaboration
  • 43.
    It is alsopossible to navigate through the hierarchy of organizations
  • 44.
    All the objects(publications, projects, etc.) linked to the organization can be automatically included
  • 45.
    …including indirect reference(via researchers) and their metrics
  • 46.
    Internal usage statisticsare available at all levels of hierarchy, including the whole repository
  • 47.
    Internal usage statisticsare available at all levels of hierarchy, including the whole repository
  • 48.
    Internal usage statisticsare available to all levels of hierarchy, including the whole repository
  • 49.
    Andrea Bollini (CTIO) <[email protected]> mobile:+39 333 934 1808 skype: a.bollini linkedin: andreabollini orcid: 0000-0002-9029-1854 Thanks for your attention

Editor's Notes

  • #38 Link inverso alle pubblicazioni