DSpace-CRIS Workshop
Open Repository 2015, Indianapolis
Andrea Bollini
Head of Open Source and Open Standards Strategy
• Who we are...
• The DSpace-CRIS project history
• The DSpace-CRIS aims to provide a rich
data model
• New features
• ORCID integration, current status & next
steps
Agenda
⁄ Employees: > 900
⁄ Offices: Bologna, Milan & Rome
⁄ Turnover: >100M Euro
⁄ Interuniversity Consortium
⁄ Not for Profit
⁄ Founded in 1969
⁄ Headquartered in Bologna
⁄ 75 Members
⁄ 70 Universities
⁄ 4 Research Institutes
⁄ Ministry of Education
www.cineca.itWho we are...
EUNIS
Board Member | Interoperability TF |
Business Intelligence TF | BencHEIT
TF
euroCRIS
Board Member | DRIS/Best Practice
TG | Architecture TG | CERIF TG
CHEITA
Leading Member | Benchmarking IT
Group | Research Group
Groningen Declaration
Founder Member
DSpace
Steering Committee | 2 Committers |
Registered Service Provider |
DSpace-CRIS
EAIE
Student Mobility Standard: RS3G
OpenAIRE
National Open Access Desk
Pentaho
Member of the Advisory Board
Institutional Technological
International Activities in Tertiary Edu. www.cineca.it
Workflows mapping business
processes
System Integration (legacy,
national, …)
Reports &
Analytics
WHAT IRIS ADDS
Cineca’s perspective
Open Source
DSpace extended to all Research entities
To better support:
People, Organization, Projects, etc.
Interactions
Reputation
Dissemination
Institutional
Repository
www.cineca.it
The DSpace-CRIS project history
Partnership with Hong Kong University2009
11-2012
06-2013
08-2013
09-2013
01-2014
06-2014
Release the whole project as open source
(DSpace 1.8.2)
Second release DSpace 1.8.2, bug fix only
Third release DSpace 1.8.3, security fix
First release for DSpace 3.2  DSpace-CRIS
3.2.0
Technical documentation released
DSpace 4.1  DSpace-CRIS 4.1.0
Second release DSpace 4.2
Third release DSpace 4.3
First release DSpace 5.2  DSpace-CRIS 5.2.0
09-2014
05-2015
05-2015
The DSpace-CRIS project history
2009
11-2012
06-2013
08-2013
09-2013
06-2014
05-2015
01-2014
Data model
Standard DSpace --> publications & dataset
APPROACH
DYNAMIC
FLEXIBLE
NOT
HARDCODED
UIs
FEW
PREDEFINED
ATTRIBUTES
ENTITIES
FEW
PREDEFINED
DSpace-CRIS  other CERIF entites:
/ ResearcherPages
/ Projects
/ OrgUnits
/ 2nd Level Dynamic Objects
DEFINE
ANY
ASPECTS
Main DSpace-CRIS Entities
– a single entity specialized by a profile (type) ie Journal, Prize, Event,
etc.;
– Every profile has it own set of properties and nested objects
independently defined
– managing people;
– structural relations with System Users and pre-
configured properties to manage the name variants;
– This is the only entity that can be managed out-of-box by
non-system administrators
– defined as separated entities;
– Easy to develop future extensions, specializations, specific
workflows and editing rules
ResearcherPages
Projects and OrgUnits
Second level Dynamic Objects
Defining a DSpace-CRIS Entity
• UUID
• Public flag
• Dynamic Properties: every property belongs to a
PropertyDefinition
• System administrator can decide:
– Repeatable
– Mandatory
– Storage data type: String, int, double, date, html link, link to other
entities
– Start / End date
– Scope: link to a Semantic/Classification Object (itself a DynamicObject)
• Nested object: Repeatable, mandatory, list of properties
A note about the DSpace Metadata model
DSpace is known to support out-of-box, the Dublin Core
Initiative
Work has been done to update from the LoC profile
to a dcterms schema
DSpace metadata model is not bound to DC / DC
qualified or any other schema
DSpace metadata: a different point of view
– When allowed by the configuration any metadata can hold a
reference to other object (authority)
.. the technical agnostic one…
– Any metadata is characterized by a schema, element and
qualifier but…
schema = bibliographic information, administrative metadata, etc.;
element = relation with people;
qualifier = the type of relationship (author, editor, etc.)
– Which metadata are possible, which repeatable and which
mandatory is defined by configuration
CRIS entities management
• Dedicated administrative UI for data model definition
• Detail page for any entity organized in Tab and Box
themed with JQuery UI
• Faceted Searches & Customizable Browse indexes
• Network visualizations and analysis
Advantages for an Institutional Repository
• CRIS entities as authority for Item metadata values
• DSpace Items can be linked and displayed in the detail
page of any CRIS entity
• Ability to display selected publications (or any other
related entity) in the researcher profile
• Produce a list of selected publications (or any other
related entity
– Page vists, CRIS entity detail page
– Global & Top related CERIF Entity views & downloads referencing
the CRIS entity (ie, top projects for each researcher, top researcher
for each OrgUnit, etc.)
– Global & Top publication item views & downloads referencing the
CRIS entity
– email and RSS alerts
Advantages for an Institutional Repository
• Usage Statistics
• Article level metrics for PubMed (extensible)
– Cited-by count in the item page
– Number of articles for researcher
– Total citations for each researcher (only items in local
DSpace database will be counted)
New features_1
Two new standard home page components:
– Big infographics summarizing the CRIS
dimensions
– Global search (inspired from Narcis.nl) google
like with clustering and highlighting
16 16
www.cineca.it | IRIS & DSpace-CRIS Updates | May, 2015 | euroCRIS Membership Meeting
17
18
19
20
This is the
clustering level
Contextual
facets
21
The new features
Improved submission & ORCID integration
– Creation on-the-fly, of related objects
www.cineca.it | IRIS & DSpace-CRIS Updates | May, 2015 | euroCRIS Membership Meeting
23
The new features
Ability to create text fields that are resolved
automatically as a hyperlink (i.e. ORCID,
ResearcherID, etc.)
24 24
Data load
• We are implementing a generic tool to load,
update and delete data in DSpace-CRIS, using
spreedsheets
• This tool has been used to load data in the
euroCRIS repository
1. Using ORCID API,
2. grabbed ORCID record metadata,
3. updated Researcher Page
26
DSpace-CRIS & ORCID: next steps
– Research’s bibliographic data
(including the possibility to activate synchronisation for single field)
– Publications
(with the possibility to chose the publications to sync)
– Projects
(with the possibility to chose which projects to sync)
PUSH local
information to
ORCID
PULL extended
information
from ORCID
Ask user to import records retrieved from her
ORCID profile into DSpace-CRIS
Or
Automatically (if notification and preferences
allowed in DSpace-CRIS) when ORCID premium
membership is available
Next milestones
• This month (June 2015): PUSH services to
ORCID, Researcher Page  ORCID record
– Local identifier
– Biographic information
– Publications
– Projects
• September 2015: PULL services from ORCID
– Capture publications & projects
– Retrieve biographic information
29
Courtesy of Joao Moreira
http://www.slideshare.net/DuraSpace/32415-slides-new-possibilities-developments-with-dspace-and-orcid
Now: 5/7
September: 7/7
March 24, 2015
Thanks for your attention!
Let’s us to play a bit with a dspace-cris installation
Andrea Bollini
Head of Open Source and Open Standards Strategy
a.bollini@cineca.it
• Data model manipulation
• Linkage between CRIS Entities
• Linkage between Items and CRIS Entities
• ORCID integration
• Components
– To show reverse link in the target entity
– For statistics
• Network visualization
Demonstration

DSpace-CRIS Workshop OR2015: Slides

  • 1.
    DSpace-CRIS Workshop Open Repository2015, Indianapolis Andrea Bollini Head of Open Source and Open Standards Strategy
  • 2.
    • Who weare... • The DSpace-CRIS project history • The DSpace-CRIS aims to provide a rich data model • New features • ORCID integration, current status & next steps Agenda
  • 3.
    ⁄ Employees: >900 ⁄ Offices: Bologna, Milan & Rome ⁄ Turnover: >100M Euro ⁄ Interuniversity Consortium ⁄ Not for Profit ⁄ Founded in 1969 ⁄ Headquartered in Bologna ⁄ 75 Members ⁄ 70 Universities ⁄ 4 Research Institutes ⁄ Ministry of Education www.cineca.itWho we are...
  • 4.
    EUNIS Board Member |Interoperability TF | Business Intelligence TF | BencHEIT TF euroCRIS Board Member | DRIS/Best Practice TG | Architecture TG | CERIF TG CHEITA Leading Member | Benchmarking IT Group | Research Group Groningen Declaration Founder Member DSpace Steering Committee | 2 Committers | Registered Service Provider | DSpace-CRIS EAIE Student Mobility Standard: RS3G OpenAIRE National Open Access Desk Pentaho Member of the Advisory Board Institutional Technological International Activities in Tertiary Edu. www.cineca.it
  • 5.
    Workflows mapping business processes SystemIntegration (legacy, national, …) Reports & Analytics WHAT IRIS ADDS Cineca’s perspective Open Source DSpace extended to all Research entities To better support: People, Organization, Projects, etc. Interactions Reputation Dissemination Institutional Repository www.cineca.it
  • 6.
    The DSpace-CRIS projecthistory Partnership with Hong Kong University2009 11-2012 06-2013 08-2013 09-2013 01-2014 06-2014 Release the whole project as open source (DSpace 1.8.2) Second release DSpace 1.8.2, bug fix only Third release DSpace 1.8.3, security fix First release for DSpace 3.2  DSpace-CRIS 3.2.0 Technical documentation released DSpace 4.1  DSpace-CRIS 4.1.0 Second release DSpace 4.2 Third release DSpace 4.3 First release DSpace 5.2  DSpace-CRIS 5.2.0 09-2014 05-2015 05-2015
  • 7.
    The DSpace-CRIS projecthistory 2009 11-2012 06-2013 08-2013 09-2013 06-2014 05-2015 01-2014
  • 8.
    Data model Standard DSpace--> publications & dataset APPROACH DYNAMIC FLEXIBLE NOT HARDCODED UIs FEW PREDEFINED ATTRIBUTES ENTITIES FEW PREDEFINED DSpace-CRIS  other CERIF entites: / ResearcherPages / Projects / OrgUnits / 2nd Level Dynamic Objects DEFINE ANY ASPECTS
  • 9.
    Main DSpace-CRIS Entities –a single entity specialized by a profile (type) ie Journal, Prize, Event, etc.; – Every profile has it own set of properties and nested objects independently defined – managing people; – structural relations with System Users and pre- configured properties to manage the name variants; – This is the only entity that can be managed out-of-box by non-system administrators – defined as separated entities; – Easy to develop future extensions, specializations, specific workflows and editing rules ResearcherPages Projects and OrgUnits Second level Dynamic Objects
  • 10.
    Defining a DSpace-CRISEntity • UUID • Public flag • Dynamic Properties: every property belongs to a PropertyDefinition • System administrator can decide: – Repeatable – Mandatory – Storage data type: String, int, double, date, html link, link to other entities – Start / End date – Scope: link to a Semantic/Classification Object (itself a DynamicObject) • Nested object: Repeatable, mandatory, list of properties
  • 11.
    A note aboutthe DSpace Metadata model DSpace is known to support out-of-box, the Dublin Core Initiative Work has been done to update from the LoC profile to a dcterms schema DSpace metadata model is not bound to DC / DC qualified or any other schema
  • 12.
    DSpace metadata: adifferent point of view – When allowed by the configuration any metadata can hold a reference to other object (authority) .. the technical agnostic one… – Any metadata is characterized by a schema, element and qualifier but… schema = bibliographic information, administrative metadata, etc.; element = relation with people; qualifier = the type of relationship (author, editor, etc.) – Which metadata are possible, which repeatable and which mandatory is defined by configuration
  • 13.
    CRIS entities management •Dedicated administrative UI for data model definition • Detail page for any entity organized in Tab and Box themed with JQuery UI • Faceted Searches & Customizable Browse indexes • Network visualizations and analysis
  • 14.
    Advantages for anInstitutional Repository • CRIS entities as authority for Item metadata values • DSpace Items can be linked and displayed in the detail page of any CRIS entity • Ability to display selected publications (or any other related entity) in the researcher profile • Produce a list of selected publications (or any other related entity
  • 15.
    – Page vists,CRIS entity detail page – Global & Top related CERIF Entity views & downloads referencing the CRIS entity (ie, top projects for each researcher, top researcher for each OrgUnit, etc.) – Global & Top publication item views & downloads referencing the CRIS entity – email and RSS alerts Advantages for an Institutional Repository • Usage Statistics • Article level metrics for PubMed (extensible) – Cited-by count in the item page – Number of articles for researcher – Total citations for each researcher (only items in local DSpace database will be counted)
  • 16.
    New features_1 Two newstandard home page components: – Big infographics summarizing the CRIS dimensions – Global search (inspired from Narcis.nl) google like with clustering and highlighting 16 16 www.cineca.it | IRIS & DSpace-CRIS Updates | May, 2015 | euroCRIS Membership Meeting
  • 17.
  • 18.
  • 19.
  • 20.
    20 This is the clusteringlevel Contextual facets
  • 21.
  • 22.
    The new features Improvedsubmission & ORCID integration – Creation on-the-fly, of related objects www.cineca.it | IRIS & DSpace-CRIS Updates | May, 2015 | euroCRIS Membership Meeting
  • 23.
  • 24.
    The new features Abilityto create text fields that are resolved automatically as a hyperlink (i.e. ORCID, ResearcherID, etc.) 24 24
  • 25.
    Data load • Weare implementing a generic tool to load, update and delete data in DSpace-CRIS, using spreedsheets • This tool has been used to load data in the euroCRIS repository 1. Using ORCID API, 2. grabbed ORCID record metadata, 3. updated Researcher Page
  • 26.
  • 27.
    DSpace-CRIS & ORCID:next steps – Research’s bibliographic data (including the possibility to activate synchronisation for single field) – Publications (with the possibility to chose the publications to sync) – Projects (with the possibility to chose which projects to sync) PUSH local information to ORCID PULL extended information from ORCID Ask user to import records retrieved from her ORCID profile into DSpace-CRIS Or Automatically (if notification and preferences allowed in DSpace-CRIS) when ORCID premium membership is available
  • 28.
    Next milestones • Thismonth (June 2015): PUSH services to ORCID, Researcher Page  ORCID record – Local identifier – Biographic information – Publications – Projects • September 2015: PULL services from ORCID – Capture publications & projects – Retrieve biographic information
  • 29.
    29 Courtesy of JoaoMoreira http://www.slideshare.net/DuraSpace/32415-slides-new-possibilities-developments-with-dspace-and-orcid Now: 5/7 September: 7/7 March 24, 2015
  • 30.
    Thanks for yourattention! Let’s us to play a bit with a dspace-cris installation Andrea Bollini Head of Open Source and Open Standards Strategy [email protected]
  • 31.
    • Data modelmanipulation • Linkage between CRIS Entities • Linkage between Items and CRIS Entities • ORCID integration • Components – To show reverse link in the target entity – For statistics • Network visualization Demonstration