Dataverse: Helping Researchers Publish Their Data Through Automation
Eleni Castro, Research Coordinator
IQSS, Harvard University
IDCC 2016 - Feb 24, 2016
@dataverseorg Dataverse.org
Helping Researchers Share & Archive Data At Their Point of Need
catalog.archives.gov/id/554290
Our Quest For Interoperability and Automation
●  OAI-PMH for harvesting metadata from Dataverse
●  SWORD API: depositing metadata + data from a SWORD client into Dataverse
●  Search API: searching dataverses, datasets and files within Dataverse
●  Data Access API: downloading files from datasets found in Dataverse
●  Native API: for performing GUI and super-user functionality programmatically via REST
In 2016: adding meta-tags and schema.org metadata for datasets
More info at: http://guides.dataverse.org/en/latest/api/index.html
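The APIs listed on this slide are plain HTTP endpoints, so a client can be sketched in a few lines. The following is a minimal sketch, not official client code: it only builds request URLs (no network calls) for the Search API, the Data Access API, and the OAI-PMH endpoint. The server name demo.dataverse.example and the function names are hypothetical, and the paths assume the layout described in the API guide linked above; check them against the version your installation runs.

```python
from urllib.parse import urlencode, urljoin

# Hypothetical installation URL (an assumption); replace with a real server.
BASE_URL = "https://demo.dataverse.example/"


def search_url(query, item_type="dataset", base=BASE_URL):
    """Build a Search API request URL for dataverses, datasets, or files."""
    params = urlencode({"q": query, "type": item_type})
    return urljoin(base, "api/search") + "?" + params


def datafile_url(file_id, base=BASE_URL):
    """Build a Data Access API URL that downloads one file by its numeric id."""
    return urljoin(base, f"api/access/datafile/{file_id}")


def oai_list_records_url(metadata_prefix="oai_dc", base=BASE_URL):
    """Build an OAI-PMH ListRecords URL for harvesting metadata."""
    params = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return urljoin(base, "oai") + "?" + params


if __name__ == "__main__":
    print(search_url("climate change"))
    print(datafile_url(42))
    print(oai_list_records_url())
```

Each URL can be passed to any HTTP client. Depositing via SWORD and most Native API calls also require an API token, which this sketch leaves out.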
Research Life Cycle Workflow
Modelled on the UCI Libraries diagram: http://previous.lib.uci.edu/dss/images/lifecycle.jpg
1. Planning Phase
Future Integration with DMPTool
See: http://blog.dmptool.org/2016/01/22/dmptool-maintenance-and-a-roadmap
2. Implementation Phase
OSF Dataverse Add-On to archive data via SWORD API
See: https://osf.io/getting-started/#dataverse
R package to deposit data & search Dataverse
Thomas Leeper’s code: https://github.com/rOpenSci/dvn
Data Visualizations from Dataverse...
Data Visualizations from Dataverse via WorldMap
http://worldmap.harvard.edu
Data Visualizations and Analysis with ClioInfra
https://www.clio-infra.eu/
via Data Access API + Native API
3. Publishing Phase
Integrate Journal and Data Publishing Workflows
Paper: http://journal.code4lib.org/articles/10989
Future: Integrate data quality review + verification
http://ajps.org/2015/03/26/the-ajps-replication-policy-innovations-and-revisions/
Future: Dataverse / ORCID Integration
See: Requiring ORCID in Publication Workflows: Open Letter
1. Allow users to authenticate using their ORCID iD.
2. Automatically insert the user's ORCID iD into the dataset, and search ORCID to insert iDs for co-authors.
3. Add to and update ORCID records (subject to permissions granted by iD holders).
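One small part of step 2 is checking that an ORCID iD is well formed before it is written into dataset metadata. The following is a minimal sketch, not Dataverse code: it assumes the standard ORCID format (four hyphen-separated groups of four characters) and its ISO 7064 MOD 11-2 check digit. The function names are hypothetical.

```python
import re

# Four groups of four digits; the final character may be the check value "X".
ORCID_PATTERN = re.compile(r"\d{4}-\d{4}-\d{4}-\d{3}[\dX]")


def orcid_check_digit(base_digits):
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid):
    """Return True if the iD has the expected shape and a correct check digit."""
    if not ORCID_PATTERN.fullmatch(orcid):
        return False
    digits = orcid.replace("-", "")
    return orcid_check_digit(digits[:15]) == digits[15]


if __name__ == "__main__":
    # 0000-0002-1825-0097 is the sample iD used in ORCID's own documentation.
    print(is_valid_orcid("0000-0002-1825-0097"))
```

A check like this only catches typing errors. Confirming that an iD belongs to a given author still requires authentication with ORCID, as in step 1.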
4. Discovery & Impact Phase
Expand Dataset Discovery via SHARE Notify
http://www.share-research.org/projects/share-notify/
Send Dataset Metadata to DataCite
Coming soon in Dataverse
DataCite Metadata 3.0
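To make concrete what sending dataset metadata to DataCite involves, here is a minimal sketch that builds a record with the five mandatory properties of the DataCite 3.0 schema (identifier, creators, titles, publisher, publication year). It is an illustration under stated assumptions, not the Dataverse export code: the function name and all sample values are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace of the DataCite Metadata Schema, kernel 3.
DATACITE_NS = "http://datacite.org/schema/kernel-3"


def build_datacite_xml(doi, creators, title, publisher, year):
    """Return a DataCite-style XML string holding only the mandatory properties."""
    ET.register_namespace("", DATACITE_NS)

    def tag(name):
        return f"{{{DATACITE_NS}}}{name}"

    resource = ET.Element(tag("resource"))
    identifier = ET.SubElement(resource, tag("identifier"), {"identifierType": "DOI"})
    identifier.text = doi
    creators_el = ET.SubElement(resource, tag("creators"))
    for name in creators:
        creator = ET.SubElement(creators_el, tag("creator"))
        ET.SubElement(creator, tag("creatorName")).text = name
    titles_el = ET.SubElement(resource, tag("titles"))
    ET.SubElement(titles_el, tag("title")).text = title
    ET.SubElement(resource, tag("publisher")).text = publisher
    ET.SubElement(resource, tag("publicationYear")).text = str(year)
    return ET.tostring(resource, encoding="unicode")


if __name__ == "__main__":
    # All values below are made-up placeholders, not a real dataset.
    print(build_datacite_xml("10.5072/FK2/EXAMPLE", ["Doe, Jane"],
                             "Example Dataset", "Example Dataverse", 2016))
```

A production export would also validate the record against the schema and add optional properties such as subjects and descriptions.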
Future: Measure Dataset Impact with Altmetrics
Example from Univ of Southampton
Example from Univ of Zurich
See Repository Badges documentation:
https://www.altmetric.com/products/free-tools/institutional-repository-badges/
5. Preservation Phase
Scholars Portal Dataverse Integration With Archivematica
Image source & read more: https://wiki.archivematica.org/Dataverse
Helping Future Researchers Re-Use Data
Thank You!
Questions?
ecastro@fas.harvard.edu
