Melanie Imming
EU Projects manager, LIBER
Open Science, Open Data:
towards a new transparent and
reproducible ecosystem
LIBER Europe
•Research Libraries
•Founded in 1971
•More than 400 national, university and other libraries
from over 40 countries
LIBER Conference
LIBER EU Projects
LIBER Europe
“LIBER is Re-inventing
the Library
for the Future”
LIBER Europe
A central part of LIBER’s mission is to provide an
information infrastructure that enables research in
LIBER institutions to be world class.
For this infrastructure to thrive, it must be part
of an ecosystem that can accommodate and
nurture the changing nature of research and
innovation in the digital age.
Open Science
Open Science Definition
“The conduction of science in a way that
others can collaborate and contribute,
where research data, lab notes and other
research processes are freely available,
with terms that allow reuse, redistribution
and reproduction of the research”
https://www.fosteropenscience.eu/foster-taxonomy/open-science-definition
CC BY Openaccessbutton.org
LIBER and Open Science
So, we need alternative mechanisms for the recognition of
excellence in Open Science, e.g. ranking systems, to
Open up Science.
•From Publish or Perish to Open Science
•Scientific tools need to be cited and, to make scientific
experiments reproducible, there need to be incentives to
create open and sustainable software
Science Code Manifesto
•Code
Source code written specifically to process data for a
published paper must be available to the reviewers and
readers of the paper.
•Copyright
The copyright ownership and license of any released
source code must be clearly stated.
•Citation
Researchers who use or adapt science source code in their
research must credit the code’s creators in resulting
publications.
•Credit
Software contributions must be included in systems of
scientific assessment, credit, and recognition.
•Curation
Source code must remain available, linked to related
materials, for the useful lifetime of the publication.
LIBER and Open Science
Curation
‘Making sure data is stored in a controlled
way and can be (re)used today and in the
future is an important element in Open
Science’.
LIBER and Open Science
Standardisation of file formats will ensure (re-)usability
today and in the future, as it:
• enables processing and preservation of data in a
controlled way
• ensures outputs that are really open and
accessible in the long term
• improves interoperability of new tools and services
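As a rough illustration of the point above, a repository could screen deposits against a list of preferred open formats. The sketch below is only an assumption-laden example: the `OPEN_FORMATS` allowlist and the `split_by_format` helper are hypothetical, and checking a file extension is far weaker than real format conformance checking.

```python
from pathlib import Path

# Hypothetical allowlist: which formats count as "open" is a policy
# decision for each institution, not something stated in the slides.
OPEN_FORMATS = {".csv", ".txt", ".xml", ".json", ".tiff", ".png"}


def split_by_format(filenames, open_formats=OPEN_FORMATS):
    """Return (open_files, other_files), judged by file extension only."""
    open_files, other_files = [], []
    for name in filenames:
        suffix = Path(name).suffix.lower()  # case-insensitive comparison
        if suffix in open_formats:
            open_files.append(name)
        else:
            other_files.append(name)
    return open_files, other_files


if __name__ == "__main__":
    ok, review = split_by_format(["survey.csv", "scan.TIFF", "results.xlsx"])
    print("open formats:", ok)
    print("needs review:", review)
```

Files in the second list are not necessarily unusable; they are simply candidates for conversion or closer inspection.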
Workshop Nov 2015:
“Text and Data Mining in Europe:
Challenges and Action”
Participants: content providers (publishers,
data centers, museums and libraries)
Technical challenges identified:
•Quality of datasets
•Lack of a secure infrastructure
Solutions:
•Develop and use open standards
•Develop templates for metadata and content
•Allow for peer review of data quality and develop
validation tools
•Appraise good quality data
•Organisations should invest resources to
improve the quality of their data
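To make "templates for metadata" and "validation tools" more concrete, here is a minimal sketch of checking a dataset record against a template of required fields. The field names in `REQUIRED_FIELDS` and the `validate_record` function are illustrative assumptions, not a schema proposed at the workshop.

```python
# Hypothetical metadata template: the field names are illustrative only.
REQUIRED_FIELDS = ("title", "creator", "identifier", "license", "format")


def validate_record(record, required=REQUIRED_FIELDS):
    """Return a list of problems; an empty list means the record passes."""
    problems = []
    for field in required:
        value = record.get(field)
        # Treat absent, None and whitespace-only values as missing.
        if value is None or not str(value).strip():
            problems.append(f"missing or empty field: {field}")
    return problems


if __name__ == "__main__":
    record = {"title": "Survey 2015", "creator": "Example Lab", "license": "CC0"}
    for problem in validate_record(record):
        print(problem)
```

A real validation tool would also check the content of each field (for example, that the identifier resolves), which this sketch does not attempt.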
Workshop Feb 2016:
European Open Science Cloud
Opening paragraph of The European Open Science Cloud for Research Rome
Workshop Report:
‘The creation of a trusted environment for hosting and
processing research data (...) will help overcome many key
challenges currently facing scientific disciplines. These
challenges include a huge lack of awareness of the value of
data and the incentives for data sharing, a continued lack of
and urgent need for common standards to ensure
interoperability of data…’
Netherlands EU Presidency Open Science Conference
Amsterdam, 4/5 April 2016
Libraries enabling Open Science
Data issue: opportunities for libraries and data centres
Availability
- Lower barriers to researchers to make their data available
- Integrate data sets into retrieval services
Findability
- Support of persistent identifiers
- Engage in developing common meta description schemas
and common citation practices
- Promote use of common standards and tools among
researchers
Interpretability
- Support crosslinks between publications and datasets
- Provide and help researchers understand meta-descriptions
of datasets
- Establish and maintain a knowledge base about data and
their context
Libraries enabling Open Science
Data issue: opportunities for libraries and data centres
Re-usability
- Curate and preserve datasets
- Archive software needed for re-analysis of data
- Be transparent about conditions under which data sets can be
re-used (expert knowledge needed, software needed)
Citability
- Engage in establishing uniform data citation standards
- Support and promote persistent identifiers
Curation/Preservation
- Transparency about curation of submitted data
- Promote good data management practice
- Collaborate with data creators
- Instruct researchers on discipline specific best practices in
data creation (preservation formats, documentation of
experiment,…)
Libraries enabling Open Science
Focus on Research Data Management:
• Growing variety of data types and volume
• Curation of data from the planning stage of research
projects
Libraries enabling Open Science
Awareness, trust and community building
•Institutions - develop policies and roadmaps
•Researchers - highlight benefits of open science
•(Other) Stakeholders at institutional level and
internationally
Libraries enabling Open Science
•Stay in control!
•Unite!
•Be active in projects like Preforma
•Advocate & Engage
Open Science
What can you do?
•Release data under CC0
•Release media components and arrangements of data
under CC BY
•Work from what is already working
•Use what is really open: freely available, can be
freely adopted, implemented and extended (no
license fees)
•Sign The Hague Declaration!
Elsevier TDM Policy
• Access through API only
• Text only: no images or tables
• Researchers must register details
• Click-through licence
• Terms can change any time
• Reproducibility of results
Thank you!
• The Hague Declaration: http://thehaguedeclaration.com
• LERU Roadmap for Research Data: http://www.leru.org/index.php/public/news/press-release-leru-roadmap-for-research-data
• Science Code Manifesto: http://sciencecodemanifesto.org
• Research Data Alliance: https://rd-alliance.org
• LIBER 10 Recommendations on Getting Started in RDM: http://libereurope.eu/wp-content/uploads/The%20research%20data%20group%202012%20v7%20final.pdf
• OpenAire: https://www.openaire.eu
• San Francisco Declaration: http://www.ascb.org/dora-old/files/SFDeclarationFINAL.pdf

Editor's Notes

  • #3 First, a few words about LIBER. LIBER is the Association of European Research Libraries, the main network for research libraries in Europe. Founded in 1971, the association now includes over 400 national, university and other special libraries.
  • #4 We bring our members together through our annual conference: this year in Helsinki, next year in Patras.
  • #5 We also bring our members together through participation in EU projects, in which about 50 of our members take part. The LIBER office is involved in nine EU projects, all to do with addressing barriers on the path towards Open Science. We have projects on Text and Data Mining, Open Access training, Open Access policies, Research Data Management, etcetera.
  • #6 Steering committees, working groups, three strategic directions. What is happening in Research Libraries nowadays is not "daily routine". It’s about a revolution. We have to re-invent the library for the future!
  • #7 As a library membership organization, LIBER works on addressing Open Science barriers.
  • #9 What is the Open Science ecosystem? First of all: Open Science is more than just open access to publications. It also means open access to data, APIs, licenses, policies, etcetera.
  • #10 It should be an ecosystem of sharing and collaboration, of reuse and redistribution. Open Science is difficult to define, but here is a definition from the FOSTER project, a project on open science training. Shared as early as possible. Open unless…
  • #11 In this ecosystem, the researcher is key. Researchers should contribute and benefit at the same time. Incentives are needed for researchers to share and (re)use their data. In the end Open Science will change work habits and business models.
  • #12 What we need is institutional recognition of alternative metrics. The publishing of software should be rewarded, not the publication of the article describing the software research project. Why would researchers put effort into, for instance, sustainable and open software (the element of Open Science connected to today's workshop) if this is not rewarded at all?
  • #13 This is not new. An example: the Science Code Manifesto has been out for a few years now and has 1140 endorsements.
  • #19 This Science Code Manifesto speaks about curation of open source software, but curation is needed in every area of the open science ecosystem.
  • #21 A few examples of my own: some Open Science events we co-organised as LIBER where standardisation of file formats was mentioned as part of Open Science. In November, I co-hosted a workshop aimed at content providers (publishers, data centers, museums and libraries) that are open to the use of Text and Data Mining on their data.
  • #23 Another workshop co-organised by LIBER: The European Open Science Cloud workshop. The workshop report begins with this statement:
  • #24 And slightly different, but just as interesting for this crowd: just this week, we co-organised four Open Science cafes where we discussed all kinds of aspects of open science in an open discussion. One of the discussion statements was: “libraries should spend money on preserving software in order to keep data available for re-use”.
  • #25 So, how do we see libraries in this? How can libraries help to enable Open Science? Open Data: there are lots of opportunities to address issues with open data. Libraries have always played an important role in the curation of data.
  • #26 And, of course, all of this should be as open as possible.
  • #27 Besides projects like Preforma, libraries can be active in Research Data Management. Slide 26 does not mention Digital Preservation yet: it is part of research data management from the start of projects onwards.
  • #28 We see not only technical challenges on the road towards open science, but also social challenges: awareness, trust and community building are needed in order to ensure uptake of, e.g., standards.
  • #30 People say that we need registries of open source software, e.g. for TDM, but what about GitHub? Work from what is already working. Focus on really open standards. Definition in the line of Open Science: freely available, can be freely adopted, implemented and extended (no license fees).
  • #31 As a library membership organization, LIBER brought global experts together to draft a collective statement, The Hague Declaration on knowledge discovery in the digital age, to show policy makers the strength of support in the research community for better access to facts, data and ideas. The Hague Declaration has over 700 signatories - and counting.