Publication of raw and curated NMR spectroscopic data for organic molecules
Christoph Steinbeck
https://slideshare.net/csteinbeck
Nuclear Magnetic Resonance (NMR)
in Synthetic Organic Chemistry
Source: https://www.beilstein-journals.org/bjoc/articles/14/188
Provides marginal evidence in the experimental section
that the reported structure is what we say it is.
Nuclear Magnetic Resonance (NMR)
in Natural Products Chemistry
Hoang, V. D. et al., Phytochemistry 59, 325–329 (2002).
Chemical Class
Chemical Name
Biol. Species
Biol.Activity
Structure Diagram
Phys.chem. data
Atom Numbers
Spectral data
Nuclear Magnetic Resonance (NMR)
in Natural Products Chemistry
Liu, F. et al., J. Nat. Prod., doi:10.1021/acs.jnatprod.7b01074
Image (!) from the supplemental information
of J. Nat. Prod., 2018, 81 (7), pp 1553–1560
Thanks to https://twitter.com/srp/status/1030192949802487809
Research Data Sharing is
becoming the norm …
• … rather than the exception
• but some disciplines lag a little
further behind than others …
© https://matheplanet.com
Why do we archive, curate and
disseminate (raw) research data?
•Data re-use (including data use, reanalysis
and repurposing)
•Reproduction of scientific results
• For this we actually need to share the computational
workflow for processing the data as well (see
http://www.researchobject.org/)
•Validation of methods
Why people do not
share data
• Fear of not being able to generate enough
publications from their data
• Fear of being scooped by other researchers
(“Research Parasites”)
• Fear that one’s own study is not replicable
• Fear of exposing badly managed, flawed or
inconsistent data
• Patient confidentiality
• Technical reasons
Smith, R. & Roberts, I. F1000Research 5, 781 (2016).
What is NMReData?
Machine-readable representation of chemical structure and assigned NMR data,
based on an industry-standard format and linked to the raw NMR data
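As a rough illustration of what "machine-readable" can mean here, the sketch below reads NMR tags from an SD-file-style record. It assumes the industry-standard format is the SDF (structure-data file) format, with NMR information stored in tags such as NMREDATA_VERSION and NMREDATA_ASSIGNMENT. The sample record is simplified and its values are made up, and the helper names parse_sd_tags and parse_assignments are hypothetical, not part of any NMReData tool.

```python
import re

# Illustrative, simplified record: the connection table is omitted
# and all values are made up for this sketch.
SAMPLE_RECORD = """\
M  END
>  <NMREDATA_VERSION>
1.1\\

>  <NMREDATA_ASSIGNMENT>
a, 1.20, 1\\
b, 3.60, 2, 3\\

$$$$
"""

# An SD data item starts with a header line such as ">  <TAG_NAME>".
TAG_LINE = re.compile(r"^>\s*<([^>]+)>")


def parse_sd_tags(text):
    """Collect the value lines of every SD tag in a single record."""
    tags = {}
    current = None
    for line in text.splitlines():
        match = TAG_LINE.match(line)
        if match:
            current = match.group(1)
            tags[current] = []
        elif not line.strip() or line.startswith("$$$$"):
            # A blank line ends a data item, "$$$$" ends the record.
            current = None
        elif current is not None:
            # Drop the trailing backslash used as a line terminator.
            tags[current].append(line.rstrip("\\").strip())
    return tags


def parse_assignments(lines):
    """Turn 'label, shift, atom[, atom...]' lines into tuples."""
    result = []
    for line in lines:
        parts = [part.strip() for part in line.split(",")]
        label, shift = parts[0], float(parts[1])
        atoms = [int(index) for index in parts[2:]]
        result.append((label, shift, atoms))
    return result


if __name__ == "__main__":
    tags = parse_sd_tags(SAMPLE_RECORD)
    for label, shift, atoms in parse_assignments(tags["NMREDATA_ASSIGNMENT"]):
        print(label, shift, atoms)
```

Because the assignment lives next to the structure in one record, a database or referee tool can read both without retyping anything from a PDF.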
Why NMReData?
• Improved quality of the NMR data for the community
• Straightforward inclusion of NMR data in reports and articles
• Simplified referee work
• Compatibility with electronic storage in databases
• Easier comparison of datasets
• Improved searchability of NMR data
• Automatic validation by computational means (e.g. CASE)
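One way to picture automatic validation: once assigned shifts are machine-readable, they can be compared against predicted shifts for the proposed structure. The sketch below is a minimal illustration only, not a CASE system. The function names, the 5 ppm tolerance and the example shift values are assumptions made for this sketch, and the predicted values would have to come from some external prediction tool.

```python
from statistics import mean


def shift_deviations(experimental, predicted):
    """Absolute deviation (ppm) per atom index present in both inputs."""
    shared = experimental.keys() & predicted.keys()
    return {atom: abs(experimental[atom] - predicted[atom]) for atom in sorted(shared)}


def validate_assignment(experimental, predicted, tolerance_ppm=5.0):
    """Return the mean absolute error and the atoms exceeding the tolerance.

    tolerance_ppm is an arbitrary illustrative threshold, not a recommended value.
    """
    deviations = shift_deviations(experimental, predicted)
    if not deviations:
        raise ValueError("no atoms in common between the two shift lists")
    outliers = [atom for atom, dev in deviations.items() if dev > tolerance_ppm]
    return mean(deviations.values()), outliers


if __name__ == "__main__":
    # Made-up 13C shifts keyed by atom number, for illustration only.
    experimental = {1: 14.1, 2: 22.7, 3: 170.5}
    predicted = {1: 14.0, 2: 23.0, 3: 128.0}
    mae, outliers = validate_assignment(experimental, predicted)
    print(f"MAE = {mae:.2f} ppm, atoms to re-check: {outliers}")
```

A large deviation on a single atom does not prove the structure wrong, but it tells a referee where to look first.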
How is NMReData supported?
• Support by major NMR software companies
• Mag. Res. Chem. will require submission of NMReData
• Leads to raw NMR data becoming regularly available
• More journals needed - how about Journal of Natural Products?
• I/O in NMRShiftDB2 available
• Broad support by software ecosystem crucial for enduring success
• For updates see http://www.nmredata.org/wiki/Compatible_software
McAlpine, J. B. et al., Nat. Prod. Rep. 33, 1028 (2018).
Why do we need a raw
NMR Archive?
The Raw NMR Archive
(…to be built)
• There is momentum for building a raw NMR data archive
• Requirements:
• Stable funding and operation (EBI, NCBI, ELIXIR node, etc.)
• Community-agreed Minimum Information Standard
• see https://fairsharing.org/ for examples, could be based on MIABE and MIBiG
• Framework for handling submissions of raw data and meta-data
• Ability to process, visualise and search 1D and 2D NMR data
• Total openness and support for the FAIR principles
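To make the idea of a minimum information standard concrete, a submission framework could reject records that lack required meta-data. The sketch below assumes a purely hypothetical field list; a real standard would have to be agreed by the community, and none of these field names are taken from MIABE, MIBiG or any existing archive.

```python
# Hypothetical required fields, chosen only to illustrate the check.
REQUIRED_FIELDS = (
    "compound_name",
    "structure",
    "solvent",
    "field_strength_mhz",
    "nucleus",
    "raw_data_file",
)


def missing_fields(record):
    """Return the required fields that are absent or empty in a submission."""
    missing = []
    for field in REQUIRED_FIELDS:
        value = record.get(field)
        if value is None or value == "":
            missing.append(field)
    return missing


if __name__ == "__main__":
    # Made-up submission with two gaps.
    submission = {
        "compound_name": "example compound",
        "structure": "CCO",
        "solvent": "CDCl3",
        "nucleus": "13C",
        "raw_data_file": "",
    }
    print("Missing:", missing_fields(submission))
```

A check of this kind runs at submission time, so gaps are caught while the author still has the raw data at hand.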
The Archive Framework
Lives at https://www.ebi.ac.uk/metabolights
or https://www.metabolights.org
I am not suggesting that this is it.
I am just saying:
Here is an open source framework
with a scent of chemistry
that does it all and it does it well
Data Submission to MetaboLights
Controlled Vocabularies
Ontologies
Minimum Information Standards
Sansone,… Steinbeck et al. (2012) Toward interoperable bioscience data. Nature Genetics, 44, 121–126.
ISA-Creator
Controlled Vocabularies
Ontologies
Takeaways
• Global momentum for sharing raw and curated NMR data
• Open raw meta-data and data formats exist
• Open frameworks for meta-data handling exist
• Technical, chemistry-enabled archive frameworks available
• Editors’ buy-in indispensable
• It could start right now (Figshare->DOI->Manuscript)
Stop clinging to your precious …
• We are still seeing a multitude of new, small
databases popping up
• All locked behind registration walls and web interfaces that allow access only to
individual datasets
My precious
… and share your chemistry data!

