SlideShare a Scribd company logo
Alejandra González-Beltrán, Ph.D
University of Oxford e-Research Centre, UK
From experimental planning to data publication:
the ISA infrastructure
and case studies in toxicology
alejandra.gonzalezbeltran@oerc.ox.ac.uk
OpenTox Europe - Mainz, Germany - 30th September, 2013
1
2
The data workflow
Data
Scientist
Visualization
Analysis
Planning
Data
Management
Data CollectionPublication
Use existing
data
Perform new
experiment
3
The data workflow
Data
Scientist
Visualization
Analysis
Planning
Data
Management
Data CollectionPublication
Use existing
data
Perform new
experiment
metadata
metadata
metadata
metadata
metadata
metadata
metadata tracking
infrastructure
4
Data
Scientist
Visualization
Analysis
Planning
Data
Management
Data CollectionPublication
Use existing
data
Perform new
experiment
metadata
metadata
metadata
metadata
metadata
metadata
Traceability
Assessment
Accountability
Evidence
Reusability
Reproducibility
Storage
Mining
Provenance
5
sem
antics
structure
6
sem
antics
structure
investigation
study
assay
7
8
infrastructureThe
generic format for experimental
description and data exchange
open source software toolscommunity engagement
OpenTox Europe 2013
OpenTox Europe 2013
11
Run Assays4
SAMPLE1
SAMPLE2
SAMPLE3
SAMPLE4
SAMPLE5
SAMPLE6
SAMPLE7
SAMPLE8
SAMPLE9
SAMPLE10
SAMPLE11
SAMPLE 1
SAMPLE 2
SAMPLE 3
SAMPLE 4
SAMPLE 5
SAMPLE 6
SAMPLE 7
SAMPLE 8
SAMPLE 9
SAMPLE 10
SAMPLE 11
FILE 1
FILE 2
FILE 3
FILE 4
FILE 5
FILE 6
FILE 7
FILE 8
FIL
FIL
FIL
Experiment Design Analysis
Arabidopsis thaliana
Treatment groups
70% 90% 100%
Collect Samples1 2 3 5
6
Parses ISA-Tab datasets into R objects, allowing to update them and save them after
analysis.
Bridges the ISA-Tab metadata to analysis pipelines of specific assay types, by building
objects for use in other R packages downstream: currently considering mass
spectrometry (xmcs package, xcmsSet) and DNA microarray (Biobase package,
ExpressionSet)
Suggests packages in BioConductor that might be relevant for an assay type, according
to the BioCViews annotations.
Gonzalez-Beltran et al. The Risa R/Bioconductor package:
integrative data analysis from experimental metadata and
back again. In press
OpenTox Europe 2013
OpenTox Europe 2013
OpenTox Europe 2013
Data Publication with
• New open-access, online-only publication for
descriptions of scientifically valuable datasets
• Only content type: Data Descriptor, narrative
+ structured parts
• Initially focused on the life, environmental and
biomedical sciences
• Data Descriptor will be complementary to
traditional research journals and data
repositories
• Designed to foster data sharing and reuse, and
ultimately to accelerate scientific discoverywww.nature.com/scientificdata
Data Publication with
http://www.nature.com/scientificdata/
• New open-access, online-only publication for
descriptions of scientifically valuable datasets
• Only content type: Data Descriptor, narrative
+ structured parts
• Initially focused on the life, environmental and
biomedical sciences
• Data Descriptor will be complementary to
traditional research journals and data
repositories
• Designed to foster data sharing and reuse, and
ultimately to accelerate scientific discoverywww.nature.com/scientificdata
Data Publication with
http://www.nature.com/scientificdata/
http://gigasciencejournal.com
1
OpenTox Europe 2013
20
A growing ecosystem of over 30 public and internal resources
using the ISA metadata tracking framework (ISA-Tab and/or
format) to facilitate standards-compliant collection, curation,
management and reuse of investigations in an increasingly diverse set
of life science domains, including:
• stem cell discovery
• system biology
• transcriptomics
• toxicogenomics
• also by communities working to build a library of cellular
signatures
• environmental health
• environmental genomics
• metabolomics
• metagenomics
• nanotechnology
• proteomics
21
Toxicity data
http://xkcd.com/1260/
22
Suter et al 2011. EU Framework 6 Project: Predictive Toxicology (PredTox)—overview and outcome.
Boitier et al 2011.A comparative integrated transcript analysis and functional characterization of differential mechanisms
for induction of liver hypertrophy in the rat
InnoMed PredTox Project
Goal: earlier pre-clinical safety evaluation by combining results from ‘omics
technologies and conventional toxicology methods
23
2-week systemic rat study using male Wistar rats (N=15 per dose group)
14 proprietary drug
candidates from
participating companies
and 2 reference toxic
compounds
24
25
26 http://www.ebi.ac.uk/bioinvindex/study.seam?studyId=BII-S-8
27
Data Infrastructure for Chemical Safety
http://www.dixa-fp7.eu/about
28
Kohonen et al. 2013 The ToxBank Data Warehouse: a research cluster of 7
EU FP7 Health systems toxicology and toxicogenomics projects.
Safety Evaluation Ultimately Replacing Animal Testing-1 (SEURAT-1): looking at improving safety
assessment without the need for animal experiments
ToxBank: cross-cluster infrastructure project
http://toxbank.net
29
https://wiki.nci.nih.gov/display/ICR/ISA-TAB-Nano
Nanotechnology
Informatics Working Group
Thomas et al. 2013 ISA-TAB-Nano: A specification for sharing nanomaterial
research data in spreadsheet-based format
Baker et al. 2013 Standardizing data
ISA-TAB-Nano
Extension of ISA-TAB format to represent
nano-materials, small molecules and
biological specimens along with their assay
characterisation data
30
Data
Scientist
Visualization
Analysis
Planning
Data
Management
Data CollectionPublication
31
Questions?
You can email us...
isatools@googlegroups.com
View our blog
http://isatools.wordpress.com
Follow us onTwitter
@isatools
View our website
http://www.isa-tools.org
View our Git repo & contribute
http://github.com/ISA-tools

More Related Content

PDF
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Alejandra Gonzalez-Beltran
 
PDF
ISMB Workshop 2014
Alejandra Gonzalez-Beltran
 
PDF
Drug Discovery- ELRIG -2012
Alejandra Gonzalez-Beltran
 
PDF
Beyond the PDF 2, 2013
Alejandra Gonzalez-Beltran
 
PDF
BioSharing.org - mapping the landscape of community standards, databases, dat...
Alejandra Gonzalez-Beltran
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Alejandra Gonzalez-Beltran
 
ISMB Workshop 2014
Alejandra Gonzalez-Beltran
 
Drug Discovery- ELRIG -2012
Alejandra Gonzalez-Beltran
 
Beyond the PDF 2, 2013
Alejandra Gonzalez-Beltran
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
Alejandra Gonzalez-Beltran
 

What's hot (20)

PDF
Ontomaton icbo2013-alternative order-t_wv3
Philippe Rocca-Serra
 
PPTX
Aspects of Reproducibility in Earth Science
Raul Palma
 
PPTX
Being Reproducible: SSBSS Summer School 2017
Carole Goble
 
PPTX
ROHub
Raul Palma
 
PPT
DCC Keynote 2007
Carole Goble
 
PPTX
The Research Object Initiative: Frameworks and Use Cases
Carole Goble
 
PPTX
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Carole Goble
 
PPTX
2016 davis-plantbio
c.titus.brown
 
PDF
CV_10/17
Gautam Machiraju
 
PPTX
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 
PDF
Advanced Bioinformatics for Genomics and BioData Driven Research
European Bioinformatics Institute
 
PPTX
RARE and FAIR Science: Reproducibility and Research Objects
Carole Goble
 
PPTX
The Rhetoric of Research Objects
Carole Goble
 
PPTX
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
Carole Goble
 
PPT
The beauty of workflows and models
myGrid team
 
PPT
The Seven Deadly Sins of Bioinformatics
Duncan Hull
 
PPTX
SEEK for Science: A Data and Model Management Platform to support Open and Re...
Carole Goble
 
PPTX
Advances in Scientific Workflow Environments
Carole Goble
 
Ontomaton icbo2013-alternative order-t_wv3
Philippe Rocca-Serra
 
Aspects of Reproducibility in Earth Science
Raul Palma
 
Being Reproducible: SSBSS Summer School 2017
Carole Goble
 
ROHub
Raul Palma
 
DCC Keynote 2007
Carole Goble
 
The Research Object Initiative: Frameworks and Use Cases
Carole Goble
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Carole Goble
 
2016 davis-plantbio
c.titus.brown
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 
Advanced Bioinformatics for Genomics and BioData Driven Research
European Bioinformatics Institute
 
RARE and FAIR Science: Reproducibility and Research Objects
Carole Goble
 
The Rhetoric of Research Objects
Carole Goble
 
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
Carole Goble
 
The beauty of workflows and models
myGrid team
 
The Seven Deadly Sins of Bioinformatics
Duncan Hull
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
Carole Goble
 
Advances in Scientific Workflow Environments
Carole Goble
 
Ad

Similar to OpenTox Europe 2013 (20)

PPT
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
Carole Goble
 
PPTX
Data Integration vs Transparency: Tackling the tension
Paul Groth
 
PDF
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ijbbjournal
 
PDF
Standards and tools for model management in biomedical research
University Medicine Greifswald
 
PPTX
Integrated Analysis of Toxicology Data supported by ToxBank
Barry Hardy
 
PDF
eScience-School-Oct2012-Campinas-Brazil
Susanna-Assunta Sansone
 
PDF
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Alejandra Gonzalez-Beltran
 
PDF
Overall Vision for NRNB: 2015-2020
Alexander Pico
 
PPTX
Transparency in the Data Supply Chain
Paul Groth
 
PPT
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
PPTX
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
Neuro, McGill University
 
PDF
OpenTox - an open community and framework supporting predictive toxicology an...
Barry Hardy
 
PDF
ISA - a short overview - Dec 2013
Susanna-Assunta Sansone
 
PDF
2015 GU-ICBI Poster (third printing)
Michael Atkins
 
PPT
2011-11-28 Open PHACTS at RSC CICAG
open_phacts
 
PPT
The eCrystals Federation
ManjulaPatel
 
PDF
From data to knowledge – the Ondex System for integrating Life Sciences data ...
Catherine Canevet
 
PDF
Seminario en CIFASIS, Rosario, Argentina - Seminar in CIFASIS, Rosario, Argen...
Alejandra Gonzalez-Beltran
 
PDF
Data Mining for Systems Biology Methods and Protocols 1st Edition Koji Tsuda
tmzrhojmjt2906
 
PDF
Model repositories and standard formats for model reusability
University Medicine Greifswald
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
Carole Goble
 
Data Integration vs Transparency: Tackling the tension
Paul Groth
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ijbbjournal
 
Standards and tools for model management in biomedical research
University Medicine Greifswald
 
Integrated Analysis of Toxicology Data supported by ToxBank
Barry Hardy
 
eScience-School-Oct2012-Campinas-Brazil
Susanna-Assunta Sansone
 
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Alejandra Gonzalez-Beltran
 
Overall Vision for NRNB: 2015-2020
Alexander Pico
 
Transparency in the Data Supply Chain
Paul Groth
 
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
Neuro, McGill University
 
OpenTox - an open community and framework supporting predictive toxicology an...
Barry Hardy
 
ISA - a short overview - Dec 2013
Susanna-Assunta Sansone
 
2015 GU-ICBI Poster (third printing)
Michael Atkins
 
2011-11-28 Open PHACTS at RSC CICAG
open_phacts
 
The eCrystals Federation
ManjulaPatel
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
Catherine Canevet
 
Seminario en CIFASIS, Rosario, Argentina - Seminar in CIFASIS, Rosario, Argen...
Alejandra Gonzalez-Beltran
 
Data Mining for Systems Biology Methods and Protocols 1st Edition Koji Tsuda
tmzrhojmjt2906
 
Model repositories and standard formats for model reusability
University Medicine Greifswald
 
Ad

More from Alejandra Gonzalez-Beltran (12)

PDF
The Software Sustainability Institute Fellowship
Alejandra Gonzalez-Beltran
 
PDF
CMSO Minimal reporting requirements
Alejandra Gonzalez-Beltran
 
PDF
The DATS model: datasets descriptions for data discovery in DataMed
Alejandra Gonzalez-Beltran
 
PDF
Datasets with bioschemas
Alejandra Gonzalez-Beltran
 
PDF
Data publication: Discover, Explore, Visualise
Alejandra Gonzalez-Beltran
 
PDF
ISA commons - overview and latest developments
Alejandra Gonzalez-Beltran
 
PDF
Metadata for Interoperable Bioscience
Alejandra Gonzalez-Beltran
 
PDF
From peer-reviewed to peer-reproduced: a role for research objects in scholar...
Alejandra Gonzalez-Beltran
 
PDF
Brazil-UK Frontiers of Engineering - Big data in healthcare session
Alejandra Gonzalez-Beltran
 
PDF
COPO kick-off meeting
Alejandra Gonzalez-Beltran
 
PDF
SELENfest 2012
Alejandra Gonzalez-Beltran
 
The Software Sustainability Institute Fellowship
Alejandra Gonzalez-Beltran
 
CMSO Minimal reporting requirements
Alejandra Gonzalez-Beltran
 
The DATS model: datasets descriptions for data discovery in DataMed
Alejandra Gonzalez-Beltran
 
Datasets with bioschemas
Alejandra Gonzalez-Beltran
 
Data publication: Discover, Explore, Visualise
Alejandra Gonzalez-Beltran
 
ISA commons - overview and latest developments
Alejandra Gonzalez-Beltran
 
Metadata for Interoperable Bioscience
Alejandra Gonzalez-Beltran
 
From peer-reviewed to peer-reproduced: a role for research objects in scholar...
Alejandra Gonzalez-Beltran
 
Brazil-UK Frontiers of Engineering - Big data in healthcare session
Alejandra Gonzalez-Beltran
 
COPO kick-off meeting
Alejandra Gonzalez-Beltran
 

Recently uploaded (20)

PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
The Future of Artificial Intelligence (AI)
Mukul
 
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 

OpenTox Europe 2013