SlideShare a Scribd company logo
BD2K and why bioinformatics matters
relevance to Australia
EMBL - Australia AHM 2016
Vivien Bonazzi
Senior Advisor for Data Science Technologies
ADDs (Assoc. Director for Data Science) Office
Office of the Director (OD)
National Institutes of Health (NIH)
The NIH Data Commons
Digital Ecosystems for using and sharing FAIR Data
EMBL - Australia AHM 2016
Vivien Bonazzi
Senior Advisor for Data Science Technologies
ADDs (Assoc. Director for Data Science) Office
Office of the Director (OD)
National Institutes of Health (NIH)
http://datascience.nih.gov/bd2k
A word about BD2K
What’s driving the need for a
Data Commons?
Convergence of factors
Mountains of Data
Increasing need and support for Data sharing
Availability of digital technologies and
infrastructures that support Data at scale
EMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data Commons
https://gds.nih.gov/
Went into effect January 25, 2015
NCI guidance:
http://www.cancer.gov/grants-training/grants-management/nci-
policies/genomic-data
Requires public sharing of genomic data sets
9
Recommendation #4: A national cancer data ecosystem for sharing and analysis.
Create a National Cancer Data Ecosystem to collect, share, and interconnect a broad
array of large datasets so that researchers, clinicians, and patients will be able to both
contribute and analyze data, facilitating discovery that will ultimately improve patient
care and outcomes.
9
EMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data Commons
Challenges with Biomedical Data
The Journal Article is the end goal
Data is a means to an ends (low value)
Data is not FAIR
Findable, Accessible, Interoperable, Reproducible
Limited e-infrastructures to support FAIR data
What’s
Changing?
Digital
ecosystems
Development of the
NIH Data Commons
 How do we find data, software, standards?
 How can we make (large) data, annotations, software,
metadata accessible?
 How do we reuse data, tools and standards?
 How do we make more data machine readable?
 How do we leverage existing digital technologies systems,
infrastructures?
 How do we collaborate?
 How do we enable digital ecosystem?
Changing the conversation around
Data sharing and access
NIH Data Commons
Data Commons
enabling data driven science
Enable investigators to leverage all possible data and tools
in the effort to accelerate biomedical discoveries, therapies
and cures
by
driving the development of data infrastructure and data
science capabilities through collaborative research and
robust engineering
Matthew Trunnel, FHC
Data Commons’s
Developing a Data Commons
 Treats products of research – data, methods, papers etc.
as digital objects
 These digital objects exist in a shared virtual space
• Find, Deposit, Manage, Share, and Reuse data,
software, metadata and workflows
 Digital object compliance through FAIR principles:
• Findable
• Accessible (and usable)
• Interoperable
• Reusable
The Data Commons
is a framework
that supports
FAIR data access and sharing
and
fosters the development
of a digital ecosystem
https://datascience.nih.gov/commons
The Data Commons Framework
Compute Platform: Cloud
Services: APIs, Containers, Indexing,
Software: Services & Tools
scientific analysis tools/workflows
Data
“Reference” Data Sets
User defined data
DigitalObjectCompliance
App store/User Interface
PaaS
SaaS
IaaS
https://datascience.nih.gov/commons
Current Data Commons Pilots
Current Data Commons Pilots
Explore feasibility of the Commons Framework
Facilitate collaboration and interoperability
Making large and/or high impact NIH funded data sets and tools
accessible in the cloud
Developing Data and Software indexing methods
Leveraging BD2K Efforts: bioCADDIE and others.
Collaborating with external groups
Provide access to cloud (IaaS) and PaaS/SaaS via credits
Connecting credits to the grants system
Reference Data Sets Pilot
Large, High-Impact Datasets in the Cloud
Commons Framework Pilots
Software and Services
Commons Framework
• FAIRness Metrics
• Data-object registry
• Interoperability of APIs
• Workflow sharing and docker registry
• Commons Framework Publications
Resource Search & Indexing
Discoverability of data and software
Cloud Credits Model
$ denominated NIH credits to use
cloud resources (IaaS) and services (PaaS/SaaS)
The Data Commons Framework
Compute Platform: Cloud
Services: APIs, Containers, Indexing,
Software: Services & Tools
scientific analysis tools/workflows
Data
“Reference” Data Sets
User defined data
DigitalObjectCompliance
App store/User Interface
PaaS
SaaS
IaaS
https://datascience.nih.gov/commons
Authorization /authentication layer
Digital Ecosystem
Considerations and
Concluding Thoughts
Considerations
 Metrics – Understanding and accounting of data usage patterns
 Cost
• Cloud Storage
• Pay for use cloud compute (NIH credits pilot)
• Indirect costs for cloud
 Hybrid Clouds – Institution (private) and commercial (public) clouds
 Managing Open vs Controlled access data
• Auth: single sign on - dreams/nightmares?
 Archive vs Working and versioning Copies of data
 Interoperability with other Commons (clouds)
 Standards – Metadata, UIDs, APIs
 Discoverability – Finding digital objects across clouds
 Interfaces – For users with different needs and capabilities
 Consent – Reconsenting data, Dynamic consents?
 Policies
• Data sharing policies that are useful and effective
• Keep pace with use of technology (e.g. dbGAP data in the Cloud)
 Incentives
• Access to, and shareability of FAIR Data as part of NIH grant review
criteria
 Governance – Community involvement in governance models
 Sustainability – Long term support
Relevance to Australia?
Relevance to Australia
 The value of Australian Data *
 Unique flora and fauna
 e.g Marsupials
 Indigenous Australians
 Understanding of genomic structure – health & disease
 Medicinal products
 Making this data (securely) available
 With high quality annotation and metadata
 Attributions to original authors
 On the cloud
 Via open standard APIs
 Aggregation of data via an Australian wide Commons?
Authorization /authentication layer
Oz Digital Ecosystem
Summary
 We need an unprecedented level of convergence and
collaboration to drive biomedical science to the next level.
 Supporting this model of data-intensive collaborative science
requires a shift in academic research culture and new
investments in data infrastructure and capabilities.
Matthew Trunnel, FHC
Acknowledgments
• ADDS Office: Jennie Larkin, Phil Bourne, Michelle Dunn,Mark Guyer, Allen Dearry, Sonynka Ngosso,
Tonya Scott, Lisa Dunneback, Vivek Navale (CIT/ADDS)
• NCBI: George Komatsoulis
• NHGRI: Valentina di Francesco
• NIGMS: Susan Gregurick
• CIT: Andrea Norris, Debbie Sinmao
• NIH Common Fund: Jim Anderson , Betsy Wilder, Leslie Derr
• NCI Cloud Pilots/ GDC: Warren Kibbe, Tony Kerlavage, Tanja Davidsen
• Commons Reference Data Set Working Group: Weiniu
Gan (HL), Ajay Pillai (HG), Elaine Ayres, (BITRIS), Sean Davis (NCI), Vinay Pai (NIBIB),
Maria Giovanni (AI), Leslie Derr (CF), Claire Schulkey (AI)
• RIWG Core Team: Ron Margolis (DK), Ian Fore, (NCI), Alison Yao (AI),
Claire Schulkey (AI), Eric Choi (AI)
• OSP: Dina Paltoo, Kris Langlais, Erin Luetkemeier, Agnes Rooke,
• Research and Industry: Mathew Trunnell (FHC), Bob Grossman (Chicago), Toby Bloom (NYGC)
Stay in
Touch
QR Business Card
LinkedIn
@Vivien.Bonazzi
Slideshare
Blog
(Coming soon!)

More Related Content

PPTX
Bonazzi commons bd2 k ahm 2016 v2
Vivien Bonazzi
 
PPTX
The NIH Data Commons - BD2K All Hands Meeting 2015
Vivien Bonazzi
 
PPTX
BD2K and the Commons : ELIXR All Hands
Vivien Bonazzi
 
PPTX
NIH Data Commons - Note: Presentation has animations
Vivien Bonazzi
 
PPTX
NIH Data Summit - The NIH Data Commons
Vivien Bonazzi
 
PPTX
Bonazzi data commons nhgri council feb 2017
Vivien Bonazzi
 
PPTX
Data Commons Garvan - 2016
Vivien Bonazzi
 
PPTX
Data commons bonazzi bd2 k fundamentals of science feb 2017
Vivien Bonazzi
 
Bonazzi commons bd2 k ahm 2016 v2
Vivien Bonazzi
 
The NIH Data Commons - BD2K All Hands Meeting 2015
Vivien Bonazzi
 
BD2K and the Commons : ELIXR All Hands
Vivien Bonazzi
 
NIH Data Commons - Note: Presentation has animations
Vivien Bonazzi
 
NIH Data Summit - The NIH Data Commons
Vivien Bonazzi
 
Bonazzi data commons nhgri council feb 2017
Vivien Bonazzi
 
Data Commons Garvan - 2016
Vivien Bonazzi
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Vivien Bonazzi
 

What's hot (20)

PPTX
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
Blue BRIDGE
 
PPTX
NDS Relevant Update from the NIH Data Science (ADDS) Office
Philip Bourne
 
PPTX
Data Harmonization for a Molecularly Driven Health System
Warren Kibbe
 
PDF
Baker - Evolution of Data Products and Designated Audiences
National Information Standards Organization (NISO)
 
PPTX
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET
 
PPTX
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 
PPT
BD2K Update
Philip Bourne
 
PDF
Integration of research literature and data (InFoLiS)
Philipp Zumstein
 
PPTX
Komatsoulis internet2 global forum 2015
George Komatsoulis
 
PPTX
SEAD slide set (October 2011)
SEAD
 
PPTX
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
SEAD
 
PPTX
A Big Picture in Research Data Management
Carole Goble
 
PPTX
ESA14 Workshop on SEAD's Data Services and Tools
SEAD
 
PPTX
Komatsoulis internet2 executive track
George Komatsoulis
 
PPTX
Paving the way to open and interoperable research data service workflows Prog...
ResearchSpace
 
PPTX
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
University of California Curation Center
 
PPT
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
Wansoo Im
 
PPTX
Supporting UC Research Data Management
slabrams
 
PPTX
Imaging dearry ncrdc 11062017
imgcommcall
 
PDF
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Robert Grossman
 
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
Blue BRIDGE
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
Philip Bourne
 
Data Harmonization for a Molecularly Driven Health System
Warren Kibbe
 
Baker - Evolution of Data Products and Designated Audiences
National Information Standards Organization (NISO)
 
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 
BD2K Update
Philip Bourne
 
Integration of research literature and data (InFoLiS)
Philipp Zumstein
 
Komatsoulis internet2 global forum 2015
George Komatsoulis
 
SEAD slide set (October 2011)
SEAD
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
SEAD
 
A Big Picture in Research Data Management
Carole Goble
 
ESA14 Workshop on SEAD's Data Services and Tools
SEAD
 
Komatsoulis internet2 executive track
George Komatsoulis
 
Paving the way to open and interoperable research data service workflows Prog...
ResearchSpace
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
University of California Curation Center
 
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
Wansoo Im
 
Supporting UC Research Data Management
slabrams
 
Imaging dearry ncrdc 11062017
imgcommcall
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Robert Grossman
 
Ad

Similar to EMBL Australian Bioinformatics Resource AHM - Data Commons (20)

PPTX
The Commons: Leveraging the Power of the Cloud for Big Data
Philip Bourne
 
PPT
Opportunities and Challenges for International Cooperation Around Big Data
Philip Bourne
 
PPT
Meeting the Computational Challenges Associated with Human Health
Philip Bourne
 
PPTX
Reproducibility: A Funder and Data Science Perspective
Philip Bourne
 
PDF
A Data Biosphere for Biomedical Research
Robert Grossman
 
PPTX
Big Data as a Catalyst for Collaboration & Innovation
Philip Bourne
 
PPT
Yale Day of Data
Philip Bourne
 
PPTX
The NIH Commons: A Cloud-based Training Environment
Philip Bourne
 
PPT
Data Science BD2K Update for NIH
Philip Bourne
 
PPTX
Biomedical Data Sciences - New Name and New Opportunities for Change?
Philip Bourne
 
PPTX
Towards the Digital Research Enterprise
Philip Bourne
 
PDF
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
Robert Grossman
 
PPT
AMIA 2014
Philip Bourne
 
PPTX
Towards a Platform for Global Health
Philip Bourne
 
PPT
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Philip Bourne
 
PPTX
BD2K Update
Philip Bourne
 
PDF
What is Data Commons and How Can Your Organization Build One?
Robert Grossman
 
PPT
Data!
Philip Bourne
 
PPT
The Thinking Behind Big Data at the NIH
Philip Bourne
 
PPT
Data at the NIH
Philip Bourne
 
The Commons: Leveraging the Power of the Cloud for Big Data
Philip Bourne
 
Opportunities and Challenges for International Cooperation Around Big Data
Philip Bourne
 
Meeting the Computational Challenges Associated with Human Health
Philip Bourne
 
Reproducibility: A Funder and Data Science Perspective
Philip Bourne
 
A Data Biosphere for Biomedical Research
Robert Grossman
 
Big Data as a Catalyst for Collaboration & Innovation
Philip Bourne
 
Yale Day of Data
Philip Bourne
 
The NIH Commons: A Cloud-based Training Environment
Philip Bourne
 
Data Science BD2K Update for NIH
Philip Bourne
 
Biomedical Data Sciences - New Name and New Opportunities for Change?
Philip Bourne
 
Towards the Digital Research Enterprise
Philip Bourne
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
Robert Grossman
 
AMIA 2014
Philip Bourne
 
Towards a Platform for Global Health
Philip Bourne
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Philip Bourne
 
BD2K Update
Philip Bourne
 
What is Data Commons and How Can Your Organization Build One?
Robert Grossman
 
The Thinking Behind Big Data at the NIH
Philip Bourne
 
Data at the NIH
Philip Bourne
 
Ad

Recently uploaded (20)

PDF
The Cosmic Symphony: How Photons Shape the Universe and Our Place Within It
kutatomoshi
 
PPTX
Hydrocarbons Pollution. OIL pollutionpptx
AkCreation33
 
PPTX
Brain_stem_Medulla oblongata_functions of pons_mid brain
muralinath2
 
PPTX
The Obesity Paradox. Friend or Foe ?pptx
drdgd1972
 
PDF
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
PPT
1a. Basic Principles of Medical Microbiology Part 2 [Autosaved].ppt
separatedwalk
 
PDF
Approximating manifold orbits by means of Machine Learning Techniques
Esther Barrabés Vera
 
PDF
Evaluating Benchmark Quality: a Mutation-Testing- Based Methodology
ESUG
 
PPTX
Role of GIS in precision farming.pptx
BikramjitDeuri
 
PDF
Migrating Katalon Studio Tests to Playwright with Model Driven Engineering
ESUG
 
PPTX
Limbic system_components_connections_ functions.pptx
muralinath2
 
PPTX
fghvqwhfugqaifbiqufbiquvbfuqvfuqyvfqvfouiqvfq
PERMISONJERWIN
 
PPTX
Feeding stratagey for climate change dairy animals.
Dr.Zulfy haq
 
PPTX
The Toxic Effects of Aflatoxin B1 and Aflatoxin M1 on Kidney through Regulati...
OttokomaBonny
 
PPTX
ANTIANGINAL DRUGS.pptx m pharm pharmacology
46JaybhayAshwiniHari
 
PPTX
RED ROT DISEASE OF SUGARCANE.pptx
BikramjitDeuri
 
PPTX
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
PPT
1. Basic Principles of Medical Microbiology Part 1.ppt
separatedwalk
 
PPTX
Home Garden as a Component of Agroforestry system : A survey-based Study
AkhangshaRoy
 
PPTX
General Characters and Classification of Su class Apterygota.pptx
Dr Showkat Ahmad Wani
 
The Cosmic Symphony: How Photons Shape the Universe and Our Place Within It
kutatomoshi
 
Hydrocarbons Pollution. OIL pollutionpptx
AkCreation33
 
Brain_stem_Medulla oblongata_functions of pons_mid brain
muralinath2
 
The Obesity Paradox. Friend or Foe ?pptx
drdgd1972
 
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
1a. Basic Principles of Medical Microbiology Part 2 [Autosaved].ppt
separatedwalk
 
Approximating manifold orbits by means of Machine Learning Techniques
Esther Barrabés Vera
 
Evaluating Benchmark Quality: a Mutation-Testing- Based Methodology
ESUG
 
Role of GIS in precision farming.pptx
BikramjitDeuri
 
Migrating Katalon Studio Tests to Playwright with Model Driven Engineering
ESUG
 
Limbic system_components_connections_ functions.pptx
muralinath2
 
fghvqwhfugqaifbiqufbiquvbfuqvfuqyvfqvfouiqvfq
PERMISONJERWIN
 
Feeding stratagey for climate change dairy animals.
Dr.Zulfy haq
 
The Toxic Effects of Aflatoxin B1 and Aflatoxin M1 on Kidney through Regulati...
OttokomaBonny
 
ANTIANGINAL DRUGS.pptx m pharm pharmacology
46JaybhayAshwiniHari
 
RED ROT DISEASE OF SUGARCANE.pptx
BikramjitDeuri
 
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
1. Basic Principles of Medical Microbiology Part 1.ppt
separatedwalk
 
Home Garden as a Component of Agroforestry system : A survey-based Study
AkhangshaRoy
 
General Characters and Classification of Su class Apterygota.pptx
Dr Showkat Ahmad Wani
 

EMBL Australian Bioinformatics Resource AHM - Data Commons

  • 1. BD2K and why bioinformatics matters relevance to Australia EMBL - Australia AHM 2016 Vivien Bonazzi Senior Advisor for Data Science Technologies ADDs (Assoc. Director for Data Science) Office Office of the Director (OD) National Institutes of Health (NIH)
  • 2. The NIH Data Commons Digital Ecosystems for using and sharing FAIR Data EMBL - Australia AHM 2016 Vivien Bonazzi Senior Advisor for Data Science Technologies ADDs (Assoc. Director for Data Science) Office Office of the Director (OD) National Institutes of Health (NIH)
  • 4. What’s driving the need for a Data Commons?
  • 5. Convergence of factors Mountains of Data Increasing need and support for Data sharing Availability of digital technologies and infrastructures that support Data at scale
  • 8. https://gds.nih.gov/ Went into effect January 25, 2015 NCI guidance: http://www.cancer.gov/grants-training/grants-management/nci- policies/genomic-data Requires public sharing of genomic data sets
  • 9. 9 Recommendation #4: A national cancer data ecosystem for sharing and analysis. Create a National Cancer Data Ecosystem to collect, share, and interconnect a broad array of large datasets so that researchers, clinicians, and patients will be able to both contribute and analyze data, facilitating discovery that will ultimately improve patient care and outcomes. 9
  • 12. Challenges with Biomedical Data The Journal Article is the end goal Data is a means to an ends (low value) Data is not FAIR Findable, Accessible, Interoperable, Reproducible Limited e-infrastructures to support FAIR data
  • 14. Development of the NIH Data Commons
  • 15.  How do we find data, software, standards?  How can we make (large) data, annotations, software, metadata accessible?  How do we reuse data, tools and standards?  How do we make more data machine readable?  How do we leverage existing digital technologies systems, infrastructures?  How do we collaborate?  How do we enable digital ecosystem? Changing the conversation around Data sharing and access NIH Data Commons
  • 16. Data Commons enabling data driven science Enable investigators to leverage all possible data and tools in the effort to accelerate biomedical discoveries, therapies and cures by driving the development of data infrastructure and data science capabilities through collaborative research and robust engineering Matthew Trunnel, FHC
  • 18. Developing a Data Commons  Treats products of research – data, methods, papers etc. as digital objects  These digital objects exist in a shared virtual space • Find, Deposit, Manage, Share, and Reuse data, software, metadata and workflows  Digital object compliance through FAIR principles: • Findable • Accessible (and usable) • Interoperable • Reusable
  • 19. The Data Commons is a framework that supports FAIR data access and sharing and fosters the development of a digital ecosystem https://datascience.nih.gov/commons
  • 20. The Data Commons Framework Compute Platform: Cloud Services: APIs, Containers, Indexing, Software: Services & Tools scientific analysis tools/workflows Data “Reference” Data Sets User defined data DigitalObjectCompliance App store/User Interface PaaS SaaS IaaS https://datascience.nih.gov/commons
  • 22. Current Data Commons Pilots Explore feasibility of the Commons Framework Facilitate collaboration and interoperability Making large and/or high impact NIH funded data sets and tools accessible in the cloud Developing Data and Software indexing methods Leveraging BD2K Efforts: bioCADDIE and others. Collaborating with external groups Provide access to cloud (IaaS) and PaaS/SaaS via credits Connecting credits to the grants system
  • 23. Reference Data Sets Pilot Large, High-Impact Datasets in the Cloud
  • 25. Commons Framework • FAIRness Metrics • Data-object registry • Interoperability of APIs • Workflow sharing and docker registry • Commons Framework Publications
  • 26. Resource Search & Indexing Discoverability of data and software
  • 27. Cloud Credits Model $ denominated NIH credits to use cloud resources (IaaS) and services (PaaS/SaaS)
  • 28. The Data Commons Framework Compute Platform: Cloud Services: APIs, Containers, Indexing, Software: Services & Tools scientific analysis tools/workflows Data “Reference” Data Sets User defined data DigitalObjectCompliance App store/User Interface PaaS SaaS IaaS https://datascience.nih.gov/commons
  • 31. Considerations  Metrics – Understanding and accounting of data usage patterns  Cost • Cloud Storage • Pay for use cloud compute (NIH credits pilot) • Indirect costs for cloud  Hybrid Clouds – Institution (private) and commercial (public) clouds  Managing Open vs Controlled access data • Auth: single sign on - dreams/nightmares?  Archive vs Working and versioning Copies of data  Interoperability with other Commons (clouds)
  • 32.  Standards – Metadata, UIDs, APIs  Discoverability – Finding digital objects across clouds  Interfaces – For users with different needs and capabilities  Consent – Reconsenting data, Dynamic consents?  Policies • Data sharing policies that are useful and effective • Keep pace with use of technology (e.g. dbGAP data in the Cloud)  Incentives • Access to, and shareability of FAIR Data as part of NIH grant review criteria  Governance – Community involvement in governance models  Sustainability – Long term support
  • 34. Relevance to Australia  The value of Australian Data *  Unique flora and fauna  e.g Marsupials  Indigenous Australians  Understanding of genomic structure – health & disease  Medicinal products  Making this data (securely) available  With high quality annotation and metadata  Attributions to original authors  On the cloud  Via open standard APIs  Aggregation of data via an Australian wide Commons?
  • 36. Summary  We need an unprecedented level of convergence and collaboration to drive biomedical science to the next level.  Supporting this model of data-intensive collaborative science requires a shift in academic research culture and new investments in data infrastructure and capabilities. Matthew Trunnel, FHC
  • 37. Acknowledgments • ADDS Office: Jennie Larkin, Phil Bourne, Michelle Dunn,Mark Guyer, Allen Dearry, Sonynka Ngosso, Tonya Scott, Lisa Dunneback, Vivek Navale (CIT/ADDS) • NCBI: George Komatsoulis • NHGRI: Valentina di Francesco • NIGMS: Susan Gregurick • CIT: Andrea Norris, Debbie Sinmao • NIH Common Fund: Jim Anderson , Betsy Wilder, Leslie Derr • NCI Cloud Pilots/ GDC: Warren Kibbe, Tony Kerlavage, Tanja Davidsen • Commons Reference Data Set Working Group: Weiniu Gan (HL), Ajay Pillai (HG), Elaine Ayres, (BITRIS), Sean Davis (NCI), Vinay Pai (NIBIB), Maria Giovanni (AI), Leslie Derr (CF), Claire Schulkey (AI) • RIWG Core Team: Ron Margolis (DK), Ian Fore, (NCI), Alison Yao (AI), Claire Schulkey (AI), Eric Choi (AI) • OSP: Dina Paltoo, Kris Langlais, Erin Luetkemeier, Agnes Rooke, • Research and Industry: Mathew Trunnell (FHC), Bob Grossman (Chicago), Toby Bloom (NYGC)
  • 38. Stay in Touch QR Business Card LinkedIn @Vivien.Bonazzi Slideshare Blog (Coming soon!)

Editor's Notes

  • #2: Current snapshot of Commons status
  • #3: Current snapshot of Commons status
  • #8: The mission of the Office of Science and Technology Policy is threefold; provide the President and his senior staff with accurate, relevant, and timely scientific and technical advice on all matters of consequence; to ensure that the policies of the Executive Branch are informed by sound science; 3) to ensure that the scientific and technical work of the Executive Branch is properly coordinated so as to provide the greatest benefit to society.
  • #21: Detailed description of the Commons Framework can be found at : https://datascience.nih.gov/commons
  • #29: Detailed description of the Commons Framework can be found at : https://datascience.nih.gov/commons