Bionimbus: A Cloud-Based Infrastructure for Managing, Analyzing and Sharing Genomics Data
April 21, 2011
Robert Grossman
Institute for Genomics & Systems Biology (IGSB)
Computation Institute
University of Chicago
and
Open Cloud Consortium
Background
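Growth of Genomic Data
[Figure: timeline, 1977 to 2008, of sequencing technologies (Sanger sequencing, microarray technology, 454 and Solexa sequencing) and milestones (HGP, ENCODE, sequence species, sequence environment, sequence everything), shown alongside GenBank growth (10^5, 10^8, 10^10) and the arrival of GFS, Hadoop, and AWS.]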
Source: Lincoln Stein
The Challenge is to Support Cubes of High Throughput Sequence Data
Each cell in the data cube can be a ChIP-chip, ChIP-seq, RNA-seq, movie, or other data set.
Cube axes: different developmental stages, different pathologies, perturbations of the environment.
We Have a Problem…
More and more of your colleagues produce so much data that they cannot easily manage, move, analyze and share it.
Centers and large projects build their own infrastructure.
Everyone else is on their own.
Part 1. Using Bionimbus
www.bionimbus.org
Bionimbus is a community cloud for storing, analyzing and sharing genomics and related data.
Enabling a broad community to utilize genome research
[Figure: numbered workflow connecting the User, the Bionimbus Cloud, and a Sequencing Partner or Center.]
Step 1. Prepare a Sample
Step 2. Log in to Bionimbus and get a Bionimbus Key.
Step 3. FedEx your sample to CGI.
Step 4. Log in to Bionimbus and view your data.
Step 5. Use Bionimbus to run standard and custom pipelines.
Using the ability of Bionimbus to launch multiple virtual machines reduced this analysis from 25 days to 1 day.
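The 25-days-to-1-day figure is consistent with simple fan-out arithmetic. The sketch below is illustrative only: it assumes the analysis splits into 25 independent tasks of about one day each and that one virtual machine runs one task at a time. Neither assumption is stated on the slide, and the function name is hypothetical.

```python
import math


def wall_clock_days(total_task_days: int, num_vms: int) -> int:
    """Ideal wall-clock time when independent one-day tasks are spread evenly over VMs."""
    if num_vms < 1:
        raise ValueError("num_vms must be at least 1")
    return math.ceil(total_task_days / num_vms)


# Assumed workload: 25 independent tasks, each taking about one day.
serial_days = wall_clock_days(25, 1)      # one machine works through every task
parallel_days = wall_clock_days(25, 25)   # one VM per task
speedup = serial_days / parallel_days

print(serial_days, parallel_days, speedup)
```

With fewer VMs the gain shrinks in steps: under the same assumptions, 10 VMs would need 3 days, because the last batch of tasks only partly fills the machines.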
Step 1. Get Bionimbus ID (BID), assign project, private/community, public cloud, etc.
Step 2. Send sample to be sequenced.
Step 3a. Return raw reads.
Step 3b. Return variant calls, CNV, annotation…
Step 4. Secure data routing to appropriate cloud based upon BID.
Step 5. Cloud-based analysis using IGSB and 3rd-party tools and applications.
[Diagram components: BID Generator, Internal Sequencers, CGI, Bionimbus Private Cloud UC, Bionimbus Community Cloud, Bionimbus Private Cloud XY, Amazon, dbGaP.]
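Routing by BID can be pictured as a lookup from an identifier to the cloud chosen when the ID was issued. The sketch below is a minimal illustration, not the Bionimbus implementation: the `BidRecord` fields, the `BID-0001` identifier format, the project names, and the mapping of the "public" option to Amazon are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BidRecord:
    """Hypothetical metadata attached to a Bionimbus ID when it is issued (Step 1)."""
    bid: str
    project: str
    visibility: str          # assumed values: "private", "community", "public"
    private_cloud: str = ""  # e.g. "UC" or "XY"; only used for private data


def destination(record: BidRecord) -> str:
    """Pick the target cloud for returned data (Step 4) from the BID metadata alone."""
    if record.visibility == "private":
        if not record.private_cloud:
            raise ValueError(f"{record.bid}: private data needs a named private cloud")
        return f"Bionimbus Private Cloud {record.private_cloud}"
    if record.visibility == "community":
        return "Bionimbus Community Cloud"
    if record.visibility == "public":
        return "Amazon"  # assumption: the public option maps to the Amazon box
    raise ValueError(f"{record.bid}: unknown visibility {record.visibility!r}")


def route(bid: str, registry: dict[str, BidRecord]) -> str:
    """Look up a BID and return where its data should go; unknown BIDs are rejected."""
    if bid not in registry:
        raise KeyError(f"unregistered BID: {bid}")
    return destination(registry[bid])


# Illustrative registry; identifiers and project names are made up.
registry = {
    "BID-0001": BidRecord("BID-0001", "project-a", "private", "UC"),
    "BID-0002": BidRecord("BID-0002", "project-b", "community"),
    "BID-0003": BidRecord("BID-0003", "project-c", "public"),
}

for bid in registry:
    print(bid, "->", route(bid, registry))
```

Deciding the destination when the ID is issued means the sequencing side only has to attach the BID to returned data, and unregistered IDs fail loudly instead of being routed by default.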
Part 2. Introduction to Clouds
Clouds provide on-demand computing and storage resources at the scale and with the reliability of a data center.
Computer scientists were caught by surprise.
What Is a Cloud?
Software as a Service (SaaS)
What Else Is a Cloud?
Infrastructure as a Service (IaaS)
Users get one or more virtual machines “on demand”.
Are There Other Types of Clouds?
Hadoop was developed for processing Internet-scale data for ad targeting and related applications, but is now used for processing genomics data and many other applications.
What is new about clouds?
Scale is New
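To make the Hadoop point concrete, here is a small in-memory sketch of the map, shuffle, and reduce pattern that Hadoop runs across a cluster, applied to counting k-mers in sequencing reads. It is plain Python, not the Hadoop API; the function names, the two example reads, and the choice of k = 3 are assumptions for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_read(read: str, k: int) -> list[tuple[str, int]]:
    """Map step: emit (k-mer, 1) for every k-length window in one read."""
    return [(read[i:i + k], 1) for i in range(len(read) - k + 1)]


def shuffle(pairs) -> dict[str, list[int]]:
    """Shuffle step: group emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups: dict[str, list[int]]) -> dict[str, int]:
    """Reduce step: sum the values collected for each k-mer."""
    return {key: sum(values) for key, values in groups.items()}


reads = ["ACGTAC", "GTACGT"]  # toy input; real runs process far more reads
pairs = chain.from_iterable(map_read(read, 3) for read in reads)
counts = reduce_counts(shuffle(pairs))

print(counts)
```

Each read is mapped independently, which is what lets a framework spread the map step over many machines and move only the grouped pairs to the reducers.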
Elastic, On-Demand Computing with Usage-Based Pricing Is New
1 computer in a rack for 120 hours costs the same as 120 computers in three racks for 1 hour.
Data center scale computing often leverages virtualization technologies.
Part 3. Some Bionimbus Cases
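The pricing claim follows from billing by machine-hours. The sketch below works it through; the rate of 10 cents per machine-hour is a made-up figure, and the function name is hypothetical. Only the equality between the two configurations comes from the slide.

```python
def usage_cost_cents(machines: int, hours: int, cents_per_machine_hour: int) -> int:
    """Usage-based pricing: pay for machine-hours consumed, however they are arranged."""
    return machines * hours * cents_per_machine_hour


RATE_CENTS = 10  # hypothetical price per machine-hour

one_machine_long = usage_cost_cents(1, 120, RATE_CENTS)    # 1 computer for 120 hours
many_machines_short = usage_cost_cents(120, 1, RATE_CENTS)  # 120 computers for 1 hour

print(one_machine_long, many_machines_short)
```

Both configurations consume 120 machine-hours, so the bill is identical while the second finishes 120 times sooner, provided the work parallelizes.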
Case Study: Public Datasets in Bionimbus
Bionimbus - Northwestern CGI Workshop 4-21-2011
Case Study: modENCODE
Bionimbus is used to process the modENCODE data from the White lab (over 1000 experiments).
Bionimbus VMs were used for some of the integrative analysis.
Bionimbus is used as a backup for the modENCODE DCC.
>300 ChIP datasets
Chromatin/RNA timecourse
CBP
PolII
Pho/silencers
HDACs
Insulators
TFs
Predictions: 537 silencers; 2,307 new promoters; 12,285 enhancers; 14,145 insulators
www.modencode.org
www.cistrack.org
Negre et al. Nature 2011
Case Study: IGSB
All samples processed by the Institute for Genomics & Systems Biology High-Throughput Genome Analysis Core (HGAC) at the University of Chicago use Bionimbus.
Bionimbus Virtual Machine Releases
Part 4. Data Centers for Science
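[Figure: timeline of scientific eras (experimental science, simulation science, data science) with the gains marked at each date: 1609 (30x), 1670 (250x), 1976 (10x-100x), 2004 (10x-100x).]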
Open Science Data Cloud
Astronomical data
Biological data (Bionimbus)
NSF-PIRE OSDC Data Challenge
Earth science data (& disaster relief)
The goal is to build a data center in Chicago for biological, scientific, medical and health care data in 4 to 5 years.
Part 5. More About Bionimbus
GWT-based Front End
Elastic Cloud Services
Database Services
Analysis Pipelines & Re-analysis Services
Intercloud Services
Large Data Cloud Services
Data Ingestion Services
GWT-based Front End
Elastic Cloud Services (Eucalyptus, OpenStack)
Database Services (PostgreSQL)
Analysis Pipelines & Re-analysis Services
Intercloud Services (UDT, replication)
Large Data Cloud Services (Hadoop, Sector/Sphere)
Data Ingestion Services (IDs, etc.)

Bionimbus Deployment Options
Bionimbus Community Cloud (www.bionimbus.org)
Bionimbus AMIs & Amazon-hosted applications
Bionimbus Private Clouds
A successful cloud will…
1. Provide long-term persistent storage services at the scale of a data center.
2. Provide compute services at the scale of a data center.
3. Provide high-performance ingestion and transport of data.
4. Support the liberation of data.
5. Peer with public clouds.
6. Peer with private genomics clouds.
Bionimbus satisfies each of these six requirements.
Bionimbus Road Map
Over the next 3 to 4 months, we will:
Launch Bionimbus (we are in a pre-launch)
Add Galaxy-based workflow to Bionimbus
Add secure routing of genomes
Add more public datasets
Add more pipelines