“Quantified Self- 
On Being a Personal Genomic Observatory” 
Keynote in the 
“Humans as Genomic Observatories” Meeting 
Session in the Genomics Standards Consortium GSC 15 
April 24, 2013 
Dr. Larry Smarr 
Director, California Institute for Telecommunications and Information Technology 
Harry E. Gruber Professor, 
Dept. of Computer Science and Engineering 
Jacobs School of Engineering, UCSD 
http://lsmarr.calit2.net 
1
Calit2 Community Cyberinfrastructure for Advanced 
Microbial Ecology Research and Analysis (CAMERA) 
512 Processors 
~5 Teraflops 
Source: Phil Papadopoulos, SDSC, Calit2 
~ 200 Terabytes Storage 1GbE and 
10GbE 
Switched/ 
Routed 
Core 
~200TB 
Sun 
X4500 
Storage 
10GbE 
5000 Users 
90 Countries
Access to Computing Resources Tailored by User’s 
Requirements and Resources 
NSF/SDSC 
Gordon 
Infrastructure Services Extend 
CAMERA Computations to 
3rd Party Compute Resources 
UCSD Triton 
NSF/SDSC 
Trestles 
NSF/RCAC 
Steele 
NSF/TACC 
Lonestar 
NSF/TACC 
Ranger 
Core CAMERA 
HPC Resource 
EAGER: Multi-Domain, Workflow-Driven 
Computation System for 
Microbial Ecology Research and Analysis 
Source: 
Jeff Grethe, 
CRBS, UCSD
CAMERA 
A Community Gateway to Data & Analysis Functions 
Data 
Data Analysis
Marine Genome Sequencing Project – 
CAMERA Anchor Dataset Launched March 13, 2007 
Measuring the Genetic Diversity 
of Ocean Microbes 
Specify 
Ocean Data 
Each Sample 
~2000 
Microbial 
Species
The Human Microbiome Is a Microbial Environment 
Being Metagenomically Sampled
CAMERA and NIH Funded Weizhong Li Group’s Metagenomic 
Computational NextGen Sequencing Pipeline 
Raw reads 
Reads QC 
HQ reads: 
Filter human 
Bowtie/BWA against 
Human genome and 
mRNAs 
Unique reads 
CD-HIT-Dup 
For single or PE reads 
Read recruitment Filter errors 
Further filtered 
reads 
Filtered reads 
Filter duplicate 
Cluster-based 
Denoising 
Assemble 
Contigs 
Velvet, 
SOAPdenovo, 
Abyss 
------- 
K-mer setting 
Mapping BWA Bowtie 
Contigs with 
Abundance 
Taxonomy binning 
FR-HIT against 
Non-redundant 
microbial genomes 
FRV 
Visualization 
tRNA-scan 
rRNA - HMM 
tRNAs 
rRNAs 
ORFs 
ORF-finder 
Megagene 
Cd-hit at 95% 
Non redundant 
ORFs 
Cd-hit at 60% 
Core ORF clusters 
Cd-hit at 30% 1e-6 
Protein families 
Function 
Pathway 
Annotation 
Pfam 
Tigrfam 
COG 
KOG 
PRK 
KEGG 
eggNOG 
Hmmer 
RPS-blast 
blast 
PI: (Weizhong Li, UCSD): 
NIH R01HG005978 (2010-2013, $1.1M)
What is a “Healthy” Gut Microbiome? 
Dominated by Bacteroidetes and Firmicute Phyla 
Source: “Structure, function and diversity of the healthy human 
microbiome,” HMP Consortium, Nature, 486, 207-212 (2012)
To Map My Gut Microbes, I Sent a Stool Sample to 
the Venter Institute for Metagenomic Sequencing 
Shipped Stool Sample 
December 28, 2011 
Gel Image of Extract from Smarr Sample-Next is Library Construction 
Manny Torralba, Project Lead - Human Genomic Medicine 
J Craig Venter Institute 
January 25, 2012 
I Received 
a Disk Drive April 3, 2012 
With 35 GB FASTQ Files 
Weizhong Li, UCSD 
NGS Pipeline: 
230M Reads 
Only 0.2% Human 
Required 1/2 cpu-yr 
Per Person Analyzed! 
Sequencing 
Funding 
Provided by 
UCSD School of 
Health Sciences
Phyla Gut Microbial Abundance Without Viruses: 
LS, Crohn’s, UC, and Healthy Subjects 
Source: Weizhong Li, UCSD; Calit2 FuturePatient Expedition 
Crohn’s Ulcerative 
LS Healthy 
Colitis 
Toward Noninvasive 
Microbial Ecology Diagnostics
Almost All Abundant Species (≥1%) in Healthy Subjects 
Are Severely Depleted in LS Gut 
Source: Sequencing JCVI; Analysis Weizhong Li, UCSD 
LS December 28, 2011 Stool Sample
Top 20 Most Abundant Microbial Species 
In LS vs. Average Healthy Subject 
152x 
765x 
148x 
849x 
483x 
220x 
201x 
169x 
522x 
Number Above 
LS Blue Bar is Multiple 
of LS Abundance 
Compared to Average 
Healthy Abundance 
Per Species 
Source: Sequencing JCVI; Analysis Weizhong Li, UCSD 
LS December 28, 2011 Stool Sample
Comparing 3 LS Time Snapshots (Left) 
with Healthy, Crohn’s, UC (Right Top to Bottom) 
Calit2 VROOM-FuturePatient Expedition
We Find Major Shifts in Microbial Ecology 
Between Healthy and Two Forms of IBD 
Collapse of 
Bacteroidetes 
Microbiome “Dysbiosis” 
or “Mass Extinction”? 
Explosion of 
Proteobacteria 
On the IBD Spectrum
I Have Massive Reduction 
in the Families of the Bacteroidetes Phylum in My Gut 
Calit2 FuturePatient Expedition
Major Changes in LS Microbiome Before and After 
1 Month Antibiotic & 2 Month Prednisone Therapy 
Reduced 45x 
Reduced 90x 
Therapy Greatly Reduced Two Phyla, 
But Massive Reduction in Bacteroidetes 
And Large % Proteobacteria Remain 
Small Changes 
With No Therapy 
How Does One Get Back 
to a “Healthy” Gut Microbiome?
From War 
to Gardening 
“I would like to lose the language of warfare,” 
said Julie Segre, a senior investigator at 
the National Human Genome Research Institute. 
”It does a disservice to all the bacteria 
that have co-evolved with us 
and are maintaining the health of our bodies.”
From Taxonomy to Function: 
Analysis of LS Clusters of Orthologous Groups (COGs) 
Analysis: Weizhong Li & Sitao Wu, UCSD
What is Adequate Metadata 
to Define the Environment of the Human Microbiome? 
• Need the Variables that Determine Relative 
Abundances of Microbial Species 
– Genetics of Host 
– Immune System Variables 
– Other Environmental Variables (Food, Antibiotics, etc.) 
• At What Scale Do We Need These Metadata Variables? 
– SNPs vs. Full Genome 
– Medical Tests vs. Proteomics, Metabolomics, 
Transcriptomics 
– Phenotyping of Signs and Symptoms

More Related Content

PDF
Bayesian Taxonomic Assignment for the Next-Generation Metagenomics
PPTX
Analyzing the Human Gut Microbiome Dynamics in Health and Disease Using Super...
PPTX
Using Supercomputers and Gene Sequencers to Discover Your Inner Microbiome
PPTX
Using Supercomputers to Discover the 100 Trillion Bacteria Living Within Each...
PPT
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
PPT
Case studies of HTS / NGS applications
PPT
Creating a Cyberinfrastructure for Advanced Marine Microbial Ecology Research...
PPTX
Trends In Genomics
Bayesian Taxonomic Assignment for the Next-Generation Metagenomics
Analyzing the Human Gut Microbiome Dynamics in Health and Disease Using Super...
Using Supercomputers and Gene Sequencers to Discover Your Inner Microbiome
Using Supercomputers to Discover the 100 Trillion Bacteria Living Within Each...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Case studies of HTS / NGS applications
Creating a Cyberinfrastructure for Advanced Marine Microbial Ecology Research...
Trends In Genomics

What's hot (20)

PPT
Using Supercomputers and Supernetworks to Explore the Ocean of Life
PPTX
Supercomputing Your Inner Microbiome
PPTX
Bioinformatics as a tool for understanding carcinogenesis
PPTX
Using Supercomputers and Data Science to Reveal Your Inner Microbiome
PDF
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
PDF
Clinical Metagenomics for Rapid Detection of Enteric Pathogens and Characteri...
PDF
Building bioinformatics resources for the global community
PPTX
Cross-Kingdom Standards in Genomics, Epigenomics and Metagenomics
PPT
Microbial Metagenomics and Human Health
PPT
Living in a Microbial World
PPT
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
PPT
The OptIPlanet Collaboratory Supporting Microbial Metagenomics Researchers Wo...
PPT
Quantifying Your Superorganism Body Using Big Data Supercomputing
PPTX
The Chills and Thrills of Whole Genome Sequencing
PPTX
Metagenomics
PPT
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
PPT
How Studying Astrophysics and Coral Reefs Enabled Me to Become an Empowered,...
PPTX
Creating a High Performance Cyberinfrastructure to Support Analysis of Illumi...
PPT
Advancing the Metagenomics Revolution
PPTX
Big data nebraska
Using Supercomputers and Supernetworks to Explore the Ocean of Life
Supercomputing Your Inner Microbiome
Bioinformatics as a tool for understanding carcinogenesis
Using Supercomputers and Data Science to Reveal Your Inner Microbiome
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Clinical Metagenomics for Rapid Detection of Enteric Pathogens and Characteri...
Building bioinformatics resources for the global community
Cross-Kingdom Standards in Genomics, Epigenomics and Metagenomics
Microbial Metagenomics and Human Health
Living in a Microbial World
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
The OptIPlanet Collaboratory Supporting Microbial Metagenomics Researchers Wo...
Quantifying Your Superorganism Body Using Big Data Supercomputing
The Chills and Thrills of Whole Genome Sequencing
Metagenomics
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
How Studying Astrophysics and Coral Reefs Enabled Me to Become an Empowered,...
Creating a High Performance Cyberinfrastructure to Support Analysis of Illumi...
Advancing the Metagenomics Revolution
Big data nebraska
Ad

Viewers also liked (20)

PPT
Digital Culture and the Future Internet
PPTX
Commercializing Space: From the Moon to Mars
PPT
Observing the Dynamics of the Human Immune System Coupled to the Microbiome i...
PPTX
Deciphering the Dynamic Coupling of the Human Immune System and the Gut Micro...
PPT
An Integrated Science Cyberinfrastructure for Data-Intensive Research
PPT
A Systems Approach to Personalized Medicine
PPTX
The Deeply Quantified Self: A Case Study
PPTX
Four Disruptive Trends for the Next Decade
PPTX
Building a Regional 100G Collaboration Infrastructure
PDF
Using Dell’s HPC Cloud & Advanced Analytic Software to Discover Radical Chang...
PPTX
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
PPT
The History and Possible Futures of the Internet
PPT
Introduction to the UCSD Division of Calit2
PPT
Deep Self - Quantifying the State of Your Body
PPT
Will the Quantified-Self Movement Disrupt Healthcare
PPT
Discovering Yourself with Computational Bioinformatics
PPT
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
PPTX
Quantifying The Dynamics of Your Superorganism Body Using Big Data Supercompu...
PPT
Big Data and Superorganism Genomics: Microbial Metagenomics Meets Human Genomics
PPT
The UCSD Big Data Freeway System
Digital Culture and the Future Internet
Commercializing Space: From the Moon to Mars
Observing the Dynamics of the Human Immune System Coupled to the Microbiome i...
Deciphering the Dynamic Coupling of the Human Immune System and the Gut Micro...
An Integrated Science Cyberinfrastructure for Data-Intensive Research
A Systems Approach to Personalized Medicine
The Deeply Quantified Self: A Case Study
Four Disruptive Trends for the Next Decade
Building a Regional 100G Collaboration Infrastructure
Using Dell’s HPC Cloud & Advanced Analytic Software to Discover Radical Chang...
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
The History and Possible Futures of the Internet
Introduction to the UCSD Division of Calit2
Deep Self - Quantifying the State of Your Body
Will the Quantified-Self Movement Disrupt Healthcare
Discovering Yourself with Computational Bioinformatics
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
Quantifying The Dynamics of Your Superorganism Body Using Big Data Supercompu...
Big Data and Superorganism Genomics: Microbial Metagenomics Meets Human Genomics
The UCSD Big Data Freeway System
Ad

Similar to Quantified Self On Being A Personal Genomic Observatory (20)

PPT
Cross-Disciplinary Biomedical Research at Calit2
PPTX
Machine Learning Opportunities in the Explosion of Personalized Precision Med...
PPT
From N=1 to N=100: What I Have Learned from Quantifying My Superorganism Body
PPTX
Discovering the Other 90% of our Human Superorganism
PPTX
Decoding the Software Inside of You
PPT
The Emerging Global Community of Microbial Metagenomics Researchers
PPTX
Exploring the Dynamics of The Microbiome in Health and Disease
PPTX
Analyzing the Human Gut Microbiome Dynamics in Health and Disease Using Super...
PPT
Using Genetic Sequencing to Unravel the Dynamics of Your Superorganism Body
PPT
Large Memory High Performance Computing Enables Comparison Across Human Gut M...
PPT
Microbial Metagenomics Drives a New Cyberinfrastructure
PPT
Interactions of the Immune System with the Gut Microbiome in Inflammatory Bo...
PPTX
Supercomputing Your Inner Microbiome
PPT
Sequencing Genomics: The New Big Data Driver
PPTX
Finding the Patterns in the Big Data From Human Microbiome Ecology
PPTX
Mapping the Human Gut Microbiome in Health and Disease Using Sequencing, Supe...
PPTX
Biodiversity & Citizen Science in the Genomic Era
PPT
Quantifying the Time Progression of the Interaction of the Human Immune Syste...
PDF
Machine Learning in Healthcare by Mehrdad Yazdani
PPT
Tracking Large Variations in My Immune Biomarkers and My Gut Microbiome: Infl...
Cross-Disciplinary Biomedical Research at Calit2
Machine Learning Opportunities in the Explosion of Personalized Precision Med...
From N=1 to N=100: What I Have Learned from Quantifying My Superorganism Body
Discovering the Other 90% of our Human Superorganism
Decoding the Software Inside of You
The Emerging Global Community of Microbial Metagenomics Researchers
Exploring the Dynamics of The Microbiome in Health and Disease
Analyzing the Human Gut Microbiome Dynamics in Health and Disease Using Super...
Using Genetic Sequencing to Unravel the Dynamics of Your Superorganism Body
Large Memory High Performance Computing Enables Comparison Across Human Gut M...
Microbial Metagenomics Drives a New Cyberinfrastructure
Interactions of the Immune System with the Gut Microbiome in Inflammatory Bo...
Supercomputing Your Inner Microbiome
Sequencing Genomics: The New Big Data Driver
Finding the Patterns in the Big Data From Human Microbiome Ecology
Mapping the Human Gut Microbiome in Health and Disease Using Sequencing, Supe...
Biodiversity & Citizen Science in the Genomic Era
Quantifying the Time Progression of the Interaction of the Human Immune Syste...
Machine Learning in Healthcare by Mehrdad Yazdani
Tracking Large Variations in My Immune Biomarkers and My Gut Microbiome: Infl...

More from Larry Smarr (20)

PPTX
Revealing the Dynamics of an Individual’s Gut Microbiome Dynamics
PPTX
Smart Patients, Big Data, NextGen Primary Care
PPTX
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
PPTX
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
PPTX
National Research Platform: Application Drivers
PPT
From Supercomputing to the Grid - Larry Smarr
PPTX
The CENIC-AI Resource - Los Angeles Community College District (LACCD)
PPT
Redefining Collaboration through Groupware - From Groupware to Societyware
PPT
The Coming of the Grid - September 8-10,1997
PPT
Supercomputers: Directions in Technology, Architecture, and Applications
PPT
High Performance Geographic Information Systems
PPT
Data Intensive Applications at UCSD: Driving a Campus Research Cyberinfrastru...
PPT
Enhanced Telepresence and Green IT — The Next Evolution in the Internet
PPTX
The CENIC AI Resource CENIC AIR - CENIC Retreat 2024
PPTX
The CENIC-AI Resource: The Right Connection
PPTX
The Pacific Research Platform: The First Six Years
PPTX
The NSF Grants Leading Up to CHASE-CI ENS
PPTX
Integrated Optical Fiber/Wireless Systems for Environmental Monitoring
PPTX
Toward a National Research Platform to Enable Data-Intensive Open-Source Sci...
PPTX
Toward a National Research Platform to Enable Data-Intensive Computing
Revealing the Dynamics of an Individual’s Gut Microbiome Dynamics
Smart Patients, Big Data, NextGen Primary Care
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
National Research Platform: Application Drivers
From Supercomputing to the Grid - Larry Smarr
The CENIC-AI Resource - Los Angeles Community College District (LACCD)
Redefining Collaboration through Groupware - From Groupware to Societyware
The Coming of the Grid - September 8-10,1997
Supercomputers: Directions in Technology, Architecture, and Applications
High Performance Geographic Information Systems
Data Intensive Applications at UCSD: Driving a Campus Research Cyberinfrastru...
Enhanced Telepresence and Green IT — The Next Evolution in the Internet
The CENIC AI Resource CENIC AIR - CENIC Retreat 2024
The CENIC-AI Resource: The Right Connection
The Pacific Research Platform: The First Six Years
The NSF Grants Leading Up to CHASE-CI ENS
Integrated Optical Fiber/Wireless Systems for Environmental Monitoring
Toward a National Research Platform to Enable Data-Intensive Open-Source Sci...
Toward a National Research Platform to Enable Data-Intensive Computing

Recently uploaded (20)

PPTX
A presentation on AMPUTATION with special focus on orthopaedics
PPTX
bonding.pptx............................
PPTX
Drugs used in treatment of Malaria. Antimalarial Drugs.pptx
PPTX
Applied PSYCHOLOGY-FOR-BSC-Chapter-2.pptx
PPTX
Maternal and child health. The normal new born.pptx
PPTX
A brief presentation on Supportive Psychotherapy
PDF
Cardiovascular Disease & Obesity - Dr Cliff Wong
PPTX
applied physics dental materials basic principles
PPTX
Direct ELISA - procedure and application.pptx
PPTX
SlideEgg_100085- World Mental Health Day.pptx
PPTX
The Principle of Naturopathy Self-healing, toxin removal and balance
PPTX
OccupationalhealthPPT1Phealthinindustriesandsafety.pptx
PPTX
Non-Variceal-Upper-GI-Bleeding_-Comprehensive-Review_121037.pptx
PPTX
RENAL IMAGING MODALITIES-RENAL NURSING.pptx
PDF
PHARMACODYNAMICS_OF_AYURVEDIC_DRUGS_____A_PERSPECTIVE_VIEW_ijariie18418.pdf
PDF
Indonesian Healthtech Innovation_11Sep2019_Industry_Geraldine Seow_1.pdf
PDF
CASE PRESENTATION1.pdf bipolar disorder in which both mania and depression h...
PDF
Gastro Retentive Drug Delivery System.pdf
PPTX
Skeletal System presentation for high school
PPTX
1-back pain presentation presentation .pptx
A presentation on AMPUTATION with special focus on orthopaedics
bonding.pptx............................
Drugs used in treatment of Malaria. Antimalarial Drugs.pptx
Applied PSYCHOLOGY-FOR-BSC-Chapter-2.pptx
Maternal and child health. The normal new born.pptx
A brief presentation on Supportive Psychotherapy
Cardiovascular Disease & Obesity - Dr Cliff Wong
applied physics dental materials basic principles
Direct ELISA - procedure and application.pptx
SlideEgg_100085- World Mental Health Day.pptx
The Principle of Naturopathy Self-healing, toxin removal and balance
OccupationalhealthPPT1Phealthinindustriesandsafety.pptx
Non-Variceal-Upper-GI-Bleeding_-Comprehensive-Review_121037.pptx
RENAL IMAGING MODALITIES-RENAL NURSING.pptx
PHARMACODYNAMICS_OF_AYURVEDIC_DRUGS_____A_PERSPECTIVE_VIEW_ijariie18418.pdf
Indonesian Healthtech Innovation_11Sep2019_Industry_Geraldine Seow_1.pdf
CASE PRESENTATION1.pdf bipolar disorder in which both mania and depression h...
Gastro Retentive Drug Delivery System.pdf
Skeletal System presentation for high school
1-back pain presentation presentation .pptx

Quantified Self On Being A Personal Genomic Observatory

  • 1. “Quantified Self- On Being a Personal Genomic Observatory” Keynote in the “Humans as Genomic Observatories” Meeting Session in the Genomics Standards Consortium GSC 15 April 24, 2013 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD http://lsmarr.calit2.net 1
  • 2. Calit2 Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis (CAMERA) 512 Processors ~5 Teraflops Source: Phil Papadopoulos, SDSC, Calit2 ~ 200 Terabytes Storage 1GbE and 10GbE Switched/ Routed Core ~200TB Sun X4500 Storage 10GbE 5000 Users 90 Countries
  • 3. Access to Computing Resources Tailored by User’s Requirements and Resources NSF/SDSC Gordon Infrastructure Services Extend CAMERA Computations to 3rd Party Compute Resources UCSD Triton NSF/SDSC Trestles NSF/RCAC Steele NSF/TACC Lonestar NSF/TACC Ranger Core CAMERA HPC Resource EAGER: Multi-Domain, Workflow-Driven Computation System for Microbial Ecology Research and Analysis Source: Jeff Grethe, CRBS, UCSD
  • 4. CAMERA A Community Gateway to Data & Analysis Functions Data Data Analysis
  • 5. Marine Genome Sequencing Project – CAMERA Anchor Dataset Launched March 13, 2007 Measuring the Genetic Diversity of Ocean Microbes Specify Ocean Data Each Sample ~2000 Microbial Species
  • 6. The Human Microbiome Is a Microbial Environment Being Metagenomically Sampled
  • 7. CAMERA and NIH Funded Weizhong Li Group’s Metagenomic Computational NextGen Sequencing Pipeline Raw reads Reads QC HQ reads: Filter human Bowtie/BWA against Human genome and mRNAs Unique reads CD-HIT-Dup For single or PE reads Read recruitment Filter errors Further filtered reads Filtered reads Filter duplicate Cluster-based Denoising Assemble Contigs Velvet, SOAPdenovo, Abyss ------- K-mer setting Mapping BWA Bowtie Contigs with Abundance Taxonomy binning FR-HIT against Non-redundant microbial genomes FRV Visualization tRNA-scan rRNA - HMM tRNAs rRNAs ORFs ORF-finder Megagene Cd-hit at 95% Non redundant ORFs Cd-hit at 60% Core ORF clusters Cd-hit at 30% 1e-6 Protein families Function Pathway Annotation Pfam Tigrfam COG KOG PRK KEGG eggNOG Hmmer RPS-blast blast PI: (Weizhong Li, UCSD): NIH R01HG005978 (2010-2013, $1.1M)
  • 8. What is a “Healthy” Gut Microbiome? Dominated by Bacteroidetes and Firmicute Phyla Source: “Structure, function and diversity of the healthy human microbiome,” HMP Consortium, Nature, 486, 207-212 (2012)
  • 9. To Map My Gut Microbes, I Sent a Stool Sample to the Venter Institute for Metagenomic Sequencing Shipped Stool Sample December 28, 2011 Gel Image of Extract from Smarr Sample-Next is Library Construction Manny Torralba, Project Lead - Human Genomic Medicine J Craig Venter Institute January 25, 2012 I Received a Disk Drive April 3, 2012 With 35 GB FASTQ Files Weizhong Li, UCSD NGS Pipeline: 230M Reads Only 0.2% Human Required 1/2 cpu-yr Per Person Analyzed! Sequencing Funding Provided by UCSD School of Health Sciences
  • 10. Phyla Gut Microbial Abundance Without Viruses: LS, Crohn’s, UC, and Healthy Subjects Source: Weizhong Li, UCSD; Calit2 FuturePatient Expedition Crohn’s Ulcerative LS Healthy Colitis Toward Noninvasive Microbial Ecology Diagnostics
  • 11. Almost All Abundant Species (≥1%) in Healthy Subjects Are Severely Depleted in LS Gut Source: Sequencing JCVI; Analysis Weizhong Li, UCSD LS December 28, 2011 Stool Sample
  • 12. Top 20 Most Abundant Microbial Species In LS vs. Average Healthy Subject 152x 765x 148x 849x 483x 220x 201x 169x 522x Number Above LS Blue Bar is Multiple of LS Abundance Compared to Average Healthy Abundance Per Species Source: Sequencing JCVI; Analysis Weizhong Li, UCSD LS December 28, 2011 Stool Sample
  • 13. Comparing 3 LS Time Snapshots (Left) with Healthy, Crohn’s, UC (Right Top to Bottom) Calit2 VROOM-FuturePatient Expedition
  • 14. We Find Major Shifts in Microbial Ecology Between Healthy and Two Forms of IBD Collapse of Bacteroidetes Microbiome “Dysbiosis” or “Mass Extinction”? Explosion of Proteobacteria On the IBD Spectrum
  • 15. I Have Massive Reduction in the Families of the Bacteroidetes Phylum in My Gut Calit2 FuturePatient Expedition
  • 16. Major Changes in LS Microbiome Before and After 1 Month Antibiotic & 2 Month Prednisone Therapy Reduced 45x Reduced 90x Therapy Greatly Reduced Two Phyla, But Massive Reduction in Bacteroidetes And Large % Proteobacteria Remain Small Changes With No Therapy How Does One Get Back to a “Healthy” Gut Microbiome?
  • 17. From War to Gardening “I would like to lose the language of warfare,” said Julie Segre, a senior investigator at the National Human Genome Research Institute. ”It does a disservice to all the bacteria that have co-evolved with us and are maintaining the health of our bodies.”
  • 18. From Taxonomy to Function: Analysis of LS Clusters of Orthologous Groups (COGs) Analysis: Weizhong Li & Sitao Wu, UCSD
  • 19. What is Adequate Metadata to Define the Environment of the Human Microbiome? • Need the Variables that Determine Relative Abundances of Microbial Species – Genetics of Host – Immune System Variables – Other Environmental Variables (Food, Antibiotics, etc.) • At What Scale Do We Need These Metadata Variables? – SNPs vs. Full Genome – Medical Tests vs. Proteomics, Metabolomics, Transcriptomics – Phenotyping of Signs and Symptoms