SlideShare a Scribd company logo
Bioinformatics – Discovering the “Bio-Logic” of Nature Robert Cormia Foothill College
Transducing the Genome 50 years after Watson and Crick deduced the structure of DNA… The information molecules of nature now reside as data bits inside computers But what does it all mean? We have ~15 GBytes of genomic data And only just beginning to unravel it
 
‘ Energy Systems’ Before ‘Life’ “ Life” arose on earth almost 4 billion years ago, 1 billion years before cells Long chains of molecules harvesting energy, probably deep below the sea Before DNA, RNA and the sophisticated proteins that we know today There were plenty of sources of energy, but no “choreographed metabolism”
Energy Metabolism
“ In the Beginning” Rock, heat, and some water Early molecules of life Energy moved from rock into sea Molecular networks played in the path Capturing a memory of that process was probably the key to life today
Life on the Sea Floor?
RNA Busy Before Cellular Life
The RNA World There is no way to know how the molecules of life really formed… Amino acids and ribonucleotides have formed in “pre-biotic” experiments RNA molecules, which appear to be both catalysts and templates, are thought to have formed energy networks
RNA Codons and Catalysts
RNA and DNA A, T, C, G, and U A = Adenine T = Thymine C = Cytosine G = Guanine U = Uracil A-T and C-G in DNA A-U and C-G in RNA
Central Dogma of Life
The Genome DNA –  D eoxyribo N ucleic  A cid is the prominent molecule of the genome Genes are formed of lengths of DNA polymers which code for proteins Exons and introns exist in DNA Regulatory regions control transcription and the formation of every protein and enzyme. It is the key to metabolism.
 
DNA at Transcription
The Proteome Proteins form cellular structure and enzymes, which function in metabolism Over 100,000 proteins exist in humans DNA is not enough to run metabolism Proteins have a “run-time” knowledge Proteins control the transcription of DNA and DNA controls formation of proteins
Rubisco Protein – Photosynthesis
RAD Protein Complex
Number of Genes vs. Time
What is Bioinformatics? Molecular biology Ability to sequence DNA Internet databases To store and transmit data Mathematical algorithms To model and solve biological problems Analysis Using the I2I Technology Model
Internet Technologies CPU Networking Data Storage Data Mining Grid Computing Storage Area Networks
Bioinformatics Technologies Informatics IT / Networking Molecular Biology Data Modeling Computational Biology Genomic Databases
A Tool for Biotechnology Bioinformatics creates a set of tools for understanding the mountain of new data In biotechnology, these tools are used to discover how genes and proteins work Computers are used to both analyze and “mine” new data for hidden relationships Discovering the “bio-logic” of nature
From Data to Knowledge
DNA Sequencing
DNA Sequencing Chemical sequencing Molecular sequencing Now about $0.01 per base Human Genome took 10 years Celera sequenced in 3 years Moore’s law applies to biotechnology too In 2010 a single human genome in ~7 days
DNA Sequencing http://www.accessexcellence.org
Gel Enhanced Staining
DNA Micro Arrays Used to monitor gene expression Which genes are active? What are the “co-expressed patterns”? Compare healthy and diseased tissue Extract “expressed” mRNA in cytoplasm Convert mRNA to cDNA Discover relationships of proteins to disease states, and function / location of genes Is becoming the first step in “drug-discovery”
Microarray Output Screen
Microarray Output
Partnering with Pharma Bioinformatics is an industry of tools Biotech is a consumer / user of these tools Pharma needs more “innovation engines” Less than 2 drugs per firm in the ‘pipeline’ Drug discovery creates a new value chain bioinformatics > biotech > ‘big pharma’ Convergence is the modality of innovation
Pharma and Biotech
Drug Discovery Target discovery Target validation Protein interactions Rapid screening The long haul… $800 million / year is spent on drug discovery Over 75% of drug compounds will never work
Drug Development Process
Drug Discovery
“ Pharmaco Genomics” Individualized medicine Looking at SNPs along drug targets What makes each of us – us? 1 million SNPs, about one per intron In the future, each of us will have our genome “insilico” (genome on a chip) Data mining against 6 billion genomes!
Pharmaco Genomics
One Genome There are three very different ways to look at genomic diversity – and all are equally valid! A “collective” human genome 3 billion base pairs – called the ‘golden path’ Each one of us is a unique genome “ I am a genome of one”, my SNPS make me - ‘me’ The Genome on planet earth A collective metabolic evolution and speciation
Terra Genoma
Molecular Networks Genome or Proteome? Proteome of Genome? Wait a minute… What if it’s both? Now what would that look like?
Gene Regulatory Networks
 
Pathway Kinetics
Gene Regulatory Network
Bioinformatics Tools NCBI BLAST, 12 million records, SNP databases ExPASy Swiss-Prot, EMBL, Swiss-Model PIR – Protein Information Resource PDB – Protein Data Bank Pfam – Protein families
NCBI National Center for Biotechnology Information, part of NIH and NLM Funded by US – open to all GenBank and GenPept 13 million entries, 12 billion base pairs Resources include oncology, retroviruses, SNP databases, and much more Sequin submission of raw sequence data
 
NCBI Resources
Retroviruses
BLAST Basic Local Alignment Search Tool Used as a “genomic search engine” Compare your target sequence to the “non-redundant” database of 13B bps. Can search the genomes of species Human, mouse, fly, E.coli etc. ‘ Hits’ return inks to GenBank and GenPept
 
Swiss-Prot Swiss - protein annotated database Protein resource Minimal redundancy, reasonably current protein annotated / integrated database Links to protein structures and properties Links back into GenBank, EMBL, DDBJ Literature resources for submissions
ExPASy The ExPASy ( Ex pert  P rotein  A nalysis  Sy stem)  Proteomics server of the Swiss Institute of Bioinformatics (SIB) is dedicated to analysis of protein sequences and structures Swiss-Prot and PROSITE Links to SWISS-MODEL
PROSITE - Database of Protein Families and Domains
Structure Analysis
Protein Data Bank  SWISS-MODEL Protein Data Bank Archive of .pdb files Structures determined by X-ray, NMR Theoretical Structure Search Features a “Molecule of the Month” http://www.rcsb.org/pdb/
 
PIR Protein Information Resource i ProClass and PRI-NREF PIR-PSD, Swiss-Prot, TrEMBL, RefSeq, GenPept, and PDB http://pir.georgetown.edu/ Integrated public resource of protein informatics Supports genomic and proteomic research and scientific discovery -  i ProClass and PRI-NREF
 
Pfam Protein family comparisons Look at multiple alignments  View protein domain architectures  Examine species distribution  Follow links to other databases  View known protein structures Follow ‘conserved domains’ from BLASTp searches of protein databases
 
The Grand Challenge
The Technology Roadmap Genomics 1995 to 2005 Proteomics 2000 to 2010 Systems biology 2005 to 2015 Genetic remodeling / re-engineering 2010 to 2020 Generation Phi Children born in 2025 may never know disease
Convergence of Biotech & Pharma Genomics Proteomics Systems biology Pharmaco genomics Genetic engineering
Mouse Genome
 
Gene Therapy Somatic Gene Therapy Therapeutic Gene Therapy Incorporate “missing genes” Remove cells from host organism Amplify target cells Insert gene using (viral) vector Return target cells into host organism Insulin gene was one of the first trials
 
Labeling Active Genes  Along Chromosomes
Transgenic Species
Designer Flies – Is Blue Cool?
Your Own Private Genome
Surfing the Genome Internet technologies Connecting users, tools, and data Molecular biology Racing forward a top Moore’s Law Informatics Mathematical interrogation of nature’s secrets Surfing the Genome! Discovering the “bio-logic” of Nature http://www.SurfingTheGenome.us/  Spring 2003
Contact Information Robert D. Cormia Foothill College [email_address] http://www.informaticus.org/ 650 747 1588 Surfing the Genome – Spring 2003

More Related Content

What's hot (20)

PDF
Basics of bioinformatics
Abhishek Vatsa
 
PDF
EMBL- European Molecular Biology Laboratory
Thapar Institute of Engineering & Technology, Patiala, Punjab, India
 
PPT
Phylogeny
martyynyyte
 
PPT
Biological databases
Malla Reddy College of Pharmacy
 
PPTX
BIOLOGICAL SEQUENCE DATABASES
nadeem akhter
 
PPTX
Prosite
Rashi Srivastava
 
PPTX
Bioinformatics
Arockiyajainmary
 
PPTX
ClinVar: Getting the most from the reference assembly and reference materials
Genome Reference Consortium
 
PPT
swiss-prot<bioinformatics>
Pardeep kaushal
 
PPT
Data Base in Bioinformatics.ppt
Bangaluru
 
PPT
Sequence file formats
Alphonsa Joseph
 
PPTX
Phylogenetic tree and its construction and phylogeny of
bhavnesthakur
 
PPTX
Kegg databse
Rashi Srivastava
 
PPTX
phylogenetic analysis.pptx
Dr. Vimal Priya subramanian
 
PPTX
Protein information resource (PIR)
ShivaniShewale2
 
PPTX
Phylogenetic tree
Sanzid Kawsar
 
PPTX
Protein Data Bank ( PDB ) - Bioinformatics
karmandeepkaur7
 
PDF
Bioinformatics.Practical Notebook
Naima Tahsin
 
PPTX
Molecular Phylogenetics
Meghaj Mallick
 
PDF
Blast bioinformatics
atmapandey
 
Basics of bioinformatics
Abhishek Vatsa
 
Phylogeny
martyynyyte
 
Biological databases
Malla Reddy College of Pharmacy
 
BIOLOGICAL SEQUENCE DATABASES
nadeem akhter
 
Bioinformatics
Arockiyajainmary
 
ClinVar: Getting the most from the reference assembly and reference materials
Genome Reference Consortium
 
swiss-prot<bioinformatics>
Pardeep kaushal
 
Data Base in Bioinformatics.ppt
Bangaluru
 
Sequence file formats
Alphonsa Joseph
 
Phylogenetic tree and its construction and phylogeny of
bhavnesthakur
 
Kegg databse
Rashi Srivastava
 
phylogenetic analysis.pptx
Dr. Vimal Priya subramanian
 
Protein information resource (PIR)
ShivaniShewale2
 
Phylogenetic tree
Sanzid Kawsar
 
Protein Data Bank ( PDB ) - Bioinformatics
karmandeepkaur7
 
Bioinformatics.Practical Notebook
Naima Tahsin
 
Molecular Phylogenetics
Meghaj Mallick
 
Blast bioinformatics
atmapandey
 

Viewers also liked (20)

PPTX
BLAST [Basic Alignment Local Search Tool]
BiotechOnline
 
PPTX
BLAST (Basic local alignment search Tool)
Ariful Islam Sagar
 
PPTX
NGS: bioinformatic challenges
Lex Nederbragt
 
PPTX
Basic Local Alignment Search Tool (BLAST)
Asiri Wijesinghe
 
PPT
PROTEIN STRUCTURE DATABANK
Malvika Bansal
 
PPTX
Bioinformatics Course
USD Bioinformatics
 
PPT
BLAST(Basic Local Alignment Tool)
Sobia
 
PPT
Blast fasta 4
Er Puspendra Tripathi
 
PPT
Dna protein synthesis_ppt
Karl Pointer
 
PPT
Cell Cycle, Dna, And Protein Synthesis Notes New
Fred Phillips
 
PPTX
Gene identification and discovery
Amit Ruchi Yadav
 
PPTX
Bioinformatics Final Presentation
Shruthi Choudary
 
PPTX
Major databases in bioinformatics
Vidya Kalaivani Rajkumar
 
PDF
Gene prediction methods vijay
Vijay Hemmadi
 
PPTX
Application of bioinformatics
Kamlesh Patade
 
PPT
Application of Bioinformatics in different fields of sciences
Sobia
 
PPT
Protein structure classification
Malla Reddy College of Pharmacy
 
PPT
Bioinformatics
biinoida
 
BLAST [Basic Alignment Local Search Tool]
BiotechOnline
 
BLAST (Basic local alignment search Tool)
Ariful Islam Sagar
 
NGS: bioinformatic challenges
Lex Nederbragt
 
Basic Local Alignment Search Tool (BLAST)
Asiri Wijesinghe
 
PROTEIN STRUCTURE DATABANK
Malvika Bansal
 
Bioinformatics Course
USD Bioinformatics
 
BLAST(Basic Local Alignment Tool)
Sobia
 
Blast fasta 4
Er Puspendra Tripathi
 
Dna protein synthesis_ppt
Karl Pointer
 
Cell Cycle, Dna, And Protein Synthesis Notes New
Fred Phillips
 
Gene identification and discovery
Amit Ruchi Yadav
 
Bioinformatics Final Presentation
Shruthi Choudary
 
Major databases in bioinformatics
Vidya Kalaivani Rajkumar
 
Gene prediction methods vijay
Vijay Hemmadi
 
Application of bioinformatics
Kamlesh Patade
 
Application of Bioinformatics in different fields of sciences
Sobia
 
Protein structure classification
Malla Reddy College of Pharmacy
 
Bioinformatics
biinoida
 
Ad

Similar to Bioinformatics - Discovering the Bio Logic Of Nature (20)

PDF
Bioinformatics manual
kvnsai
 
PPTX
Introduction to Biological database ppt(1).pptx
RAJESHKUMAR428748
 
PPTX
Bioinformatics- An overwiew..................
SoumitraNath9
 
PPT
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
KelechiChukwuemeka
 
PPTX
Introduction to bioinformatics and databases .pptx
ManjuM90
 
PDF
Introduction to Bioinformatics 2025.....pdf
omniaabdo276
 
PPTX
Bioinformatics final
Rainu Rajeev
 
PPTX
Biological database ppt(1).pptx Introuction
RAJESHKUMAR428748
 
PPTX
bioinformatics simple
nadeem akhter
 
PPTX
Introduction to Bioinformatics: Part 2
AhmedAbdElMoniem35
 
PPT
Intro bioinfo
Vinitha Nair
 
PPT
Intro bioinfo
Vinitha Nair
 
PPTX
Introduction
Naeem Ahmed
 
PDF
Bioinformatics issues and challanges presentation at s p college
SKUASTKashmir
 
PPTX
Bioinformatics .pptx
UpendraSharmaUS1
 
PPTX
bioinformatics-200510115939.pptx introduction
RAJESHKUMAR428748
 
PPT
BIOINFORMATICS.ppt
TSaiteja2
 
PPT
An Introductory lecture on BIOINFORMATICS
SanghamitraSanyal2
 
PPT
BIOINFORMATICS.ppt History and applications
RAJESHKUMAR428748
 
PDF
BITS: Overview of important biological databases beyond sequences
BITS
 
Bioinformatics manual
kvnsai
 
Introduction to Biological database ppt(1).pptx
RAJESHKUMAR428748
 
Bioinformatics- An overwiew..................
SoumitraNath9
 
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
KelechiChukwuemeka
 
Introduction to bioinformatics and databases .pptx
ManjuM90
 
Introduction to Bioinformatics 2025.....pdf
omniaabdo276
 
Bioinformatics final
Rainu Rajeev
 
Biological database ppt(1).pptx Introuction
RAJESHKUMAR428748
 
bioinformatics simple
nadeem akhter
 
Introduction to Bioinformatics: Part 2
AhmedAbdElMoniem35
 
Intro bioinfo
Vinitha Nair
 
Intro bioinfo
Vinitha Nair
 
Introduction
Naeem Ahmed
 
Bioinformatics issues and challanges presentation at s p college
SKUASTKashmir
 
Bioinformatics .pptx
UpendraSharmaUS1
 
bioinformatics-200510115939.pptx introduction
RAJESHKUMAR428748
 
BIOINFORMATICS.ppt
TSaiteja2
 
An Introductory lecture on BIOINFORMATICS
SanghamitraSanyal2
 
BIOINFORMATICS.ppt History and applications
RAJESHKUMAR428748
 
BITS: Overview of important biological databases beyond sequences
BITS
 
Ad

More from Robert Cormia (20)

PPT
Foothill college nanoscience program
Robert Cormia
 
PPT
Carl djerassi
Robert Cormia
 
PPT
Understanding earth’s greenhouse
Robert Cormia
 
PPT
SPIE scanning microscopy
Robert Cormia
 
PDF
Turn down the heat final project notes
Robert Cormia
 
PPT
Turn down the heat
Robert Cormia
 
PPT
Building the electron economy
Robert Cormia
 
PPT
Foothill College Energy System
Robert Cormia
 
PPT
Career booster – jit skilling
Robert Cormia
 
PPT
Energy, carbon, climate, action
Robert Cormia
 
PPT
From lab to fab training for the innovation value chain
Robert Cormia
 
PPT
College campus energy & climate plan
Robert Cormia
 
PPT
Total world energy
Robert Cormia
 
PPT
Surface and Materials Analysis Techniques
Robert Cormia
 
PPT
NANO 53 Course Overview
Robert Cormia
 
PPT
Nano53 course overview
Robert Cormia
 
PPT
The Case for Materials Characterization
Robert Cormia
 
PPT
NANO 53 course overview
Robert Cormia
 
PPT
A World of Energy
Robert Cormia
 
PPT
ENGR 40 orientation
Robert Cormia
 
Foothill college nanoscience program
Robert Cormia
 
Carl djerassi
Robert Cormia
 
Understanding earth’s greenhouse
Robert Cormia
 
SPIE scanning microscopy
Robert Cormia
 
Turn down the heat final project notes
Robert Cormia
 
Turn down the heat
Robert Cormia
 
Building the electron economy
Robert Cormia
 
Foothill College Energy System
Robert Cormia
 
Career booster – jit skilling
Robert Cormia
 
Energy, carbon, climate, action
Robert Cormia
 
From lab to fab training for the innovation value chain
Robert Cormia
 
College campus energy & climate plan
Robert Cormia
 
Total world energy
Robert Cormia
 
Surface and Materials Analysis Techniques
Robert Cormia
 
NANO 53 Course Overview
Robert Cormia
 
Nano53 course overview
Robert Cormia
 
The Case for Materials Characterization
Robert Cormia
 
NANO 53 course overview
Robert Cormia
 
A World of Energy
Robert Cormia
 
ENGR 40 orientation
Robert Cormia
 

Recently uploaded (20)

DOCX
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
PPTX
Basics and rules of probability with real-life uses
ravatkaran694
 
PPTX
Gupta Art & Architecture Temple and Sculptures.pptx
Virag Sontakke
 
PDF
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
Nguyen Thanh Tu Collection
 
PPTX
Cleaning Validation Ppt Pharmaceutical validation
Ms. Ashatai Patil
 
PPTX
Introduction to Probability(basic) .pptx
purohitanuj034
 
PDF
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
PPTX
I INCLUDED THIS TOPIC IS INTELLIGENCE DEFINITION, MEANING, INDIVIDUAL DIFFERE...
parmarjuli1412
 
PPTX
Electrophysiology_of_Heart. Electrophysiology studies in Cardiovascular syste...
Rajshri Ghogare
 
PPTX
LDP-2 UNIT 4 Presentation for practical.pptx
abhaypanchal2525
 
PPTX
The Future of Artificial Intelligence Opportunities and Risks Ahead
vaghelajayendra784
 
PPTX
Translation_ Definition, Scope & Historical Development.pptx
DhatriParmar
 
PPTX
K-Circle-Weekly-Quiz12121212-May2025.pptx
Pankaj Rodey
 
PPTX
ENGLISH 8 WEEK 3 Q1 - Analyzing the linguistic, historical, andor biographica...
OliverOllet
 
PPTX
Digital Professionalism and Interpersonal Competence
rutvikgediya1
 
PPTX
Continental Accounting in Odoo 18 - Odoo Slides
Celine George
 
PPTX
Command Palatte in Odoo 18.1 Spreadsheet - Odoo Slides
Celine George
 
PDF
John Keats introduction and list of his important works
vatsalacpr
 
PPTX
PROTIEN ENERGY MALNUTRITION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
Basics and rules of probability with real-life uses
ravatkaran694
 
Gupta Art & Architecture Temple and Sculptures.pptx
Virag Sontakke
 
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
Nguyen Thanh Tu Collection
 
Cleaning Validation Ppt Pharmaceutical validation
Ms. Ashatai Patil
 
Introduction to Probability(basic) .pptx
purohitanuj034
 
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
I INCLUDED THIS TOPIC IS INTELLIGENCE DEFINITION, MEANING, INDIVIDUAL DIFFERE...
parmarjuli1412
 
Electrophysiology_of_Heart. Electrophysiology studies in Cardiovascular syste...
Rajshri Ghogare
 
LDP-2 UNIT 4 Presentation for practical.pptx
abhaypanchal2525
 
The Future of Artificial Intelligence Opportunities and Risks Ahead
vaghelajayendra784
 
Translation_ Definition, Scope & Historical Development.pptx
DhatriParmar
 
K-Circle-Weekly-Quiz12121212-May2025.pptx
Pankaj Rodey
 
ENGLISH 8 WEEK 3 Q1 - Analyzing the linguistic, historical, andor biographica...
OliverOllet
 
Digital Professionalism and Interpersonal Competence
rutvikgediya1
 
Continental Accounting in Odoo 18 - Odoo Slides
Celine George
 
Command Palatte in Odoo 18.1 Spreadsheet - Odoo Slides
Celine George
 
John Keats introduction and list of his important works
vatsalacpr
 
PROTIEN ENERGY MALNUTRITION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 

Bioinformatics - Discovering the Bio Logic Of Nature

  • 1. Bioinformatics – Discovering the “Bio-Logic” of Nature Robert Cormia Foothill College
  • 2. Transducing the Genome 50 years after Watson and Crick deduced the structure of DNA… The information molecules of nature now reside as data bits inside computers But what does it all mean? We have ~15 GBytes of genomic data And only just beginning to unravel it
  • 3.  
  • 4. ‘ Energy Systems’ Before ‘Life’ “ Life” arose on earth almost 4 billion years ago, 1 billion years before cells Long chains of molecules harvesting energy, probably deep below the sea Before DNA, RNA and the sophisticated proteins that we know today There were plenty of sources of energy, but no “choreographed metabolism”
  • 6. “ In the Beginning” Rock, heat, and some water Early molecules of life Energy moved from rock into sea Molecular networks played in the path Capturing a memory of that process was probably the key to life today
  • 7. Life on the Sea Floor?
  • 8. RNA Busy Before Cellular Life
  • 9. The RNA World There is no way to know how the molecules of life really formed… Amino acids and ribonucleotides have formed in “pre-biotic” experiments RNA molecules, which appear to be both catalysts and templates, are thought to have formed energy networks
  • 10. RNA Codons and Catalysts
  • 11. RNA and DNA A, T, C, G, and U A = Adenine T = Thymine C = Cytosine G = Guanine U = Uracil A-T and C-G in DNA A-U and C-G in RNA
  • 13. The Genome DNA – D eoxyribo N ucleic A cid is the prominent molecule of the genome Genes are formed of lengths of DNA polymers which code for proteins Exons and introns exist in DNA Regulatory regions control transcription and the formation of every protein and enzyme. It is the key to metabolism.
  • 14.  
  • 16. The Proteome Proteins form cellular structure and enzymes, which function in metabolism Over 100,000 proteins exist in humans DNA is not enough to run metabolism Proteins have a “run-time” knowledge Proteins control the transcription of DNA and DNA controls formation of proteins
  • 17. Rubisco Protein – Photosynthesis
  • 19. Number of Genes vs. Time
  • 20. What is Bioinformatics? Molecular biology Ability to sequence DNA Internet databases To store and transmit data Mathematical algorithms To model and solve biological problems Analysis Using the I2I Technology Model
  • 21. Internet Technologies CPU Networking Data Storage Data Mining Grid Computing Storage Area Networks
  • 22. Bioinformatics Technologies Informatics IT / Networking Molecular Biology Data Modeling Computational Biology Genomic Databases
  • 23. A Tool for Biotechnology Bioinformatics creates a set of tools for understanding the mountain of new data In biotechnology, these tools are used to discover how genes and proteins work Computers are used to both analyze and “mine” new data for hidden relationships Discovering the “bio-logic” of nature
  • 24. From Data to Knowledge
  • 26. DNA Sequencing Chemical sequencing Molecular sequencing Now about $0.01 per base Human Genome took 10 years Celera sequenced in 3 years Moore’s law applies to biotechnology too In 2010 a single human genome in ~7 days
  • 29. DNA Micro Arrays Used to monitor gene expression Which genes are active? What are the “co-expressed patterns”? Compare healthy and diseased tissue Extract “expressed” mRNA in cytoplasm Convert mRNA to cDNA Discover relationships of proteins to disease states, and function / location of genes Is becoming the first step in “drug-discovery”
  • 32. Partnering with Pharma Bioinformatics is an industry of tools Biotech is a consumer / user of these tools Pharma needs more “innovation engines” Less than 2 drugs per firm in the ‘pipeline’ Drug discovery creates a new value chain bioinformatics > biotech > ‘big pharma’ Convergence is the modality of innovation
  • 34. Drug Discovery Target discovery Target validation Protein interactions Rapid screening The long haul… $800 million / year is spent on drug discovery Over 75% of drug compounds will never work
  • 37. “ Pharmaco Genomics” Individualized medicine Looking at SNPs along drug targets What makes each of us – us? 1 million SNPs, about one per intron In the future, each of us will have our genome “insilico” (genome on a chip) Data mining against 6 billion genomes!
  • 39. One Genome There are three very different ways to look at genomic diversity – and all are equally valid! A “collective” human genome 3 billion base pairs – called the ‘golden path’ Each one of us is a unique genome “ I am a genome of one”, my SNPS make me - ‘me’ The Genome on planet earth A collective metabolic evolution and speciation
  • 41. Molecular Networks Genome or Proteome? Proteome of Genome? Wait a minute… What if it’s both? Now what would that look like?
  • 43.  
  • 46. Bioinformatics Tools NCBI BLAST, 12 million records, SNP databases ExPASy Swiss-Prot, EMBL, Swiss-Model PIR – Protein Information Resource PDB – Protein Data Bank Pfam – Protein families
  • 47. NCBI National Center for Biotechnology Information, part of NIH and NLM Funded by US – open to all GenBank and GenPept 13 million entries, 12 billion base pairs Resources include oncology, retroviruses, SNP databases, and much more Sequin submission of raw sequence data
  • 48.  
  • 51. BLAST Basic Local Alignment Search Tool Used as a “genomic search engine” Compare your target sequence to the “non-redundant” database of 13B bps. Can search the genomes of species Human, mouse, fly, E.coli etc. ‘ Hits’ return inks to GenBank and GenPept
  • 52.  
  • 53. Swiss-Prot Swiss - protein annotated database Protein resource Minimal redundancy, reasonably current protein annotated / integrated database Links to protein structures and properties Links back into GenBank, EMBL, DDBJ Literature resources for submissions
  • 54. ExPASy The ExPASy ( Ex pert P rotein A nalysis Sy stem) Proteomics server of the Swiss Institute of Bioinformatics (SIB) is dedicated to analysis of protein sequences and structures Swiss-Prot and PROSITE Links to SWISS-MODEL
  • 55. PROSITE - Database of Protein Families and Domains
  • 57. Protein Data Bank SWISS-MODEL Protein Data Bank Archive of .pdb files Structures determined by X-ray, NMR Theoretical Structure Search Features a “Molecule of the Month” http://www.rcsb.org/pdb/
  • 58.  
  • 59. PIR Protein Information Resource i ProClass and PRI-NREF PIR-PSD, Swiss-Prot, TrEMBL, RefSeq, GenPept, and PDB http://pir.georgetown.edu/ Integrated public resource of protein informatics Supports genomic and proteomic research and scientific discovery - i ProClass and PRI-NREF
  • 60.  
  • 61. Pfam Protein family comparisons Look at multiple alignments View protein domain architectures Examine species distribution Follow links to other databases View known protein structures Follow ‘conserved domains’ from BLASTp searches of protein databases
  • 62.  
  • 64. The Technology Roadmap Genomics 1995 to 2005 Proteomics 2000 to 2010 Systems biology 2005 to 2015 Genetic remodeling / re-engineering 2010 to 2020 Generation Phi Children born in 2025 may never know disease
  • 65. Convergence of Biotech & Pharma Genomics Proteomics Systems biology Pharmaco genomics Genetic engineering
  • 67.  
  • 68. Gene Therapy Somatic Gene Therapy Therapeutic Gene Therapy Incorporate “missing genes” Remove cells from host organism Amplify target cells Insert gene using (viral) vector Return target cells into host organism Insulin gene was one of the first trials
  • 69.  
  • 70. Labeling Active Genes Along Chromosomes
  • 72. Designer Flies – Is Blue Cool?
  • 74. Surfing the Genome Internet technologies Connecting users, tools, and data Molecular biology Racing forward a top Moore’s Law Informatics Mathematical interrogation of nature’s secrets Surfing the Genome! Discovering the “bio-logic” of Nature http://www.SurfingTheGenome.us/ Spring 2003
  • 75. Contact Information Robert D. Cormia Foothill College [email_address] http://www.informaticus.org/ 650 747 1588 Surfing the Genome – Spring 2003