SlideShare a Scribd company logo
Dmitry Grapov, PhD
Gene Ontology Network
Enrichment Analysis
Download all material for the tutorial
https://sourceforge.net/projects/teachingdemos/files/2014%2
https://sourceforge.net/projects/teachingdemos/files/
Choose 2014 UC Davis Proteomics Workshop or use the
full URL below
• decrease
• increase
Use functional analysis to identify if the changes in variables
are enriched (increased compared to random chance) for
some biological pathway, domain or ontological category.
Enrichment or Overrepresentation analysis
Biochemical Pathway Biochemical Ontology
Major Tasks
Using the proteins listed in the excel workbook: ‘proteomic data for
analysis.xlsx’ and worksheet: ‘protein IDs’
1. Conduct Gene Ontology (GO) Enrichment Analysis using
DAVID Bioinformatics Resources
http://david.abcc.ncifcrf.gov/home.jsp
2. Investigate enriched terms using
Quick GO http://www.ebi.ac.uk/QuickGO/
3. Summaries and visualize the results using
REVIGO http://revigo.irb.hr/
4. Create and modify GO network using
Cytoscape http://www.cytoscape.org/
Protein IDs
Common protein identifier
UniProt/SwissProt Accession
(default in scaffold)
http://www.uniprot.org/
Use Biomart to translate to other
database IDS
http://www.biomart.org/
e.g. gene symbols
David Bioinformatics Resources
David Bioinformatics Resources
1. Upload list
2. Choose ID
type
3. Select list
type
4. Submit
David Bioinformatics Resources
organism Make sure all IDs were recognized
List of
biochemical
databases tested
for enrichment
David Bioinformatics Resources
List of
biochemical
databases tested
for enrichment
1. Choose GO
David Bioinformatics Resources
http://david.abcc.ncifcrf.gov/helps/functional_annotation.html#E3
David Bioinformatics Resources
List of
biochemical
databases tested
for enrichment
1. Overview
BP: Biological
process
2. Select
David Bioinformatics Resources
http://david.abcc.ncifcrf.gov/helps/functional_annotation.html#E3
David Bioinformatics Resources
1. Overview most enriched term
Quick GO http://www.ebi.ac.uk/QuickGO/
1. View children (lower hierarchy subsets) of this term
David Bioinformatics Resources/Quick GO
1. Can you identify any enriched
children of this term in our DAVID
output?
?
2. Download
results
Overview and Format Results in Excel
1. Save results 2. Open in MS Excel
Overview Results
Modified Fisher’s Exact Test p-value
optionally: Check in R
x<-data.frame(user=c(1,47),genome=c(690,13528))
fisher.test(x) # p-value = 5.41e-06
(13/47) / (690/13528)
Alternative to Fisher Exact Test:
Hypergeometric Test
How to calculate statistics to determine enrichment?
hit.num = 51 # number of significantly changed pathway variables
set.num = 1455 # number of variables in pathway
full = 3358 # all possible variables in organism
q.size = 72 # number of significantly changed variables
phyper(hit.num-1, set.num, full-set.num, q.size, lower.tail=F)
enrichment p-value = 1.717553e-06
Visualization Options
Challenges:
•Removal of redundant information
•Visualizing term relationships (term-term, term-protein)
Use REVIGO to filter redundant terms
http://revigo.irb.hr/
prepare input (term, p-value)
1. Upload to
REVIGO
Supek F, Bošnjak M, Škunca N, Šmuc T. "REVIGO summarizes and visualizes long lists of Gene Ontology terms" PLoS ONE 2011. doi:10.1371/journal.pone.0021800
2. Run
REVIGO: overview scatterplot
Position defined on similarity (MDS)
REVIGO: overview table
Cluster leaders prioritized based on enrichment p-value
REVIGO: network
• Edges: 3% of the
strongest GO term
pairwise similarities
• Node size: generality
of term
(small = specific)
• Node color: p-value
Download network
Cytoscape
1. Open Cytoscape
Import REVIGO network into cytoscape
2
3 4
Cytoscape: set layout and defaults
1. Set layout 3. Set network defaults
2
4 5
Cytoscape: map data to network properties
1. Set Edge width and color 2. Set Node labels, size and color
Cytoscape: overview network components
Download edge information
1
2
3. View in excel
Download node information
1
2
3. View in excel
Bonus: Modify Edge and Node Attributes to show
term to protein connections
See file ‘test edge.xlsx’ and ‘test node.xslx, for examples of upload
formats
See detailed instructions at http://www.slideshare.net/dgrapov/demonstration-of-network-mapping
See more Statistical and Multivariate Analysis Examples at
http://imdevsoftware.wordpress.com/tutorials/
Questions?
dgrapov@ucdavis.edu
This research was supported in part by NIH 1 U24 DK097154

More Related Content

PPT
Systems biology & Approaches of genomics and proteomics
sonam786
 
PPT
Genomics
Pubudu Gokarella
 
PDF
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Manikhandan Mudaliar
 
PPTX
Introduction to systems biology
lemberger
 
PPTX
bioinformatics simple
nadeem akhter
 
PPTX
Integrative omics approches
Sayali Magar
 
PPTX
Protein structure
Pooja Pawar
 
PDF
Functional genomics
Sreenivasa Reddy Thalla
 
Systems biology & Approaches of genomics and proteomics
sonam786
 
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Manikhandan Mudaliar
 
Introduction to systems biology
lemberger
 
bioinformatics simple
nadeem akhter
 
Integrative omics approches
Sayali Magar
 
Protein structure
Pooja Pawar
 
Functional genomics
Sreenivasa Reddy Thalla
 

What's hot (20)

PPTX
Gene tagging
PRANJAL GUPTA
 
PPTX
BLAST and sequence alignment
Bioinformatics and Computational Biosciences Branch
 
PPTX
Ensembl annotation
Genome Reference Consortium
 
PPT
Metagenomic
kumarkanika
 
PPTX
Protein microarray
Ghalia Nawal
 
PDF
Genomic Data Analysis
Data Driven Innovation
 
PPT
Gene Ontology Project
vaibhavdeoda
 
PPTX
Bioinformatics.pptx
shewaademu
 
PPTX
Bioinformatics
Bivek Rai
 
PPTX
Comparison between RNASeq and Microarray for Gene Expression Analysis
Yaoyu Wang
 
PPTX
Kegg database resources
innocent87
 
PPT
Gene Ontology Network Enrichment Analysis
UC Davis
 
PPTX
String.pptx
RitikaChoudhary57
 
PPTX
gene prediction programs
MugdhaSharma11
 
PPT
Genomics seminar copy
manjunatha s s
 
PDF
Epigenetics & Epigenomics .pdf
shinycthomas
 
PPTX
Southern blotting
Health Tech Facts
 
PPT
Biological data base
kishoreGupta17
 
Gene tagging
PRANJAL GUPTA
 
Ensembl annotation
Genome Reference Consortium
 
Metagenomic
kumarkanika
 
Protein microarray
Ghalia Nawal
 
Genomic Data Analysis
Data Driven Innovation
 
Gene Ontology Project
vaibhavdeoda
 
Bioinformatics.pptx
shewaademu
 
Bioinformatics
Bivek Rai
 
Comparison between RNASeq and Microarray for Gene Expression Analysis
Yaoyu Wang
 
Kegg database resources
innocent87
 
Gene Ontology Network Enrichment Analysis
UC Davis
 
String.pptx
RitikaChoudhary57
 
gene prediction programs
MugdhaSharma11
 
Genomics seminar copy
manjunatha s s
 
Epigenetics & Epigenomics .pdf
shinycthomas
 
Southern blotting
Health Tech Facts
 
Biological data base
kishoreGupta17
 
Ad

Viewers also liked (20)

PPT
Prote-OMIC Data Analysis and Visualization
Dmitry Grapov
 
PPTX
6 metabolite enrichment analysis
Dmitry Grapov
 
PPTX
High Dimensional Biological Data Analysis and Visualization
Dmitry Grapov
 
PPTX
4 partial least squares modeling
Dmitry Grapov
 
PPTX
1 statistical analysis
Dmitry Grapov
 
PPTX
5 data analysis case study
Dmitry Grapov
 
PPTX
3 principal components analysis
Dmitry Grapov
 
PPTX
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
PPTX
7 network mapping i
Dmitry Grapov
 
PPTX
0 introduction
Dmitry Grapov
 
PPT
Multivarite and network tools for biological data analysis
Dmitry Grapov
 
PPTX
2 cluster analysis
Dmitry Grapov
 
PDF
Functional And Pathway Analysis 2010
Stewart MacArthur
 
PPTX
Connecting Metabolomic Data with Context
Dmitry Grapov
 
PPTX
Data Normalization Approaches for Large-scale Biological Studies
Dmitry Grapov
 
PPTX
Mapping to the Metabolomic Manifold
Dmitry Grapov
 
PPTX
Introduction to Network Mapping
Dmitry Grapov
 
PDF
GoTermsAnalysisWithR
Aureliano Bombarely
 
PPTX
Automation of (Biological) Data Analysis and Report Generation
Dmitry Grapov
 
PDF
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Stian Soiland-Reyes
 
Prote-OMIC Data Analysis and Visualization
Dmitry Grapov
 
6 metabolite enrichment analysis
Dmitry Grapov
 
High Dimensional Biological Data Analysis and Visualization
Dmitry Grapov
 
4 partial least squares modeling
Dmitry Grapov
 
1 statistical analysis
Dmitry Grapov
 
5 data analysis case study
Dmitry Grapov
 
3 principal components analysis
Dmitry Grapov
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
7 network mapping i
Dmitry Grapov
 
0 introduction
Dmitry Grapov
 
Multivarite and network tools for biological data analysis
Dmitry Grapov
 
2 cluster analysis
Dmitry Grapov
 
Functional And Pathway Analysis 2010
Stewart MacArthur
 
Connecting Metabolomic Data with Context
Dmitry Grapov
 
Data Normalization Approaches for Large-scale Biological Studies
Dmitry Grapov
 
Mapping to the Metabolomic Manifold
Dmitry Grapov
 
Introduction to Network Mapping
Dmitry Grapov
 
GoTermsAnalysisWithR
Aureliano Bombarely
 
Automation of (Biological) Data Analysis and Report Generation
Dmitry Grapov
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Stian Soiland-Reyes
 
Ad

Similar to Gene Ontology Enrichment Network Analysis -Tutorial (20)

PPTX
Omic Data Integration Strategies
Dmitry Grapov
 
PPTX
HyQue: Evaluating scientific Hypotheses using semantic web technologies
Michel Dumontier
 
PPTX
Being Reproducible: SSBSS Summer School 2017
Carole Goble
 
PPTX
Case Study in Linked Data and Semantic Web: Human Genome
David Portnoy
 
PPTX
BioAssay Express: Creating and exploiting assay metadata
Philip Cheung
 
PPT
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
Heather Piwowar
 
PPTX
How can you access PubChem programmatically?
Sunghwan Kim
 
PPTX
Experiences with logic programming in bioinformatics
Chris Mungall
 
PPTX
Supporting researchers in the molecular life sciences Jeff Christiansen
ARDC
 
PDF
ISMB Workshop 2014
Alejandra Gonzalez-Beltran
 
PPTX
Pine education-platform
Jaclyn Williams
 
PDF
Overview of Next Gen Sequencing Data Analysis
Bioinformatics and Computational Biosciences Branch
 
PDF
Open Targets workshop at C4X in 2019
Denise Carvalho-Silva, PhD
 
PPTX
BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
Chunlei Wu
 
PDF
Gcc talk baltimore july 2014
pratikomics
 
PDF
The Galaxy framework as a unifying bioinformatics solution for multi-omic dat...
pratikomics
 
PPTX
Introduction to Proteogenomics
Yasset Perez-Riverol
 
PPTX
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
PDF
Bio-protocol4446.GO_KEGG_RICE.pdf
ssuserb500f8
 
PPTX
Data Integration vs Transparency: Tackling the tension
Paul Groth
 
Omic Data Integration Strategies
Dmitry Grapov
 
HyQue: Evaluating scientific Hypotheses using semantic web technologies
Michel Dumontier
 
Being Reproducible: SSBSS Summer School 2017
Carole Goble
 
Case Study in Linked Data and Semantic Web: Human Genome
David Portnoy
 
BioAssay Express: Creating and exploiting assay metadata
Philip Cheung
 
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
Heather Piwowar
 
How can you access PubChem programmatically?
Sunghwan Kim
 
Experiences with logic programming in bioinformatics
Chris Mungall
 
Supporting researchers in the molecular life sciences Jeff Christiansen
ARDC
 
ISMB Workshop 2014
Alejandra Gonzalez-Beltran
 
Pine education-platform
Jaclyn Williams
 
Overview of Next Gen Sequencing Data Analysis
Bioinformatics and Computational Biosciences Branch
 
Open Targets workshop at C4X in 2019
Denise Carvalho-Silva, PhD
 
BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
Chunlei Wu
 
Gcc talk baltimore july 2014
pratikomics
 
The Galaxy framework as a unifying bioinformatics solution for multi-omic dat...
pratikomics
 
Introduction to Proteogenomics
Yasset Perez-Riverol
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Bio-protocol4446.GO_KEGG_RICE.pdf
ssuserb500f8
 
Data Integration vs Transparency: Tackling the tension
Paul Groth
 

More from Dmitry Grapov (15)

PDF
R programming for Data Science - A Beginner’s Guide
Dmitry Grapov
 
PDF
Network mapping 101 course
Dmitry Grapov
 
PDF
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Dmitry Grapov
 
PDF
Dmitry Grapov Resume and CV
Dmitry Grapov
 
PPTX
Machine Learning Powered Metabolomic Network Analysis
Dmitry Grapov
 
PPTX
Complex Systems Biology Informed Data Analysis and Machine Learning
Dmitry Grapov
 
PPTX
Data analysis workflows part 1 2015
Dmitry Grapov
 
PPTX
Data analysis workflows part 2 2015
Dmitry Grapov
 
PPTX
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Dmitry Grapov
 
PDF
Case Study: Overview of Metabolomic Data Normalization Strategies
Dmitry Grapov
 
PPTX
Modeling poster
Dmitry Grapov
 
PPTX
3 data normalization (2014 lab tutorial)
Dmitry Grapov
 
PPTX
Normalization of Large-Scale Metabolomic Studies 2014
Dmitry Grapov
 
PPTX
American Society of Mass Spectrommetry Conference 2014
Dmitry Grapov
 
PPTX
Metabolomic data analysis and visualization tools
Dmitry Grapov
 
R programming for Data Science - A Beginner’s Guide
Dmitry Grapov
 
Network mapping 101 course
Dmitry Grapov
 
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Dmitry Grapov
 
Dmitry Grapov Resume and CV
Dmitry Grapov
 
Machine Learning Powered Metabolomic Network Analysis
Dmitry Grapov
 
Complex Systems Biology Informed Data Analysis and Machine Learning
Dmitry Grapov
 
Data analysis workflows part 1 2015
Dmitry Grapov
 
Data analysis workflows part 2 2015
Dmitry Grapov
 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Dmitry Grapov
 
Case Study: Overview of Metabolomic Data Normalization Strategies
Dmitry Grapov
 
Modeling poster
Dmitry Grapov
 
3 data normalization (2014 lab tutorial)
Dmitry Grapov
 
Normalization of Large-Scale Metabolomic Studies 2014
Dmitry Grapov
 
American Society of Mass Spectrommetry Conference 2014
Dmitry Grapov
 
Metabolomic data analysis and visualization tools
Dmitry Grapov
 

Recently uploaded (20)

PDF
A water-rich interior in the temperate sub-Neptune K2-18 b revealed by JWST
Sérgio Sacani
 
PPT
1. Basic Principles of Medical Microbiology Part 1.ppt
separatedwalk
 
PPTX
Role of GIS in precision farming.pptx
BikramjitDeuri
 
PDF
Renewable Energy Resources (Solar, Wind, Nuclear, Geothermal) Presentation
RimshaNaeem23
 
PDF
NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS...
Sérgio Sacani
 
PDF
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
PPTX
The Toxic Effects of Aflatoxin B1 and Aflatoxin M1 on Kidney through Regulati...
OttokomaBonny
 
PPTX
Nanofertilizer: Its potential benefits and associated challenges.pptx
BikramjitDeuri
 
PPTX
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
PPTX
Cell Structure and Organelles Slides PPT
JesusNeyra8
 
PPTX
Pengenalan Sel dan organisasi kehidupanpptx
SuntiEkaprawesti1
 
PDF
Systems Biology: Integrating Engineering with Biological Research (www.kiu.a...
publication11
 
PPTX
Q1_Science 8_Week4-Day 5.pptx science re
AizaRazonado
 
PPTX
Nature of Science and the kinds of models used in science
JocelynEvascoRomanti
 
PPTX
Quality control test for plastic & metal.pptx
shrutipandit17
 
PPTX
Reticular formation_nuclei_afferent_efferent
muralinath2
 
PPTX
Brain_stem_Medulla oblongata_functions of pons_mid brain
muralinath2
 
PPTX
Qualification of.UV visible spectrophotometer pptx
shrutipandit17
 
PPTX
Evolution of diet breadth in herbivorus insects.pptx
Mr. Suresh R. Jambagi
 
PPTX
Unit 4 - Astronomy and Astrophysics - Milky Way And External Galaxies
RDhivya6
 
A water-rich interior in the temperate sub-Neptune K2-18 b revealed by JWST
Sérgio Sacani
 
1. Basic Principles of Medical Microbiology Part 1.ppt
separatedwalk
 
Role of GIS in precision farming.pptx
BikramjitDeuri
 
Renewable Energy Resources (Solar, Wind, Nuclear, Geothermal) Presentation
RimshaNaeem23
 
NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS...
Sérgio Sacani
 
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
The Toxic Effects of Aflatoxin B1 and Aflatoxin M1 on Kidney through Regulati...
OttokomaBonny
 
Nanofertilizer: Its potential benefits and associated challenges.pptx
BikramjitDeuri
 
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
Cell Structure and Organelles Slides PPT
JesusNeyra8
 
Pengenalan Sel dan organisasi kehidupanpptx
SuntiEkaprawesti1
 
Systems Biology: Integrating Engineering with Biological Research (www.kiu.a...
publication11
 
Q1_Science 8_Week4-Day 5.pptx science re
AizaRazonado
 
Nature of Science and the kinds of models used in science
JocelynEvascoRomanti
 
Quality control test for plastic & metal.pptx
shrutipandit17
 
Reticular formation_nuclei_afferent_efferent
muralinath2
 
Brain_stem_Medulla oblongata_functions of pons_mid brain
muralinath2
 
Qualification of.UV visible spectrophotometer pptx
shrutipandit17
 
Evolution of diet breadth in herbivorus insects.pptx
Mr. Suresh R. Jambagi
 
Unit 4 - Astronomy and Astrophysics - Milky Way And External Galaxies
RDhivya6
 

Gene Ontology Enrichment Network Analysis -Tutorial