SlideShare a Scribd company logo
4/21/2024 8:54 PM
Introduction to Bioinformatics
databases: Nucleic Acid
Databases
Dinesh
Gupta
ICGEB
4/21/2024 8:54 PM
Biological databases: why?
• Need for storing and communicating
large datasets has grown
• Make biological data available to
scientists.
• To make biological data available in
computer-readable form.
4/21/2024 8:54 PM
Different classifications of
databases
• Type of data
– nucleotide sequences
– protein sequences
– proteins sequence patterns or motifs
– macromolecular 3D structure
– gene expression data
– metabolic pathways
4/21/2024 8:54 PM
Different classifications of databases….
• Primary or derived databases
– Primary databases: experimental results
directly into database
– Secondary databases: results of analysis of
primary databases
– Aggregate of many databases
• Links to other data items
• Combination of data
• Consolidation of data
4/21/2024 8:54 PM
Different classifications of databases….
• Technical design
– Flat-files
– Relational database (SQL)
– Exchange/publication technologies (FTP,
HTML, CORBA, XML,...)
4/21/2024 8:54 PM
Different classifications of databases….
• Availability
– Publicly available, no restrictions
– Available, but with copyright
– Accessible, but not downloadable
– Academic, but not freely available
– Proprietary, commercial; possibly free for
academics
4/21/2024 8:54 PM
Where do I get DB of my interest ?
4/21/2024 8:54 PM
4/21/2024 8:54 PM
http://www3.oup.co.uk/nar/database/c/
4/21/2024 8:54 PM
Nucleotide sequence databases
• EMBL, GenBank, and DDBJ are the three
primary nucleotide sequence
databases
• EMBL www.ebi.ac.uk/embl/
• GenBank
www.ncbi.nlm.nih.gov/Genbank/
• DDBJ www.ddbj.nig.ac.jp
4/21/2024 8:54 PM
Genbank
• An annotated collection of all publicly
available nucleotide and proteins
• Set up in 1979 at the LANL (Los Alamos).
• Maintained since 1992 NCBI (Bethesda).
• http://www.ncbi.nlm.nih.gov
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
EMBL Nucleotide Sequence
Database
• An annotated collection of all publicly available
nucleotide and protein sequences
• Created in 1980 at the European Molecular
Biology Laboratory in Heidelberg.
• Maintained since 1994 by EBI- Cambridge.
• http://www.ebi.ac.uk/embl.html
4/21/2024 8:54 PM
4/21/2024 8:54 PM
http://www3.ebi.ac.uk/Services/DBStats/
4/21/2024 8:54 PM
DDBJ–DNA Data Bank of Japan
• An annotated collection of all publicly available
nucleotide and protein sequences
• Started, 1984 at the National Institute of
Genetics (NIG) in Mishima.
• Still maintained in this institute a team led by
Takashi Gojobori.
• http://www.ddbj.nig.ac.jp
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
Other NCBI nucleic acids DBs
• EST database: A collection of expressed sequence tags, or short, single-pass sequence
reads from mRNA (cDNA).
• GSS database: A database of genome survey sequences, or short, single-pass genomic
sequences.
• HomoloGene: A gene homology tool that compares nucleotide sequences between pairs of
organisms in order to identify putative orthologs.
• HTG database: A collection of high-throughput genome sequences from large-scale
genome sequencing centers, including unfinished and finished sequences.
• SNPs database: A central repository for both single-base nucleotide substitutions and
short deletion and insertion polymorphisms.
• RefSeq: A database of non-redundant reference sequences standards, including genomic
DNA contigs, mRNAs, and proteins for known genes. Multiple collaborations, both within
NCBI and with external groups, supports data-gathering efforts.
• STS database: A database of sequence tagged sites, or short sequences that are
operationally unique in the genome.
• UniSTS: A unified, non-redundant view of sequence tagged sites (STSs).
• UniGene: A collection of ESTs and full-length mRNA sequences organized into clusters,
each representing a unique known or putative human gene annotated with mapping and
expression information and cross-references to other sources.
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
Sequence submission
• Data mainly direct submissions from the
authors.
• Submissions through the Internet:
– Web forms.
– Email.
• Sequences shared/exchanged between
the 3 centers on a daily basis:
– The sequence content of the banks is
identical.
4/21/2024 8:54 PM
Derived databases
• CUTG Codon usage tabulated from GenBank
http://www.kazusa.or.jp/codon/
• Genetic Codes Deviations from the standard genetic code in various
organisms and organelles
http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi?mode=c
• TIGR Gene Indices Organism-specific databases of EST and gene
sequences http://www.tigr.org/tdb/tgi.shtml
• UniGene Unified clusters of ESTs and full-length mRNA sequences
http://www.ncbi.nlm.nih.gov/UniGene/
• ASAP Alternative spliced isoforms
http://www.bioinformatics.ucla.edu/ASAP
• Intronerator Introns and alternative splicing in C.elegans and
C.briggsae http://www.cse.ucsc.edu/~kent/intronerator/
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
Nucleic acid structure
databases
• NDB Nucleic acid-containing structures
http://ndbserver.rutgers.edu/
• NTDB Thermodynamic data for nucleic acids
http://ntdb.chem.cuhk.edu.hk/
• RNABase RNA-containing structures from PDB and
NDB http://www.rnabase.org/
• SCOR Structural classification of RNA: RNA motifs by
structure, function and tertiary interactions
• http://scor.lbl.gov/
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
4/21/2024 8:54 PM
Database searching tips
• Look for links to Help or Examples
• Try Boolean searches
• Be careful with UK/US spelling differences
– leukaemia vs leukemia
– haemoglobin vs hemoglobin
– colour vs color
4/21/2024 8:54 PM
Exercises
• Study the statistics of the three primary nucleic acid
databases: Are they matching ?
• Look for a gene of your interest in the three primary
nucleic acid databases: compare the information given in
each one of them.
• Read NAR DB paper and NAR DB index site: search for
different nucleic acid databases based on different
search terms.
• Self study:
– http://www3.oup.co.uk/nar/database/c/
– Download NAR database paper (NARDB2004) from:
ftp://cbag.sc.mahidol.ac.th/pub/Course_Materials/dinesh

More Related Content

PPTX
Nucleic acid database
Esakkiammal S
 
PPTX
Primary Databases.pptx
Swarup Malakar
 
PPTX
Biological databasesBiological databases
KrittikaChandran
 
PPTX
Introduction to databases.pptx
sworna kumari chithiraivelu
 
PPTX
Biological database
Iqbal college Peringammala TVM
 
PPTX
Biological databases.pptx
PagudalaSangeetha
 
PPT
Introduction to Bioinformatics and DatabasesDay1.ppt
khadijarafiq2012
 
PPTX
Database in bioinformatics
VinaKhan1
 
Nucleic acid database
Esakkiammal S
 
Primary Databases.pptx
Swarup Malakar
 
Biological databasesBiological databases
KrittikaChandran
 
Introduction to databases.pptx
sworna kumari chithiraivelu
 
Biological database
Iqbal college Peringammala TVM
 
Biological databases.pptx
PagudalaSangeetha
 
Introduction to Bioinformatics and DatabasesDay1.ppt
khadijarafiq2012
 
Database in bioinformatics
VinaKhan1
 

Similar to Nucleic_Acid_Databases, Bioinformatics, genome (20)

PPTX
Sequence and Structural Databases of DNA and Protein, and its significance in...
BibiQuinah
 
PPTX
Sequence and Structural Databases of DNA and Protein, and its significance in...
SBituila
 
PPT
Biodatabases 101220022654-phpapp02
Sreekanth Gali
 
PPT
NCBI
Kavisa Ghosh
 
PDF
Bioinformatics: History of Bioinformatics, Components of Bioinformatics, Geno...
A Biodiction : A Unit of Dr. Divya Sharma
 
PPTX
BIOINFORMATICS BIOLOGICAL DATABASES DATA BASES.pptx
Jaleelkabdul Jaleel
 
PPTX
DATABASES...............................pptx
Cherry
 
PDF
BIOLOGICAL DATABASE AND ITS TYPES,IMPORTANCE OF BIOLOGICAL DATABASE
savidhasam2001
 
PPTX
Databases_L2.pptx
kigaruantony
 
PPTX
Bioinformatics final
Rainu Rajeev
 
PPTX
Biological database ppt(1).pptx Introuction
RAJESHKUMAR428748
 
PPTX
Nucleic acid and protein databanks
NithyaNandapal
 
PDF
Biological Database (1)pptxpdfpdfpdf.pdf
BioinformaticsCentre
 
PPTX
Introduction to Biological database ppt(1).pptx
RAJESHKUMAR428748
 
PPT
Bioinformatics and Databases in Biological Science
MohamedHasan816582
 
PPT
Bioinformatic_Databases_2.ppt
NaglaaFathy42
 
PPT
Bioinformatic databases 2
Razzaqe
 
PPT
Bioinformatic databases 2
Razzaqe
 
PPT
Bioinformatic_Databases_2xcxzczxcxzxcxzc
AdiM27
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
BibiQuinah
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
SBituila
 
Biodatabases 101220022654-phpapp02
Sreekanth Gali
 
Bioinformatics: History of Bioinformatics, Components of Bioinformatics, Geno...
A Biodiction : A Unit of Dr. Divya Sharma
 
BIOINFORMATICS BIOLOGICAL DATABASES DATA BASES.pptx
Jaleelkabdul Jaleel
 
DATABASES...............................pptx
Cherry
 
BIOLOGICAL DATABASE AND ITS TYPES,IMPORTANCE OF BIOLOGICAL DATABASE
savidhasam2001
 
Databases_L2.pptx
kigaruantony
 
Bioinformatics final
Rainu Rajeev
 
Biological database ppt(1).pptx Introuction
RAJESHKUMAR428748
 
Nucleic acid and protein databanks
NithyaNandapal
 
Biological Database (1)pptxpdfpdfpdf.pdf
BioinformaticsCentre
 
Introduction to Biological database ppt(1).pptx
RAJESHKUMAR428748
 
Bioinformatics and Databases in Biological Science
MohamedHasan816582
 
Bioinformatic_Databases_2.ppt
NaglaaFathy42
 
Bioinformatic databases 2
Razzaqe
 
Bioinformatic databases 2
Razzaqe
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
AdiM27
 
Ad

More from MohamedHasan816582 (20)

PPT
Introduction to Genetics and molecular biology.ppt
MohamedHasan816582
 
PPTX
Application of Biotechnology for Improving Medicinal Plants.pptx
MohamedHasan816582
 
PPT
structure Am Health Final and Technology. ppt
MohamedHasan816582
 
PPTX
Bioinformatics & AI- in Medicinal and aromatic plant.pptx
MohamedHasan816582
 
PPTX
Basic Bioinformatics and Biotechnology.pptx
MohamedHasan816582
 
PPT
2- Basics of Molecular Biology and biochemistry.ppt
MohamedHasan816582
 
PPT
3- introduction(SEQU ANAL of PCR products 9 9 12 (2).ppt
MohamedHasan816582
 
PPTX
TNBC Research Presentation and medical virology .pptx
MohamedHasan816582
 
PPTX
EBOV Presentation and medical Virology .pptx
MohamedHasan816582
 
PPTX
Presentation of medical biotechnology.pptx
MohamedHasan816582
 
PPTX
Mohamed El-Sayed Hasan and curriculum vitae.pptx
MohamedHasan816582
 
PPT
Introduction to classical and modern Genetics.ppt
MohamedHasan816582
 
PPTX
Topic 5 of the genomics and proteomics.pptx
MohamedHasan816582
 
PPTX
EmZ medical microbiology and classification.pptx
MohamedHasan816582
 
PPTX
presentation and microbial biotechnology.pptx
MohamedHasan816582
 
PPTX
EmZ medical microbiology and classification.pptx
MohamedHasan816582
 
PPTX
IMAN of medical microbiology and classification.pptx
MohamedHasan816582
 
PPT
aya presentation of discussion seminar .ppt
MohamedHasan816582
 
PPTX
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020_3.pptx
MohamedHasan816582
 
PPT
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020_2.ppt
MohamedHasan816582
 
Introduction to Genetics and molecular biology.ppt
MohamedHasan816582
 
Application of Biotechnology for Improving Medicinal Plants.pptx
MohamedHasan816582
 
structure Am Health Final and Technology. ppt
MohamedHasan816582
 
Bioinformatics & AI- in Medicinal and aromatic plant.pptx
MohamedHasan816582
 
Basic Bioinformatics and Biotechnology.pptx
MohamedHasan816582
 
2- Basics of Molecular Biology and biochemistry.ppt
MohamedHasan816582
 
3- introduction(SEQU ANAL of PCR products 9 9 12 (2).ppt
MohamedHasan816582
 
TNBC Research Presentation and medical virology .pptx
MohamedHasan816582
 
EBOV Presentation and medical Virology .pptx
MohamedHasan816582
 
Presentation of medical biotechnology.pptx
MohamedHasan816582
 
Mohamed El-Sayed Hasan and curriculum vitae.pptx
MohamedHasan816582
 
Introduction to classical and modern Genetics.ppt
MohamedHasan816582
 
Topic 5 of the genomics and proteomics.pptx
MohamedHasan816582
 
EmZ medical microbiology and classification.pptx
MohamedHasan816582
 
presentation and microbial biotechnology.pptx
MohamedHasan816582
 
EmZ medical microbiology and classification.pptx
MohamedHasan816582
 
IMAN of medical microbiology and classification.pptx
MohamedHasan816582
 
aya presentation of discussion seminar .ppt
MohamedHasan816582
 
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020_3.pptx
MohamedHasan816582
 
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020_2.ppt
MohamedHasan816582
 
Ad

Recently uploaded (20)

PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PDF
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
PPTX
PROTIEN ENERGY MALNUTRITION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
Virus sequence retrieval from NCBI database
yamunaK13
 
PPTX
Basics and rules of probability with real-life uses
ravatkaran694
 
PDF
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
Nguyen Thanh Tu Collection
 
PPTX
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
DOCX
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 
DOCX
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 
PPTX
How to Apply for a Job From Odoo 18 Website
Celine George
 
PPTX
CARE OF UNCONSCIOUS PATIENTS .pptx
AneetaSharma15
 
PPTX
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
PPTX
family health care settings home visit - unit 6 - chn 1 - gnm 1st year.pptx
Priyanshu Anand
 
PPTX
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
PDF
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
PPTX
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
PPTX
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
PPTX
CDH. pptx
AneetaSharma15
 
DOCX
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
PROTIEN ENERGY MALNUTRITION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Virus sequence retrieval from NCBI database
yamunaK13
 
Basics and rules of probability with real-life uses
ravatkaran694
 
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
Nguyen Thanh Tu Collection
 
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 
How to Apply for a Job From Odoo 18 Website
Celine George
 
CARE OF UNCONSCIOUS PATIENTS .pptx
AneetaSharma15
 
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
family health care settings home visit - unit 6 - chn 1 - gnm 1st year.pptx
Priyanshu Anand
 
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
CDH. pptx
AneetaSharma15
 
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 

Nucleic_Acid_Databases, Bioinformatics, genome

  • 1. 4/21/2024 8:54 PM Introduction to Bioinformatics databases: Nucleic Acid Databases Dinesh Gupta ICGEB
  • 2. 4/21/2024 8:54 PM Biological databases: why? • Need for storing and communicating large datasets has grown • Make biological data available to scientists. • To make biological data available in computer-readable form.
  • 3. 4/21/2024 8:54 PM Different classifications of databases • Type of data – nucleotide sequences – protein sequences – proteins sequence patterns or motifs – macromolecular 3D structure – gene expression data – metabolic pathways
  • 4. 4/21/2024 8:54 PM Different classifications of databases…. • Primary or derived databases – Primary databases: experimental results directly into database – Secondary databases: results of analysis of primary databases – Aggregate of many databases • Links to other data items • Combination of data • Consolidation of data
  • 5. 4/21/2024 8:54 PM Different classifications of databases…. • Technical design – Flat-files – Relational database (SQL) – Exchange/publication technologies (FTP, HTML, CORBA, XML,...)
  • 6. 4/21/2024 8:54 PM Different classifications of databases…. • Availability – Publicly available, no restrictions – Available, but with copyright – Accessible, but not downloadable – Academic, but not freely available – Proprietary, commercial; possibly free for academics
  • 7. 4/21/2024 8:54 PM Where do I get DB of my interest ?
  • 10. 4/21/2024 8:54 PM Nucleotide sequence databases • EMBL, GenBank, and DDBJ are the three primary nucleotide sequence databases • EMBL www.ebi.ac.uk/embl/ • GenBank www.ncbi.nlm.nih.gov/Genbank/ • DDBJ www.ddbj.nig.ac.jp
  • 11. 4/21/2024 8:54 PM Genbank • An annotated collection of all publicly available nucleotide and proteins • Set up in 1979 at the LANL (Los Alamos). • Maintained since 1992 NCBI (Bethesda). • http://www.ncbi.nlm.nih.gov
  • 14. 4/21/2024 8:54 PM EMBL Nucleotide Sequence Database • An annotated collection of all publicly available nucleotide and protein sequences • Created in 1980 at the European Molecular Biology Laboratory in Heidelberg. • Maintained since 1994 by EBI- Cambridge. • http://www.ebi.ac.uk/embl.html
  • 17. 4/21/2024 8:54 PM DDBJ–DNA Data Bank of Japan • An annotated collection of all publicly available nucleotide and protein sequences • Started, 1984 at the National Institute of Genetics (NIG) in Mishima. • Still maintained in this institute a team led by Takashi Gojobori. • http://www.ddbj.nig.ac.jp
  • 20. 4/21/2024 8:54 PM Other NCBI nucleic acids DBs • EST database: A collection of expressed sequence tags, or short, single-pass sequence reads from mRNA (cDNA). • GSS database: A database of genome survey sequences, or short, single-pass genomic sequences. • HomoloGene: A gene homology tool that compares nucleotide sequences between pairs of organisms in order to identify putative orthologs. • HTG database: A collection of high-throughput genome sequences from large-scale genome sequencing centers, including unfinished and finished sequences. • SNPs database: A central repository for both single-base nucleotide substitutions and short deletion and insertion polymorphisms. • RefSeq: A database of non-redundant reference sequences standards, including genomic DNA contigs, mRNAs, and proteins for known genes. Multiple collaborations, both within NCBI and with external groups, supports data-gathering efforts. • STS database: A database of sequence tagged sites, or short sequences that are operationally unique in the genome. • UniSTS: A unified, non-redundant view of sequence tagged sites (STSs). • UniGene: A collection of ESTs and full-length mRNA sequences organized into clusters, each representing a unique known or putative human gene annotated with mapping and expression information and cross-references to other sources.
  • 23. 4/21/2024 8:54 PM Sequence submission • Data mainly direct submissions from the authors. • Submissions through the Internet: – Web forms. – Email. • Sequences shared/exchanged between the 3 centers on a daily basis: – The sequence content of the banks is identical.
  • 24. 4/21/2024 8:54 PM Derived databases • CUTG Codon usage tabulated from GenBank http://www.kazusa.or.jp/codon/ • Genetic Codes Deviations from the standard genetic code in various organisms and organelles http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi?mode=c • TIGR Gene Indices Organism-specific databases of EST and gene sequences http://www.tigr.org/tdb/tgi.shtml • UniGene Unified clusters of ESTs and full-length mRNA sequences http://www.ncbi.nlm.nih.gov/UniGene/ • ASAP Alternative spliced isoforms http://www.bioinformatics.ucla.edu/ASAP • Intronerator Introns and alternative splicing in C.elegans and C.briggsae http://www.cse.ucsc.edu/~kent/intronerator/
  • 31. 4/21/2024 8:54 PM Nucleic acid structure databases • NDB Nucleic acid-containing structures http://ndbserver.rutgers.edu/ • NTDB Thermodynamic data for nucleic acids http://ntdb.chem.cuhk.edu.hk/ • RNABase RNA-containing structures from PDB and NDB http://www.rnabase.org/ • SCOR Structural classification of RNA: RNA motifs by structure, function and tertiary interactions • http://scor.lbl.gov/
  • 36. 4/21/2024 8:54 PM Database searching tips • Look for links to Help or Examples • Try Boolean searches • Be careful with UK/US spelling differences – leukaemia vs leukemia – haemoglobin vs hemoglobin – colour vs color
  • 37. 4/21/2024 8:54 PM Exercises • Study the statistics of the three primary nucleic acid databases: Are they matching ? • Look for a gene of your interest in the three primary nucleic acid databases: compare the information given in each one of them. • Read NAR DB paper and NAR DB index site: search for different nucleic acid databases based on different search terms. • Self study: – http://www3.oup.co.uk/nar/database/c/ – Download NAR database paper (NARDB2004) from: ftp://cbag.sc.mahidol.ac.th/pub/Course_Materials/dinesh