SlideShare a Scribd company logo
BIOLOGICAL DATABASE
Dr. Nusaifa Beevi.P
Associate Professor & HOD,
PG Department of Botany,
Iqbal College, Peringammala
 BIOLOGICAL DATABASES
 Collection of files containing records of
biological data in machine readable form
 Can be accessed, added, retrieved,
manipulated and modified
 Store, manage, connect and distribute
data
 Data are arranged by sets of rules which
are programmed into software that
manages the data called Database
Management System or DBMS.
 Primary Databases: Contain original data in the
form of primary sequence data or structural data
as submitted by the scientific community.
 Secondary Databases: Contain information that
has been processed and derived from the raw
data available in primary database.eg: PROSITE,
PRINTS, BLOCKS etc..
 Composite Databases: Collect and present data
after comparing and filtering them from different
primary databases and exhibit only the non-
redundant sequences
 Nucleic acid databases: Gen Bank, EMBL,DDBJ
 Protein sequence databases: PIR, Swiss-Prot,
UNIPROT
 Protein structure database: PDB
 Metabolic databases: KEGG
 Composed of a group of nucleotide sequence
entries.
 Data repositories that accept nucleic acid
sequence data and make it freely available to
the public.
 GenBank, EMBL,DDBJ are principal nucleotide
databases.
 All the three are members of the International
Nucleotide Sequence Database Consortium
(INSDC) and interchange data.
 Hosted by National Centre for Biotechnology Information
(NCBI), situated at the campus of US National Institute of
Health, USA.
 Gen Bank offers all publicly available nucleotide
sequences, their protein translation, and their annotated
information.
 It also facilitate direct submission of sequence data by a
user friendly process.
 Researchers from anywhere can submit their data to Gen
Bank.
 An accession number is given to the submitted sequence
and then released to the public database after the quality
assurance check.
 This information can be retrieved using the Entrez
retrieval system.
 We can access the data in NCBI over the internet through
their site, http://www.ncbi.nlm.nih.gov/genbank
Biological database
 Started in 1986, hosted now at National Institute of
Genetics, Japan.
 Gather data mainly from scientists in Japan and from
researchers all over the world.
 This can also share nucleotide sequence data with Gen
Bank and EMBL.
 About 99% of the nucleotide data in INSDC submitted by
Japanese researchers through DDBJ, and enhances the
quality of INSDC.
 It includes details of sequences, submitters details,
biological significance , and the scientific name and
taxonomy of the organism. In addition, features that
identify coding region, transcription units, mutation sites
etc. are also displayed in a feature table.
 Major activities of the DDBJ include, providing
internationally recognized accession numbers
to sequences, bioinformatics database
management, developing tools for the analysis
and visualization of biological data, and also
conducting courses for beginners to reduce
the complexity in the biological data analysis.
 DDBJ can be accessed through homepage,
http://www.ddbj.nig.ac.jp/.
Biological database
 European Molecular Biology Laboratory Nucleotide Sequence
Database, first established in 1974.
 Hosted at UK by the EMBL European Bioinformatics Institute.
 EMBL is a non-profit research institution supported by 20
European countries and Australia, for Molecular Biology
Research.
 EMBL collects nucleotide sequence data from individual
researchers, genome sequence projects and patent
applications.
 Sequences are stored in this database as they would exist in
the biological state.
 The stored data correspond to wild type sequences without
mutation or genetic manipulation.
 Accessed through the URL, http://www.ebi.ac.uk/embl
Biological database
 An array of amino acid sequence entries
arranged according to the identification
number.
 Well known protein sequence databases
available on www are
◦ Swiss-Prot
◦ PIR
◦ UNIPROT
 Developed by the Swiss Institute of Bioinformatics (SIB) and
European Bioinformatics Institute(EBI).
 High quality, manually annotated protein sequence
database created in 1986.
 It provides high level annotations with functions of protein
and post transcriptional modifications.
 It provide all known relevant information about a particular
protein.
 Consists of two sections:- UniProt KB/Swiss-Prot, which is
manually annotated and is reviewed, and Uni
ProtKB/TrEMBL, which is automatically annotated and not
reviewed.
 Available at http://www.expasy.ch/sprot
Biological database
 Protein Information Resource database
 Established in 1984, by National Biomedical Research
Foundation (NBRF).
 It is an integrated public bioinformatics resource that
support genomic and proteomic research, and scientific
studies.
 It assists researchers in the identification and
interpretation of protein sequence information.
 PIR can be searched for entries or sequence similarity
searches.
 Can be downloaded at
http://www.pir.georgetown.edu/.
 PIR offers a variety of resources mainly oriented to
assist the propagation and standardization of protein
annotation.
Biological database
 It provide a comprehensive, high quality and
freely accessible resource of protein
sequence.
 Entries are derived from genome sequencing
projects.
 The Uniprot consortium comprises the
European Bioinformatics Institute(EBI),the
Swiss Institute of Bioinformatics(SIB), And the
Protein Information Resourse(PIR).
 Uniprot is composed of four components,
each optimized for different uses.
 1. UniProt Knowledge Base (UniProtKB)- For
extensive curated protein information with
two sections-UniProt KB/Swiss-Prot, which
is manually annotated and is reviewed,
and Uni ProtKB/TrEMBL, which is
automatically annotated and not reviewed.
 2. UniProt Reference Clusters (UniRef)
 3. UniProt Archive (UniParc)
 4. UniProt Metagenomic and Environmental
Sequences (UniMes)
Biological database
 Many proteins which exhibit a common
evolutionary origin, show structural
similarities.
 Dissimilar proteins exhibit changes in
primary, secondary, teritiary and
quarternary structures.
 Similar or dissimilar protein structure
can be predicted with structure
database.
 These databases store a collection of
three dimensional structures of proteins.
 Understanding the shape of a molecule helps to
understand how it works.
 PDB is the main primary database used for the
prediction of 3D Structures of proteins and nucleic
acids.
 The single world wide archive of structural data.
 Maintained by the Research Collaboratory for structural
bioinformatics (RCSB)
 The data obtained from X-ray chrystallography and
NMR-spectroscopy, are submitted to the PDB.
 Then, these structures are annotated as per the
depositors specifications.
 Freely available and accessed through URL
http://www.pdb.org/
Biological database
 MODs are also called Organism – specific databases.
They describe genome and other information about
well studied experimental organisms in life sciences.
 They store large volumes of data and allow users to
analyse results and interpret datasets and data they
generated. ( organism of their own interest).
 Examples:
 Fly Base- database of Drosophilla melanogaster
 SGD- Sacharomyces Genome Database
 AGR- Arabidopsis Genome Resource
 HGP- Human Genome Project
 RGD- Rat Genome Database etc…
 Provide information on the biodiversity of a particular
area or group of living organisms.
 They may store genus level information, species level
information, information on nomenclature or any
combination of the three.
 Species 2000
◦ Established in September 1994, by the International Union of
Biological Sciences(IUBS), in co-operation with the committee on
Data for science and technology(CODATA) and the International
Union of Microbiological Sciences(IUMS).
◦ It is a Federation of database organizations working closely with
users, taxonomists and sponsoring agencies.
◦ It plans to create an array of participant global species databases
covering each of the major groups of organisms(plants, animals,
fungi and microbes)
◦ The goal of species 2000 is to provide a uniform and validated
quality index of names of all known species for use as a practical
tool.
Thank You

More Related Content

PPTX
Proteins databases
Hafiz Muhammad Zeeshan Raza
 
PPTX
Introduction OF BIOLOGICAL DATABASE
PrashantSharma807
 
PPTX
Primary and secondary databases ppt by puneet kulyana
Puneet Kulyana
 
PPTX
databases in bioinformatics
nadeem akhter
 
PPT
Primary and secondary database
KAUSHAL SAHU
 
PDF
Nucleic Acid Sequence databases
Pranavathiyani G
 
PPTX
European molecular biology laboratory (EMBL)
Hafiz Muhammad Zeeshan Raza
 
PPTX
Introduction to ncbi, embl, ddbj
KAUSHAL SAHU
 
Proteins databases
Hafiz Muhammad Zeeshan Raza
 
Introduction OF BIOLOGICAL DATABASE
PrashantSharma807
 
Primary and secondary databases ppt by puneet kulyana
Puneet Kulyana
 
databases in bioinformatics
nadeem akhter
 
Primary and secondary database
KAUSHAL SAHU
 
Nucleic Acid Sequence databases
Pranavathiyani G
 
European molecular biology laboratory (EMBL)
Hafiz Muhammad Zeeshan Raza
 
Introduction to ncbi, embl, ddbj
KAUSHAL SAHU
 

What's hot (20)

PDF
Sequence alignment
Vidya Kalaivani Rajkumar
 
DOCX
UniProt
AmnaA7
 
PPTX
Swiss prot database
sagrika chugh
 
PPT
Phylogenetic analysis
National Institute of Biologics
 
PPTX
Cath
Ramya S
 
PPTX
DNA data bank of japan (DDBJ)
ZoufishanY
 
PPTX
Protein Databases
SATHIYA NARAYANAN
 
PPTX
Clustal W - Multiple Sequence alignment
The Oxford College Engineering
 
PPTX
Sequence alig Sequence Alignment Pairwise alignment:-
naveed ul mushtaq
 
PPTX
Recombination
Mary Theresa
 
PPTX
MULTIPLE SEQUENCE ALIGNMENT
Mariya Raju
 
PPTX
Genome mapping
Rashmi Yadav
 
PPTX
Prosite
Rashi Srivastava
 
PPTX
Database in bioinformatics
VinaKhan1
 
PPTX
Sequence Submission Tools
RishikaMaji
 
PPTX
Gen bank databases
Hafiz Muhammad Zeeshan Raza
 
PPTX
Phylogenetic tree
Sanzid Kawsar
 
PPTX
Uni prot presentation
Rida Khalid
 
Sequence alignment
Vidya Kalaivani Rajkumar
 
UniProt
AmnaA7
 
Swiss prot database
sagrika chugh
 
Phylogenetic analysis
National Institute of Biologics
 
Cath
Ramya S
 
DNA data bank of japan (DDBJ)
ZoufishanY
 
Protein Databases
SATHIYA NARAYANAN
 
Clustal W - Multiple Sequence alignment
The Oxford College Engineering
 
Sequence alig Sequence Alignment Pairwise alignment:-
naveed ul mushtaq
 
Recombination
Mary Theresa
 
MULTIPLE SEQUENCE ALIGNMENT
Mariya Raju
 
Genome mapping
Rashmi Yadav
 
Database in bioinformatics
VinaKhan1
 
Sequence Submission Tools
RishikaMaji
 
Gen bank databases
Hafiz Muhammad Zeeshan Raza
 
Phylogenetic tree
Sanzid Kawsar
 
Uni prot presentation
Rida Khalid
 
Ad

Similar to Biological database (20)

PPTX
DATABASES...............................pptx
Cherry
 
PPTX
BIOINFORMATICS BIOLOGICAL DATABASES DATA BASES.pptx
Jaleelkabdul Jaleel
 
PPTX
Presentation on Biological database By Elufer Akram @ University Of Science ...
Elufer Akram
 
PPTX
Sequence and Structural Databases of DNA and Protein, and its significance in...
BibiQuinah
 
PPTX
Sequence and Structural Databases of DNA and Protein, and its significance in...
SBituila
 
PPTX
Biological databases
SEKHARREDDYAMBATI
 
PDF
Bioinformatics biological databases
Sangeeta Das
 
PPTX
Biological databases.pptx
PagudalaSangeetha
 
PDF
Bioinformatics introduction
DrGopaSarma
 
PDF
Bioinformatics: History of Bioinformatics, Components of Bioinformatics, Geno...
A Biodiction : A Unit of Dr. Divya Sharma
 
PPTX
Introduction to databases.pptx
sworna kumari chithiraivelu
 
PPT
Bioinformatics in biotechnology by kk sahu
KAUSHAL SAHU
 
PPTX
biological databases.pptx
science lover
 
PPT
Bioinformatics and Databases in Biological Science
MohamedHasan816582
 
PPTX
Biological database ppt(1).pptx Introuction
RAJESHKUMAR428748
 
PPTX
Introduction to Biological database ppt(1).pptx
RAJESHKUMAR428748
 
PPT
bioinfomatics
nguyenpg
 
PPTX
Biological databases
SHRADHEYA GUPTA
 
PPTX
Nucleic acid and protein databanks
NithyaNandapal
 
PDF
Biological Database (1)pptxpdfpdfpdf.pdf
BioinformaticsCentre
 
DATABASES...............................pptx
Cherry
 
BIOINFORMATICS BIOLOGICAL DATABASES DATA BASES.pptx
Jaleelkabdul Jaleel
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Elufer Akram
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
BibiQuinah
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
SBituila
 
Biological databases
SEKHARREDDYAMBATI
 
Bioinformatics biological databases
Sangeeta Das
 
Biological databases.pptx
PagudalaSangeetha
 
Bioinformatics introduction
DrGopaSarma
 
Bioinformatics: History of Bioinformatics, Components of Bioinformatics, Geno...
A Biodiction : A Unit of Dr. Divya Sharma
 
Introduction to databases.pptx
sworna kumari chithiraivelu
 
Bioinformatics in biotechnology by kk sahu
KAUSHAL SAHU
 
biological databases.pptx
science lover
 
Bioinformatics and Databases in Biological Science
MohamedHasan816582
 
Biological database ppt(1).pptx Introuction
RAJESHKUMAR428748
 
Introduction to Biological database ppt(1).pptx
RAJESHKUMAR428748
 
bioinfomatics
nguyenpg
 
Biological databases
SHRADHEYA GUPTA
 
Nucleic acid and protein databanks
NithyaNandapal
 
Biological Database (1)pptxpdfpdfpdf.pdf
BioinformaticsCentre
 
Ad

Recently uploaded (20)

PPTX
Hydrocarbons Pollution. OIL pollutionpptx
AkCreation33
 
PPTX
Quality control test for plastic & metal.pptx
shrutipandit17
 
PPTX
Nanofertilizer: Its potential benefits and associated challenges.pptx
BikramjitDeuri
 
PPTX
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
PPTX
Unit 4 - Astronomy and Astrophysics - Milky Way And External Galaxies
RDhivya6
 
PPTX
Cell Structure and Organelles Slides PPT
JesusNeyra8
 
PPTX
METABOLIC_SYNDROME Dr Shadab- kgmu lucknow pptx
ShadabAlam169087
 
PDF
Approximating manifold orbits by means of Machine Learning Techniques
Esther Barrabés Vera
 
PDF
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
DOCX
Echoes_of_Andromeda_Partial (1).docx9989
yakshitkrishnia5a3
 
PDF
Control and coordination Class 10 Chapter 6
LataHolkar
 
PDF
Sujay Rao Mandavilli Multi-barreled appraoch to educational reform FINAL FINA...
Sujay Rao Mandavilli
 
PPTX
Embark on a journey of cell division and it's stages
sakyierhianmontero
 
PPTX
Hericium erinaceus, also known as lion's mane mushroom
TinaDadkhah1
 
PPTX
Qualification of.UV visible spectrophotometer pptx
shrutipandit17
 
PPTX
Hepatopulmonary syndrome power point presentation
raknasivar1997
 
PPTX
Feeding stratagey for climate change dairy animals.
Dr.Zulfy haq
 
PPTX
Laboratory design and safe microbiological practices
Akanksha Divkar
 
PDF
NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS...
Sérgio Sacani
 
PPTX
Role of GIS in precision farming.pptx
BikramjitDeuri
 
Hydrocarbons Pollution. OIL pollutionpptx
AkCreation33
 
Quality control test for plastic & metal.pptx
shrutipandit17
 
Nanofertilizer: Its potential benefits and associated challenges.pptx
BikramjitDeuri
 
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
Unit 4 - Astronomy and Astrophysics - Milky Way And External Galaxies
RDhivya6
 
Cell Structure and Organelles Slides PPT
JesusNeyra8
 
METABOLIC_SYNDROME Dr Shadab- kgmu lucknow pptx
ShadabAlam169087
 
Approximating manifold orbits by means of Machine Learning Techniques
Esther Barrabés Vera
 
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
Echoes_of_Andromeda_Partial (1).docx9989
yakshitkrishnia5a3
 
Control and coordination Class 10 Chapter 6
LataHolkar
 
Sujay Rao Mandavilli Multi-barreled appraoch to educational reform FINAL FINA...
Sujay Rao Mandavilli
 
Embark on a journey of cell division and it's stages
sakyierhianmontero
 
Hericium erinaceus, also known as lion's mane mushroom
TinaDadkhah1
 
Qualification of.UV visible spectrophotometer pptx
shrutipandit17
 
Hepatopulmonary syndrome power point presentation
raknasivar1997
 
Feeding stratagey for climate change dairy animals.
Dr.Zulfy haq
 
Laboratory design and safe microbiological practices
Akanksha Divkar
 
NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS...
Sérgio Sacani
 
Role of GIS in precision farming.pptx
BikramjitDeuri
 

Biological database

  • 1. BIOLOGICAL DATABASE Dr. Nusaifa Beevi.P Associate Professor & HOD, PG Department of Botany, Iqbal College, Peringammala
  • 2.  BIOLOGICAL DATABASES  Collection of files containing records of biological data in machine readable form  Can be accessed, added, retrieved, manipulated and modified  Store, manage, connect and distribute data  Data are arranged by sets of rules which are programmed into software that manages the data called Database Management System or DBMS.
  • 3.  Primary Databases: Contain original data in the form of primary sequence data or structural data as submitted by the scientific community.  Secondary Databases: Contain information that has been processed and derived from the raw data available in primary database.eg: PROSITE, PRINTS, BLOCKS etc..  Composite Databases: Collect and present data after comparing and filtering them from different primary databases and exhibit only the non- redundant sequences
  • 4.  Nucleic acid databases: Gen Bank, EMBL,DDBJ  Protein sequence databases: PIR, Swiss-Prot, UNIPROT  Protein structure database: PDB  Metabolic databases: KEGG
  • 5.  Composed of a group of nucleotide sequence entries.  Data repositories that accept nucleic acid sequence data and make it freely available to the public.  GenBank, EMBL,DDBJ are principal nucleotide databases.  All the three are members of the International Nucleotide Sequence Database Consortium (INSDC) and interchange data.
  • 6.  Hosted by National Centre for Biotechnology Information (NCBI), situated at the campus of US National Institute of Health, USA.  Gen Bank offers all publicly available nucleotide sequences, their protein translation, and their annotated information.  It also facilitate direct submission of sequence data by a user friendly process.  Researchers from anywhere can submit their data to Gen Bank.  An accession number is given to the submitted sequence and then released to the public database after the quality assurance check.  This information can be retrieved using the Entrez retrieval system.  We can access the data in NCBI over the internet through their site, http://www.ncbi.nlm.nih.gov/genbank
  • 8.  Started in 1986, hosted now at National Institute of Genetics, Japan.  Gather data mainly from scientists in Japan and from researchers all over the world.  This can also share nucleotide sequence data with Gen Bank and EMBL.  About 99% of the nucleotide data in INSDC submitted by Japanese researchers through DDBJ, and enhances the quality of INSDC.  It includes details of sequences, submitters details, biological significance , and the scientific name and taxonomy of the organism. In addition, features that identify coding region, transcription units, mutation sites etc. are also displayed in a feature table.
  • 9.  Major activities of the DDBJ include, providing internationally recognized accession numbers to sequences, bioinformatics database management, developing tools for the analysis and visualization of biological data, and also conducting courses for beginners to reduce the complexity in the biological data analysis.  DDBJ can be accessed through homepage, http://www.ddbj.nig.ac.jp/.
  • 11.  European Molecular Biology Laboratory Nucleotide Sequence Database, first established in 1974.  Hosted at UK by the EMBL European Bioinformatics Institute.  EMBL is a non-profit research institution supported by 20 European countries and Australia, for Molecular Biology Research.  EMBL collects nucleotide sequence data from individual researchers, genome sequence projects and patent applications.  Sequences are stored in this database as they would exist in the biological state.  The stored data correspond to wild type sequences without mutation or genetic manipulation.  Accessed through the URL, http://www.ebi.ac.uk/embl
  • 13.  An array of amino acid sequence entries arranged according to the identification number.  Well known protein sequence databases available on www are ◦ Swiss-Prot ◦ PIR ◦ UNIPROT
  • 14.  Developed by the Swiss Institute of Bioinformatics (SIB) and European Bioinformatics Institute(EBI).  High quality, manually annotated protein sequence database created in 1986.  It provides high level annotations with functions of protein and post transcriptional modifications.  It provide all known relevant information about a particular protein.  Consists of two sections:- UniProt KB/Swiss-Prot, which is manually annotated and is reviewed, and Uni ProtKB/TrEMBL, which is automatically annotated and not reviewed.  Available at http://www.expasy.ch/sprot
  • 16.  Protein Information Resource database  Established in 1984, by National Biomedical Research Foundation (NBRF).  It is an integrated public bioinformatics resource that support genomic and proteomic research, and scientific studies.  It assists researchers in the identification and interpretation of protein sequence information.  PIR can be searched for entries or sequence similarity searches.  Can be downloaded at http://www.pir.georgetown.edu/.  PIR offers a variety of resources mainly oriented to assist the propagation and standardization of protein annotation.
  • 18.  It provide a comprehensive, high quality and freely accessible resource of protein sequence.  Entries are derived from genome sequencing projects.  The Uniprot consortium comprises the European Bioinformatics Institute(EBI),the Swiss Institute of Bioinformatics(SIB), And the Protein Information Resourse(PIR).  Uniprot is composed of four components, each optimized for different uses.
  • 19.  1. UniProt Knowledge Base (UniProtKB)- For extensive curated protein information with two sections-UniProt KB/Swiss-Prot, which is manually annotated and is reviewed, and Uni ProtKB/TrEMBL, which is automatically annotated and not reviewed.  2. UniProt Reference Clusters (UniRef)  3. UniProt Archive (UniParc)  4. UniProt Metagenomic and Environmental Sequences (UniMes)
  • 21.  Many proteins which exhibit a common evolutionary origin, show structural similarities.  Dissimilar proteins exhibit changes in primary, secondary, teritiary and quarternary structures.  Similar or dissimilar protein structure can be predicted with structure database.  These databases store a collection of three dimensional structures of proteins.
  • 22.  Understanding the shape of a molecule helps to understand how it works.  PDB is the main primary database used for the prediction of 3D Structures of proteins and nucleic acids.  The single world wide archive of structural data.  Maintained by the Research Collaboratory for structural bioinformatics (RCSB)  The data obtained from X-ray chrystallography and NMR-spectroscopy, are submitted to the PDB.  Then, these structures are annotated as per the depositors specifications.  Freely available and accessed through URL http://www.pdb.org/
  • 24.  MODs are also called Organism – specific databases. They describe genome and other information about well studied experimental organisms in life sciences.  They store large volumes of data and allow users to analyse results and interpret datasets and data they generated. ( organism of their own interest).  Examples:  Fly Base- database of Drosophilla melanogaster  SGD- Sacharomyces Genome Database  AGR- Arabidopsis Genome Resource  HGP- Human Genome Project  RGD- Rat Genome Database etc…
  • 25.  Provide information on the biodiversity of a particular area or group of living organisms.  They may store genus level information, species level information, information on nomenclature or any combination of the three.  Species 2000 ◦ Established in September 1994, by the International Union of Biological Sciences(IUBS), in co-operation with the committee on Data for science and technology(CODATA) and the International Union of Microbiological Sciences(IUMS). ◦ It is a Federation of database organizations working closely with users, taxonomists and sponsoring agencies. ◦ It plans to create an array of participant global species databases covering each of the major groups of organisms(plants, animals, fungi and microbes) ◦ The goal of species 2000 is to provide a uniform and validated quality index of names of all known species for use as a practical tool.