Chou-Fasman Algorithm for Protein Structure Prediction
Contents…
• Importance of the Structures of proteins
• Prediction of Secondary Structures
• Chou-Fasman Algorithm
• How it works!
Chou-Fasman Algorithm for Protein Prediction 2
What is the Chou-Fasman algorithm?
• The experimental methods used by biotechnologists
to determine the structures of proteins demand
sophisticated equipment and time.
• A host of computational methods have been developed to
predict the location of secondary structure elements
in proteins, complementing or providing insight
into experimental results.
• The Chou-Fasman algorithm is an empirical algorithm
developed for the prediction of protein secondary
structure.
Before we go…
• Structures of proteins
• Why the study of structures is important
• Why an algorithm is needed
Secondary structure prediction
• In either case, amino acid propensities should be
useful for predicting secondary structure
• Two classical methods that use previously
determined propensities:
• Chou-Fasman
• Garnier-Osguthorpe-Robson
Goal…
• Take primary structure (sequence) and, using rules
derived from known structures, predict the
secondary structure that is most likely to be
adopted by each residue
• Major classes are α-helices, β-sheets and loops
Structural Propensities
• Due to the size, shape and charge of its side chain,
each amino acid may “fit” better in one type of
secondary structure than another
• Classic example: The rigidity and side chain angle of
proline cannot be accommodated in an α-helical
structure
Structural Propensities
• Two ways to view the significance of this
preference (or propensity)
• It may control or affect the folding of the protein in its
immediate vicinity (amino acid determines structure)
• It may constitute selective pressure to use particular
amino acids in regions that must have a particular
structure (structure determines amino acid)
Chou-Fasman method
• Uses table of conformational parameters
(propensities) determined primarily from
measurements of secondary structure by CD
spectroscopy
• Table consists of one “likelihood” for each structure
for each amino acid
Chou-Fasman Algorithm
• Conformational parameters for every amino acid (AA):
• P(a) = propensity in an alpha helix
• P(b) = propensity in a beta sheet
• P(turn) = propensity in a turn
• Based on observed propensities in proteins of known structure
Chou-Fasman propensities (partial table)
Amino Acid   Pa     Pb     Pt
Glu          1.51   0.37   0.74
Met          1.45   1.05   0.60
Ala          1.42   0.83   0.66
Val          1.06   1.70   0.50
Ile          1.08   1.60   0.50
Tyr          0.69   1.47   1.14
Pro          0.57   0.55   1.52
Gly          0.57   0.75   1.56
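As a minimal sketch, the partial table can be held in a dictionary and queried for the structure each residue favours most. The one-letter codes, the names `PROPENSITIES` and `favoured_structure`, and the idea of simply taking the largest value per residue are illustrative assumptions; the Chou-Fasman method itself works on windows of residues rather than on single residues.

```python
# Partial Chou-Fasman table from the slide: residue -> (Pa, Pb, Pt).
# One-letter codes are an assumption added for convenience.
PROPENSITIES = {
    "E": (1.51, 0.37, 0.74),  # Glu
    "M": (1.45, 1.05, 0.60),  # Met
    "A": (1.42, 0.83, 0.66),  # Ala
    "V": (1.06, 1.70, 0.50),  # Val
    "I": (1.08, 1.60, 0.50),  # Ile
    "Y": (0.69, 1.47, 1.14),  # Tyr
    "P": (0.57, 0.55, 1.52),  # Pro
    "G": (0.57, 0.75, 1.56),  # Gly
}
STRUCTURES = ("helix", "sheet", "turn")


def favoured_structure(residue):
    """Return the structure with the largest tabulated propensity for one residue."""
    values = PROPENSITIES[residue]
    return STRUCTURES[values.index(max(values))]


for residue in "EVPG":
    print(residue, favoured_structure(residue))
```

Only the eight residues listed on the slide are covered; any other residue raises a `KeyError`.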
Chou-Fasman method
• A prediction is made for each type of structure for
each amino acid
• Can result in ambiguity if a region has high propensities
for both helix and sheet (higher value usually chosen,
with exceptions)
Chou-Fasman method
• Calculation rules are somewhat ad hoc
• Example: Method for helix
• Search for nucleating region where 4 out of 6 a.a. have
Pa > 1.03
• Extend until 4 consecutive a.a. have an average Pa < 1.00
• If the region is at least 6 a.a. long, has an average Pa > 1.03,
and average Pa > average Pb, consider the region to be helix
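The three helix rules can be sketched in Python as follows. This is an illustration under stated assumptions, not a reference implementation: it uses only the eight residues from the partial table, extends in both directions, stops extending when fewer than four residues remain, and follows only the first nucleus it finds. The function names and the test sequence are hypothetical.

```python
# Pa and Pb for the eight residues in the partial table (one-letter codes assumed).
PA = {"E": 1.51, "M": 1.45, "A": 1.42, "I": 1.08, "V": 1.06, "Y": 0.69, "P": 0.57, "G": 0.57}
PB = {"E": 0.37, "M": 1.05, "A": 0.83, "I": 1.60, "V": 1.70, "Y": 1.47, "P": 0.55, "G": 0.75}


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def first_helix_nucleus(seq, window=6, needed=4, threshold=1.03):
    """Return (start, end) of the first 6-residue window with 4 residues above threshold."""
    for i in range(len(seq) - window + 1):
        if sum(PA[r] > threshold for r in seq[i:i + window]) >= needed:
            return i, i + window
    return None


def extend_helix(seq, start, end, stop_len=4, stop_avg=1.00):
    """Grow the region one residue at a time until the adjacent 4 residues average below stop_avg."""
    while end + stop_len <= len(seq) and mean(PA[r] for r in seq[end:end + stop_len]) >= stop_avg:
        end += 1
    while start - stop_len >= 0 and mean(PA[r] for r in seq[start - stop_len:start]) >= stop_avg:
        start -= 1
    return start, end


def is_helix(seq, start, end, min_len=6, threshold=1.03):
    """Apply the final acceptance test: length, average Pa, and Pa versus Pb."""
    region = seq[start:end]
    avg_a = mean(PA[r] for r in region)
    avg_b = mean(PB[r] for r in region)
    return len(region) >= min_len and avg_a > threshold and avg_a > avg_b


seq = "GPGPEAMAEAGPGP"  # hypothetical sequence built only from tabulated residues
nucleus = first_helix_nucleus(seq)
if nucleus is not None:
    start, end = extend_helix(seq, *nucleus)
    print(seq[start:end], is_helix(seq, start, end))
```

Regions are half-open `(start, end)` index pairs, so `seq[start:end]` is the candidate helix.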
• Scan the peptide and identify regions where 3 out
of 5 contiguous residues have P(β) > 100 (propensities
scaled by 100, i.e. 1.00 on the scale of the table).
• These residues nucleate β-strands. Extend these in
both directions until a set of four contiguous
residues has an average P(β) < 100.
• This ends the β-strand.
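A rough sketch of the β-strand rules, this time marking every residue covered by any extended nucleus rather than following a single one. It assumes the threshold of 100 corresponds to 1.00 on the scale of the propensity table, uses only the eight tabulated residues, and stops extending when fewer than four residues remain; `strand_mask` and the test sequence are hypothetical names and data.

```python
# Pb for the eight residues in the partial table (one-letter codes assumed).
PB = {"V": 1.70, "I": 1.60, "Y": 1.47, "M": 1.05, "A": 0.83, "G": 0.75, "P": 0.55, "E": 0.37}


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def strand_mask(seq, window=5, needed=3, threshold=1.00, stop_len=4):
    """Return one boolean per residue: True where a strand is predicted."""
    n = len(seq)
    mask = [False] * n
    for i in range(n - window + 1):
        # Nucleation: at least 3 of 5 contiguous residues with Pb above threshold.
        if sum(PB[r] > threshold for r in seq[i:i + window]) < needed:
            continue
        start, end = i, i + window
        # Extend right, then left, until the adjacent 4 residues average below threshold.
        while end + stop_len <= n and mean(PB[r] for r in seq[end:end + stop_len]) >= threshold:
            end += 1
        while start - stop_len >= 0 and mean(PB[r] for r in seq[start - stop_len:start]) >= threshold:
            start -= 1
        for k in range(start, end):
            mask[k] = True
    return mask


seq = "EEPGVIYVIAEEPG"  # hypothetical sequence built only from tabulated residues
labels = "".join("E" if flag else "-" for flag in strand_mask(seq))
print(seq)
print(labels)
```

Because every qualifying window is extended and merged, the predicted strand can include a few weak residues at its edges that fell inside a nucleating window.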
• Any region containing overlapping α and β
assignments is taken to be helical or β depending
on which of the average P(α) and P(β) for that region is
larger.
• If this reduces an α or β region so that it
becomes less than 5 residues, the α or β
assignment for that region is removed.
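The overlap rule might be sketched like this for one helix region and one strand region, each given as a half-open `(start, end)` pair. The tie-break in favour of helix, the function names, and the test sequence are assumptions made for illustration; only the eight tabulated residues are supported.

```python
# Pa and Pb for the eight residues in the partial table (one-letter codes assumed).
PA = {"E": 1.51, "M": 1.45, "A": 1.42, "I": 1.08, "V": 1.06, "Y": 0.69, "P": 0.57, "G": 0.57}
PB = {"E": 0.37, "M": 1.05, "A": 0.83, "I": 1.60, "V": 1.70, "Y": 1.47, "P": 0.55, "G": 0.75}


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def trim(region, lo, hi, min_len):
    """Cut the overlap [lo, hi) out of a region and keep pieces of at least min_len residues."""
    start, end = region
    pieces = [(start, lo), (hi, end)]
    return [(s, e) for s, e in pieces if e - s >= min_len]


def resolve_overlap(seq, helix, strand, min_len=5):
    """Return (helix_regions, strand_regions) after assigning the overlap to one class."""
    lo, hi = max(helix[0], strand[0]), min(helix[1], strand[1])
    if lo >= hi:
        return [helix], [strand]  # the regions do not overlap
    overlap = seq[lo:hi]
    avg_a = mean(PA[r] for r in overlap)
    avg_b = mean(PB[r] for r in overlap)
    if avg_a >= avg_b:  # assumption: ties go to helix
        return [helix], trim(strand, lo, hi, min_len)
    return trim(helix, lo, hi, min_len), [strand]


seq = "EAMAEVIYVIYV"  # hypothetical sequence built only from tabulated residues
print(resolve_overlap(seq, (0, 8), (5, 12)))  # helix keeps 5 residues
print(resolve_overlap(seq, (2, 8), (5, 12)))  # helix drops to 3 residues and is removed
```

In both calls the overlap is residues 5 to 7 (V, I, Y), where the average P(β) is clearly higher, so the strand keeps the overlap and the helix is trimmed.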
Find potential alpha helix (worked example, propensities scaled by 100)
Sequences shown on the slide: SPASEASDGQSVSV, SPASEASDGQFETTY, MFCTYYGNNGEHIELMM
Residue   P(a)   P(b)
S          77     75
P          55     55
A         142     83
S          77
E         151     37
A         142     83
S          77     75
D         101     54
G          57
Q         111
1) 4 of 6, P(a) > 100
2) Extend right until 4 contiguous residues have P(a) < 100
3) Calculate ΣP(a) and ΣP(b). Is ΣP(a) > ΣP(b)? (Do not include the last 4 in the sum)
Accuracy of Chou-Fasman predictions
• Sequences whose 3D structures are known are processed so
that each residue is “assigned” to a given secondary
structure class by looking at the backbone angles
• Three classes are most often used (helix=H, sheet=E, turn=C),
but four classes (helix, sheet, turn, loop) are sometimes used
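Once every residue has an assigned class and a predicted class, a simple per-residue accuracy can be computed by counting matches. The sketch below assumes both are given as equal-length strings over H, E and C; the function name and the two strings are made up for illustration. With three classes this score is commonly called Q3.

```python
def per_residue_accuracy(predicted, observed):
    """Percentage of positions where the predicted class equals the observed class."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("sequences must not be empty")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * matches / len(observed)


observed = "HHHHCCEEEE"   # hypothetical assignment from a known structure
predicted = "HHHCCCEEEH"  # hypothetical prediction for the same residues
print(per_residue_accuracy(predicted, observed))
```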
Conclusion…..
Confusion matrix for Chou-Fasman method on 78 proteins
(rows: true class; columns: predicted class)
True   H      E      C      Unknown
H      47.5   3.0    4.3    45.2
E      20.8   16.8   7.1    55.4
C      6.4    3.6    38.0   52.0
Data from Z-Y Zhu, Protein Engineering 8:103-109, 1995
Average accuracy = 54.4
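One way to read the matrix is to ask, for each true class, how often the method was right among the residues it actually assigned to H, E or C, leaving the "Unknown" column aside. The sketch below assumes rows are true classes and columns are predicted classes, with each row given in percent. It does not attempt to reproduce the 54.4 figure, whose weighting is not stated on the slide.

```python
# Rows: true class. Columns: predicted class, values in percent of that true class.
CONFUSION = {
    "H": {"H": 47.5, "E": 3.0, "C": 4.3, "Unknown": 45.2},
    "E": {"H": 20.8, "E": 16.8, "C": 7.1, "Unknown": 55.4},
    "C": {"H": 6.4, "E": 3.6, "C": 38.0, "Unknown": 52.0},
}


def correct_among_assigned(true_class):
    """Percent of assigned (non-Unknown) residues of a true class that were predicted correctly."""
    row = CONFUSION[true_class]
    assigned = sum(value for label, value in row.items() if label != "Unknown")
    return 100.0 * row[true_class] / assigned


for true_class, row in CONFUSION.items():
    print(true_class,
          "unassigned:", row["Unknown"],
          "correct among assigned:", round(correct_among_assigned(true_class), 1))
```

Read this way, roughly half of the residues in each class receive no prediction at all, and sheet residues that do receive one are more often called helix than sheet.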
Thank You!
