Abstract
Free full text
Identification and application of the concepts important for accurate and reliable protein secondary structure prediction.
Abstract
A protein secondary structure prediction method from multiply aligned homologous sequences is presented with an overall per residue three-state accuracy of 70.1%. There are two aims: to obtain high accuracy by identification of a set of concepts important for prediction followed by use of linear statistics; and to provide insight into the folding process. The important concepts in secondary structure prediction are identified as: residue conformational propensities, sequence edge effects, moments of hydrophobicity, position of insertions and deletions in aligned homologous sequence, moments of conservation, auto-correlation, residue ratios, secondary structure feedback effects, and filtering. Explicit use of edge effects, moments of conservation, and auto-correlation are new to this paper. The relative importance of the concepts used in prediction was analyzed by stepwise addition of information and examination of weights in the discrimination function. The simple and explicit structure of the prediction allows the method to be reimplemented easily. The accuracy of a prediction is predictable a priori. This permits evaluation of the utility of the prediction: 10% of the chains predicted were identified correctly as having a mean accuracy of > 80%. Existing high-accuracy prediction methods are "black-box" predictors based on complex nonlinear statistics (e.g., neural networks in PHD: Rost & Sander, 1993a). For medium- to short-length chains (> or = 90 residues and < 170 residues), the prediction method is significantly more accurate (P < 0.01) than the PHD algorithm (probably the most commonly used algorithm). In combination with the PHD, an algorithm is formed that is significantly more accurate than either method, with an estimated overall three-state accuracy of 72.4%, the highest accuracy reported for any prediction method.
Full Text
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Benner SA, Cohen MA, Gerloff D. Correct structure prediction? Nature. 1992 Oct 29;359(6398):781–781. [Abstract] [Google Scholar]
- Benner SA, Gerloff D. Patterns of divergence in homologous proteins as indicators of secondary and tertiary structure: a prediction of the structure of the catalytic domain of protein kinases. Adv Enzyme Regul. 1991;31:121–181. [Abstract] [Google Scholar]
- Benner SA, Gerloff DL. Predicting the conformation of proteins. Man versus machine. FEBS Lett. 1993 Jun 28;325(1-2):29–33. [Abstract] [Google Scholar]
- Biou V, Gibrat JF, Levin JM, Robson B, Garnier J. Secondary structure prediction: combination of three different methods. Protein Eng. 1988 Sep;2(3):185–191. [Abstract] [Google Scholar]
- Bryson JW, Betz SF, Lu HS, Suich DJ, Zhou HX, O'Neil KT, DeGrado WF. Protein design: a hierarchic approach. Science. 1995 Nov 10;270(5238):935–941. [Abstract] [Google Scholar]
- Chou PY, Fasman GD. Prediction of protein conformation. Biochemistry. 1974 Jan 15;13(2):222–245. [Abstract] [Google Scholar]
- Colloc'h N, Etchebest C, Thoreau E, Henrissat B, Mornon JP. Comparison of three algorithms for the assignment of secondary structure in proteins: the advantages of a consensus assignment. Protein Eng. 1993 Jun;6(4):377–382. [Abstract] [Google Scholar]
- Garnier J, Osguthorpe DJ, Robson B. Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. J Mol Biol. 1978 Mar 25;120(1):97–120. [Abstract] [Google Scholar]
- Geourjon C, Deléage G. SOPM: a self-optimized method for protein secondary structure prediction. Protein Eng. 1994 Feb;7(2):157–164. [Abstract] [Google Scholar]
- Horovitz A, Matthews JM, Fersht AR. Alpha-helix stability in proteins. II. Factors that influence stability at an internal position. J Mol Biol. 1992 Sep 20;227(2):560–568. [Abstract] [Google Scholar]
- Jenny TF, Benner SA. Evaluating predictions of secondary structure in proteins. Biochem Biophys Res Commun. 1994 Apr 15;200(1):149–155. [Abstract] [Google Scholar]
- Kneller DG, Cohen FE, Langridge R. Improvements in protein secondary structure prediction by an enhanced neural network. J Mol Biol. 1990 Jul 5;214(1):171–182. [Abstract] [Google Scholar]
- Lim VI. Algorithms for prediction of alpha-helical and beta-structural regions in globular proteins. J Mol Biol. 1974 Oct 5;88(4):873–894. [Abstract] [Google Scholar]
- Mehta PK, Heringa J, Argos P. A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%. Protein Sci. 1995 Dec;4(12):2517–2525. [Europe PMC free article] [Abstract] [Google Scholar]
- Muggleton S, King RD, Sternberg MJ. Protein secondary structure prediction using logic-based machine learning. Protein Eng. 1992 Oct;5(7):647–657. [Abstract] [Google Scholar]
- Padmanabhan S, Marqusee S, Ridgeway T, Laue TM, Baldwin RL. Relative helix-forming tendencies of nonpolar amino acids. Nature. 1990 Mar 15;344(6263):268–270. [Abstract] [Google Scholar]
- Qian N, Sejnowski TJ. Predicting the secondary structure of globular proteins using neural network models. J Mol Biol. 1988 Aug 20;202(4):865–884. [Abstract] [Google Scholar]
- Richardson JS, Richardson DC. Amino acid preferences for specific locations at the ends of alpha helices. Science. 1988 Jun 17;240(4859):1648–1652. [Abstract] [Google Scholar]
- Robson B, Suzuki E. Conformational properties of amino acid residues in globular proteins. J Mol Biol. 1976 Nov 5;107(3):327–356. [Abstract] [Google Scholar]
- Rost B, Sander C. Prediction of protein secondary structure at better than 70% accuracy. J Mol Biol. 1993 Jul 20;232(2):584–599. [Abstract] [Google Scholar]
- Rost B, Sander C, Schneider R. Redefining the goals of protein secondary structure prediction. J Mol Biol. 1994 Jan 7;235(1):13–26. [Abstract] [Google Scholar]
- Russell RB, Barton GJ. The limits of protein secondary structure prediction accuracy from multiple sequence alignment. J Mol Biol. 1993 Dec 20;234(4):951–957. [Abstract] [Google Scholar]
- Solovyev VV, Salamov AA. Predicting alpha-helix and beta-strand segments of globular proteins. Comput Appl Biosci. 1994 Dec;10(6):661–669. [Abstract] [Google Scholar]
- Wako H, Blundell TL. Use of amino acid environment-dependent substitution tables and conformational propensities in structure prediction from aligned sequences of homologous proteins. II. Secondary structures. J Mol Biol. 1994 May 20;238(5):693–708. [Abstract] [Google Scholar]
- White SH. Amino acid preferences of small proteins. Implications for protein stability and evolution. J Mol Biol. 1992 Oct 20;227(4):991–995. [Abstract] [Google Scholar]
- Williams RW, Chang A, Juretić D, Loughran S. Secondary structure predictions and medium range interactions. Biochim Biophys Acta. 1987 Nov 26;916(2):200–204. [Abstract] [Google Scholar]
- Yi TM, Lander ES. Protein secondary structure prediction using nearest-neighbor methods. J Mol Biol. 1993 Aug 20;232(4):1117–1129. [Abstract] [Google Scholar]
- Zhang X, Mesirov JP, Waltz DL. Hybrid system for protein secondary structure prediction. J Mol Biol. 1992 Jun 20;225(4):1049–1063. [Abstract] [Google Scholar]
- Zvelebil MJ, Barton GJ, Taylor WR, Sternberg MJ. Prediction of protein secondary structure and active sites using the alignment of homologous sequences. J Mol Biol. 1987 Jun 20;195(4):957–961. [Abstract] [Google Scholar]
Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society
Full text links
Read article at publisher's site: https://doi.org/10.1002/pro.5560051116
Read article for free, from open access legal sources, via Unpaywall: https://europepmc.org/articles/pmc2143286?pdf=render
Citations & impact
Impact metrics
Article citations
The crosstalk between neuropilin-1 and tumor necrosis factor-α in endothelial cells.
Front Cell Dev Biol, 12:1210944, 27 Jun 2024
Cited by: 0 articles | PMID: 38994453 | PMCID: PMC11236538
KCNQ1 is an essential mediator of the sex-dependent perception of moderate cold temperatures.
Proc Natl Acad Sci U S A, 121(25):e2322475121, 10 Jun 2024
Cited by: 1 article | PMID: 38857404 | PMCID: PMC11194602
Allelic variation and haplotype diversity of Matrilineal (MTL) gene governing in vivo maternal haploid induction in maize.
Physiol Mol Biol Plants, 30(5):823-838, 13 May 2024
Cited by: 0 articles | PMID: 38846462
Comparison, Analysis, and Molecular Dynamics Simulations of Structures of a Viral Protein Modeled Using Various Computational Tools.
Bioengineering (Basel), 10(9):1004, 24 Aug 2023
Cited by: 1 article | PMID: 37760106 | PMCID: PMC10525864
Unveiling an indole alkaloid diketopiperazine biosynthetic pathway that features a unique stereoisomerase and multifunctional methyltransferase.
Nat Commun, 14(1):2558, 03 May 2023
Cited by: 3 articles | PMID: 37137876 | PMCID: PMC10156859
Go to all (234) article citations
Other citations
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Combining prediction of secondary structure and solvent accessibility in proteins.
Proteins, 59(3):467-475, 01 May 2005
Cited by: 169 articles | PMID: 15768403
Protein secondary structure prediction using local alignments.
J Mol Biol, 268(1):31-36, 01 Apr 1997
Cited by: 49 articles | PMID: 9149139
Use of amino acid environment-dependent substitution tables and conformational propensities in structure prediction from aligned sequences of homologous proteins. II. Secondary structures.
J Mol Biol, 238(5):693-708, 01 May 1994
Cited by: 22 articles | PMID: 8182744
[A turning point in the knowledge of the structure-function-activity relations of elastin].
J Soc Biol, 195(2):181-193, 01 Jan 2001
Cited by: 10 articles | PMID: 11727705
Review