Co-reporter:Jitao T. Huang;Dajie J. Xing;Wei Huang
Amino Acids 2012 Volume 43( Issue 2) pp:567-572
Publication Date(Web):2012 August
DOI:10.1007/s00726-011-1189-3
The successful prediction of protein-folding rates based on the sequence-predicted secondary structure suggests that the folding rates might be predicted from sequence alone. To pursue this question, we directly predict the folding rates from amino acid sequences, which do not require any information on secondary or tertiary structure. Our work achieves 88% correlation with folding rates determined experimentally for proteins of all folding types and peptide, suggesting that almost all of the information needed to specify a protein’s folding kinetics and mechanism is comprised within its amino acid sequence. The influence of residue on folding rate is related to amino acid properties. Hydrophobic character of amino acids may be an important determinant of folding kinetics, whereas other properties, size, flexibility, polarity and isoelectric point, of amino acids have contributed little to the folding rate constant.
Co-reporter:Jitao T. Huang, Titi Wang, Shanran R. Huang, Xin Li
Journal of Theoretical Biology (21 October 2015) Volume 383() pp:1-6
Publication Date(Web):21 October 2015
DOI:10.1016/j.jtbi.2015.07.024
•Secondary structures determine folding rate of only large, multi-state proteins.•The importance of secondary structures for protein folding is very different.•α Helix, β strand and bend are well correlated with folding rates of large proteins.Protein folding is a very complicated and highly cooperative dynamic process. However, the folding kinetics is likely to depend more on a few key structural features. Here we find that secondary structures can determine folding rates of only large, multi-state folding proteins and fails to predict those for small, two-state proteins. The importance of secondary structures for protein folding is ordered as: extended β strand>α helix>bend>turn>undefined secondary structure>310 helix>isolated β strand>π helix. Only the first three secondary structures, extended β strand, α helix and bend, can achieve a good correlation with folding rates. This suggests that the rate-limiting step of protein folding would depend upon the formation of regular secondary structures and the buckling of chain. The reduced secondary structure alphabet provides a simplified description for the machine learning applications in protein design.