PREDICTION OF PROTEIN FOLDING CLASS FROM AMINO-ACID-COMPOSITION

被引:67
|
作者
DUBCHAK, I
HOLBROOK, SR
KIM, SH
机构
[1] LAWRENCE BERKELEY LAB,DEPT CHEM,BERKELEY,CA 94720
[2] LAWRENCE BERKELEY LAB,DIV STRUCT BIOL,BERKELEY,CA 94720
来源
关键词
PROTEIN STRUCTURE PREDICTION; NEURAL NETWORKS; AMINO ACID COMPOSITION; PROTEIN FOLDING CLASSES; 4-ALPHA-HELICAL BUNDLES; PARALLEL (ALPHA/BETA)8 BARRELS; NUCLEOTIDE BINDING FOLD; IMMUNOGLOBULIN FOLD;
D O I
10.1002/prot.340160109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An empirical relation between the amino acid composition and three-dimensional folding pattern of several classes of proteins has been determined. Computer simulated neural networks have been used to assign proteins to one of the following classes based on their amino acid composition and size: (1) 4alpha-helical bundles, (2) parallel (alpha/beta)8 barrels, (3) nucleotide binding fold, (4) immunoglobulin fold, or (5) none of these. Networks trained on the known crystal structures as well as sequences of closely related proteins are shown to correctly predict folding classes of proteins not represented in the training set with an average accuracy of 87%. Other folding motifs can easily be added to the prediction scheme once larger databases become available. Analysis of the neural network weights reveals that amino acids favoring prediction of a folding class are usually over represented in that class and amino acids with unfavorable weights are underrepresented in composition. The neural networks utilize combinations of these multiple small variations in amino acid composition in order to make a prediction. The favorably weighted amino acids in a given class also form the most intramolecular interactions with other residues in proteins of that class. A detailed examination of the contacts of these amino acids reveals some general patterns that may help stabilize each folding class.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 50 条