FORESST: fold recognition from secondary structure predictions of proteins

被引:35
|
作者
Di Francesco, V
Munson, PJ
Garnier, J
机构
[1] NIH, Analyt Biostat Sect, Math & Stat Comp Lab, Ctr Informat Technol, Bethesda, MD 20892 USA
[2] Inst Genom Res, Blolgiza, Rockville, MD 20850 USA
[3] INRA, Biol Cellulaire & Mol Lab, F-78352 Jouy En Josas, France
关键词
D O I
10.1093/bioinformatics/15.2.131
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A method for recognizing the three-dimensional fold from the protein amino acid sequence based on a combination of hidden Markov models (HMMs) and secondary structure prediction was recently developed for proteins in the Mainly-Alpha structural class. Here, this methodology is extended to Mainly-Beta and Alpha-Beta class proteins. Compared to other fold recognition methods based on HMMs, this approach is novel in that only secondary structure information is used. Each HMM is trained from known secondary structure sequences of proteins having a similar fold. Secondary structure prediction is performed for the amino acid sequence of a query protein. The predicted fold of a query protein is the fold described by the model fitting the predicted sequence the best. Results: After model cross-validation, the success rare on 44 test proteins covering the three structural classes was found to be 59%. On seven fold predictions performed prior to the publication of experimental structure, the success rate was 71%. In conclusion, this approach manages to capture important information about the fold of a protein embedded in the length avid arrangement of the predicted helices, strands and coils along the polypeptide chain. When a more extensive library of HMMs representing the universe of known structural families is available (work in progress), the program will allow rapid screening of genomic databases and sequence annotation when fold similarity is not detectable from the amino acid sequence.
引用
收藏
页码:131 / 140
页数:10
相关论文
共 50 条
  • [11] RECOGNITION OF SUPER-SECONDARY STRUCTURE IN PROTEINS
    TAYLOR, WR
    THORNTON, JM
    JOURNAL OF MOLECULAR BIOLOGY, 1984, 173 (04) : 487 - 514
  • [12] SECONDARY-STRUCTURE PREDICTIONS OF CALCIUM-BINDING PROTEINS
    ARGOS, P
    BIOCHEMISTRY, 1977, 16 (04) : 665 - 672
  • [13] Fold and function predictions for Mycoplasma genitalium proteins
    Rychlewski, L
    Zhang, BH
    Godzik, A
    FOLDING & DESIGN, 1998, 3 (04): : 229 - 238
  • [14] Accurate secondary structure prediction and fold recognition for circular dichroism spectroscopy
    Micsonai, Andras
    Wien, Frank
    Kernya, Linda
    Lee, Young-Ho
    Goto, Yuji
    Refregiers, Matthieu
    Kardos, Jozsef
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (24) : E3095 - E3103
  • [15] A proposed architecture for the Central domain of the bacterial enhancer-binding proteins based on secondary structure prediction and fold recognition
    Osuna, J
    Soberon, X
    Morett, E
    PROTEIN SCIENCE, 1997, 6 (03) : 543 - 555
  • [16] Predicted structure and fold recognition for the glutamyl tRNA reductase family of proteins
    Brody, SS
    Gough, SP
    Kannangara, CG
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1999, 37 (03): : 485 - 493
  • [17] PREDICTIONS FROM THE REGULARITIES OF THE PRIMARY STRUCTURE OF PROTEINS
    SIMON, I
    PEPTIDE RESEARCH, 1993, 6 (05): : 260 - 262
  • [18] Assessment of fold recognition predictions in CASP6
    Wang, G
    Jin, YM
    Dunbrack, RL
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 : 46 - 66
  • [19] TertProt: A Protein Fold Recognition Method Using Protein Secondary Structure Program
    Kaladhar, D. S. V. G. K.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 161 - 168
  • [20] Structure classification-based assessment of CASP3 predictions for the fold recognition targets
    Murzin, AG
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1999, : 88 - 103