FORESST: fold recognition from secondary structure predictions of proteins

被引:35
|
作者
Di Francesco, V
Munson, PJ
Garnier, J
机构
[1] NIH, Analyt Biostat Sect, Math & Stat Comp Lab, Ctr Informat Technol, Bethesda, MD 20892 USA
[2] Inst Genom Res, Blolgiza, Rockville, MD 20850 USA
[3] INRA, Biol Cellulaire & Mol Lab, F-78352 Jouy En Josas, France
关键词
D O I
10.1093/bioinformatics/15.2.131
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A method for recognizing the three-dimensional fold from the protein amino acid sequence based on a combination of hidden Markov models (HMMs) and secondary structure prediction was recently developed for proteins in the Mainly-Alpha structural class. Here, this methodology is extended to Mainly-Beta and Alpha-Beta class proteins. Compared to other fold recognition methods based on HMMs, this approach is novel in that only secondary structure information is used. Each HMM is trained from known secondary structure sequences of proteins having a similar fold. Secondary structure prediction is performed for the amino acid sequence of a query protein. The predicted fold of a query protein is the fold described by the model fitting the predicted sequence the best. Results: After model cross-validation, the success rare on 44 test proteins covering the three structural classes was found to be 59%. On seven fold predictions performed prior to the publication of experimental structure, the success rate was 71%. In conclusion, this approach manages to capture important information about the fold of a protein embedded in the length avid arrangement of the predicted helices, strands and coils along the polypeptide chain. When a more extensive library of HMMs representing the universe of known structural families is available (work in progress), the program will allow rapid screening of genomic databases and sequence annotation when fold similarity is not detectable from the amino acid sequence.
引用
收藏
页码:131 / 140
页数:10
相关论文
共 50 条
  • [21] BeStSel: a web server for accurate protein secondary structure prediction and fold recognition from the circular dichroism spectra
    Micsonai, Andras
    Wien, Frank
    Bulyaki, Eva
    Kun, Judit
    Moussong, Eva
    Lee, Young-Ho
    Goto, Yuji
    Refregiers, Matthieu
    Kardos, Jozsef
    NUCLEIC ACIDS RESEARCH, 2018, 46 (W1) : W315 - W322
  • [22] Secondary structure of proteins and three-dimensional pattern recognition
    Figureau, A
    Soto, MA
    Tohá, J
    JOURNAL OF THEORETICAL BIOLOGY, 1999, 201 (02) : 103 - 111
  • [23] Prediction of secondary structure of proteins on the basis of Bayesian recognition procedures
    Beletskiy, Boris A.
    Vasilyev, Sergey V.
    Gupal, Anatoliy M.
    Journal of Automation and Information Sciences, 2007, 39 (02) : 1 - 9
  • [24] REGULARITY OF THE SECONDARY STRUCTURE OF PROTEINS STUDIED BY PATTERN RECOGNITION METHOD
    陈念贻
    缪强
    ScienceBulletin, 1986, (22) : 1579 - 1580
  • [25] Resonant recognition model defines the secondary structure of bioactive proteins
    Stambuk, N
    Konjevoda, P
    Pokric, B
    Barisic, I
    Martinic, R
    Mrljak, V
    Ramadan, P
    CROATICA CHEMICA ACTA, 2002, 75 (04) : 899 - 908
  • [26] Phylogenetic analysis of membrane trafficking proteins: A family reunion and secondary structure predictions
    Terrian, DM
    White, MK
    EUROPEAN JOURNAL OF CELL BIOLOGY, 1997, 73 (03) : 198 - 204
  • [27] Protein fold recognition using sequence-derived predictions
    Fischer, D
    Eisenberg, D
    PROTEIN SCIENCE, 1996, 5 (05) : 947 - 955
  • [28] Protein fold recognition using residue-based alignments of sequence and secondary structure
    Aydin, Zafer
    Erdogan, Hakan
    Altunbasak, Yucel
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 349 - +
  • [29] CASP5 assessment of fold recognition target predictions
    Kinch, LN
    Wrabl, JO
    Krishna, SS
    Majumdar, I
    Sadreyev, RI
    Qi, Y
    Pei, JM
    Cheng, H
    Grishin, NV
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 : 395 - 409
  • [30] MANIFOLD: protein fold recognition based on secondary structure, sequence similarity and enzyme classification
    Bindewald, E
    Cestaro, A
    Hesser, J
    Heiler, M
    Tosatto, SCE
    PROTEIN ENGINEERING, 2003, 16 (11): : 785 - 789