FORESST: fold recognition from secondary structure predictions of proteins

被引:35
|
作者
Di Francesco, V
Munson, PJ
Garnier, J
机构
[1] NIH, Analyt Biostat Sect, Math & Stat Comp Lab, Ctr Informat Technol, Bethesda, MD 20892 USA
[2] Inst Genom Res, Blolgiza, Rockville, MD 20850 USA
[3] INRA, Biol Cellulaire & Mol Lab, F-78352 Jouy En Josas, France
关键词
D O I
10.1093/bioinformatics/15.2.131
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A method for recognizing the three-dimensional fold from the protein amino acid sequence based on a combination of hidden Markov models (HMMs) and secondary structure prediction was recently developed for proteins in the Mainly-Alpha structural class. Here, this methodology is extended to Mainly-Beta and Alpha-Beta class proteins. Compared to other fold recognition methods based on HMMs, this approach is novel in that only secondary structure information is used. Each HMM is trained from known secondary structure sequences of proteins having a similar fold. Secondary structure prediction is performed for the amino acid sequence of a query protein. The predicted fold of a query protein is the fold described by the model fitting the predicted sequence the best. Results: After model cross-validation, the success rare on 44 test proteins covering the three structural classes was found to be 59%. On seven fold predictions performed prior to the publication of experimental structure, the success rate was 71%. In conclusion, this approach manages to capture important information about the fold of a protein embedded in the length avid arrangement of the predicted helices, strands and coils along the polypeptide chain. When a more extensive library of HMMs representing the universe of known structural families is available (work in progress), the program will allow rapid screening of genomic databases and sequence annotation when fold similarity is not detectable from the amino acid sequence.
引用
收藏
页码:131 / 140
页数:10
相关论文
共 50 条
  • [41] Predictions of protein fold switching from sequence
    Porter, Lauren
    Schafer, Joseph
    Chakravarty, Devlina
    PROTEIN SCIENCE, 2023, 32 (12)
  • [42] Nucleic acid recognition by OB-fold proteins
    Theobald, DL
    Mitton-Fry, RM
    Wuttke, DS
    ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 2003, 32 : 115 - 133
  • [43] CRYSTALLIZATION OF ALCOHOL OXIDASE FROM PICHIA-PASTORIS - SECONDARY STRUCTURE PREDICTIONS INDICATE A DOMAIN WITH THE EIGHTFOLD BETA-ALPHA-BARREL FOLD
    TYKARSKA, E
    LEBIODA, L
    MARCHUT, E
    STECZKO, J
    STEC, B
    JOURNAL OF PROTEIN CHEMISTRY, 1990, 9 (01): : 83 - 86
  • [44] SECONDARY STRUCTURE PREDICTIONS FOR RAT OSTEOPONTIN
    PRINCE, CW
    CONNECTIVE TISSUE RESEARCH, 1989, 21 (1-4) : 345 - 350
  • [45] IMPROVING THE ACCURACY OF SECONDARY STRUCTURE PREDICTIONS
    BUSETTA, B
    HOSPITAL, M
    BIOCHIMIE, 1981, 63 (11-1) : 951 - 954
  • [46] PATTERN-RECOGNITION OF THE SECONDARY STRUCTURE OF PROTEINS (ALPHA-HELIX AND BETA-STRUCTURE)
    TOHA, JC
    SOTO, MA
    CHINGA, H
    JOURNAL OF THEORETICAL BIOLOGY, 1990, 146 (02) : 145 - 156
  • [47] Structural protein fold recognition based on secondary structure and evolutionary information using machine learning algorithms
    Qin, Xinyi
    Liu, Min
    Zhang, Lu
    Liu, Guangzhong
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2021, 91
  • [48] Protein topology recognition from secondary structure sequences: Application of the hidden Markov models to the alpha class proteins
    DiFrancesco, V
    Garnier, J
    Munson, PJ
    JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (02) : 446 - 463
  • [49] Identifying good predictions of RNA secondary structure
    Nebel, ME
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2004, 2003, : 423 - 434
  • [50] PROTEIN SECONDARY STRUCTURE PREDICTIONS OF MHC PRODUCTS
    CHURCH, WR
    REISFELD, RA
    FEDERATION PROCEEDINGS, 1981, 40 (03) : 1112 - 1112