HYPROSP: a hybrid protein secondary structure prediction algorithm - a knowledge-based approach

被引:20
|
作者
Wu, KP [1 ]
Lin, HN [1 ]
Chang, JM [1 ]
Sung, TY [1 ]
Hsu, WL [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
关键词
D O I
10.1093/nar/gkh836
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We develop a knowledge-based approach (called PROSP) for protein secondary structure prediction. The knowledge base contains small peptide fragments together with their secondary structural information. A quantitative measure M, called match rate, is defined to measure the amount of structural information that a target protein can extract from the knowledge base. Our experimental results show that proteins with a higher match rate will likely be predicted more accurately based on PROSP. That is, there is roughly a monotone correlation between the prediction accuracy and the amount of structure matching with the knowledge base. To fully utilize the strength of our knowledge base, a hybrid prediction method is proposed as follows: if the match rate of a target protein is at least 80%, we use the extracted information to make the prediction; otherwise, we adopt a popular machine-learning approach. This comprises our hybrid protein structure prediction (HYPROSP) approach. We use the DSSP and EVA data as our datasets and PSIPRED as our underlying machine-learning algorithm. For target proteins with match rate at least 80%, the average Q(3) of PROSP is 3.96 and 7.2 better than that of PSIPRED on DSSP and EVA data, respectively.
引用
收藏
页码:5059 / 5065
页数:7
相关论文
共 50 条
  • [1] HYPROSP II - A knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence
    Lin, HN
    Chang, JM
    Wu, KP
    Sung, TY
    Hsu, WL
    [J]. BIOINFORMATICS, 2005, 21 (15) : 3227 - 3233
  • [2] A knowledge-based approach to protein local structure prediction
    Chen, CT
    Lin, HN
    Wu, KP
    Sung, TY
    Hsu, WL
    [J]. PROCEEDINGS OF THE 4TH ASIA-PACIFIC BIOINFORMATICS CONFERENCE, 2006, 3 : 257 - 266
  • [3] PROTEIN SECONDARY STRUCTURE PREDICTION USING KNOWLEDGE-BASED POTENTIALS
    Saraswathi, Saras
    Jernigan, Robert L.
    Kloczkowski, Andrzej
    Kolinski, Andrzej
    [J]. ICFC 2010/ ICNC 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION AND INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION, 2010, : 370 - 375
  • [4] Knowledge-based protein secondary structure assignment
    Frishman, D
    Argos, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04): : 566 - 579
  • [5] Protein Secondary Structure Prediction Using Knowledge-Based Potentials and An Ensemble of Classifiers
    Sundararajan, Saraswathi
    Gniewek, Pawel
    Jernigan, Robert L.
    Kolinski, Andrzej
    Kloczkowski, Andrzej
    [J]. BIOPHYSICAL JOURNAL, 2010, 98 (03) : 52A - 52A
  • [6] Knowledge-based prediction of protein tertiary structure
    L'Heureux, PJ
    Cromp, B
    Martineau, É
    Gunn, JR
    [J]. COMPUTATIONAL METHODS FOR PROTEIN FOLDING, 2002, 120 : 193 - 222
  • [7] Knowledge-based protein structure prediction.
    Zhao, SR
    Yun, T
    Chen, KX
    [J]. PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 1996, 23 (05) : 422 - 426
  • [8] AN ALGORITHM FOR PROTEIN SECONDARY STRUCTURE PREDICTION BASED ON CLASS PREDICTION
    DELEAGE, G
    ROUX, B
    [J]. PROTEIN ENGINEERING, 1987, 1 (04): : 289 - 294
  • [9] KNOWLEDGE-BASED PREDICTION OF PROTEIN STRUCTURES
    KADEN, F
    KOCH, I
    SELBIG, J
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 1990, 147 (01) : 85 - 100
  • [10] A knowledge-based artificial bee colony algorithm for the 3-D protein structure prediction problem
    Correa, Leonardo de Lima
    Dorn, Marcio
    [J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 949 - 956