A Novel Approach of Protein Secondary Structure Prediction by SVM Using PSSM Combined by Sequence Features

Times cited: 0
Authors
Chen, Yehong [1 ]
Cheng, Jinyong [2 ]
Liu, Yihui [2 ]
Park, Pil Seong [3 ]
Affiliations
[1] Qilu Univ Technol, Sch Graph Commun & Packaging, Jinan, Shandong, Peoples R China
[2] Qilu Univ Technol, Sch Informat, Jinan, Shandong, Peoples R China
[3] Univ Suwon, Dept Comp Sci, Suwon, South Korea
Funding
National Natural Science Foundation of China;
Keywords
Protein secondary structure prediction; SVM; Position specific scoring matrices; Sequence feature; Amino acid scale; ProtScale;
DOI
10.1007/978-3-319-56994-9_74
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Knowledge of protein secondary structure is a useful step toward predicting the 3D structure of a protein. In this paper, a support vector machine (SVM) based method for secondary structure prediction is described in detail. Protein sequence data are represented in a hybrid form combining the Position-Specific Scoring Matrix (PSSM), the Hydrophobicity Sequence Feature (HSF), and the Structural Sequence Feature (SSF). Protein sequences are obtained from the CB513 dataset, the corresponding PSSM profiles are obtained from the PSI-BLAST program, and sequence features are computed from amino acid scales provided by the ExPASy website (http://web.expasy.org/protscale/). PSSM profiles serve as the baseline input to the SVM-PSSM classifier for secondary structure prediction. To construct more accurate classifiers, more than 40 sequence features (SFs) are examined as additional input vectors to the SVM-PSSM classifier for feature selection. The most accurate classifier in this study is constructed using a combination of PSSM and a few relevant sequence features. The experimental results show that relevant sequence features extracted from the hydrophobicity index and structural conformational parameters can improve the SVM-PSSM classifier for the prediction of protein secondary structure elements. Our proposed final SVM-PSSM-SF method achieved an overall accuracy of 78%.
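The hybrid representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a sliding window over residues (the window size of 13 is a placeholder, since the abstract does not state one), zero padding at the chain ends, and the Kyte-Doolittle hydropathy scale as a stand-in for whichever ProtScale scales the paper selected. The function name `window_features` is hypothetical.

```python
# Kyte-Doolittle hydropathy values, one of the amino acid scales listed on
# ProtScale. Using this particular scale is an assumption for illustration.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_features(sequence, pssm, window=13):
    """Build one feature vector per residue.

    sequence: string of one-letter amino acid codes.
    pssm: list of rows, one per residue, each a list of PSSM scores
          (20 columns for a standard PSI-BLAST profile).
    window: odd window size (assumed value, not taken from the paper).

    Each window position contributes its PSSM row followed by one
    hydropathy value; positions outside the chain are zero padded.
    """
    if len(sequence) != len(pssm):
        raise ValueError("sequence and PSSM must have the same length")
    if window % 2 == 0:
        raise ValueError("window size must be odd")

    half = window // 2
    n_cols = len(pssm[0]) if pssm else 20
    features = []
    for center in range(len(sequence)):
        vector = []
        for pos in range(center - half, center + half + 1):
            if 0 <= pos < len(sequence):
                vector.extend(pssm[pos])
                # Unknown residue codes (e.g. "X") fall back to 0.0.
                vector.append(KYTE_DOOLITTLE.get(sequence[pos], 0.0))
            else:
                vector.extend([0.0] * (n_cols + 1))
        features.append(vector)
    return features
```

The resulting vectors, paired with per-residue secondary structure labels, could then be passed to any SVM implementation, for example scikit-learn's `sklearn.svm.SVC`. The kernel and its parameters would need to be tuned, as the abstract does not report them.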
Pages: 1074-1084
Number of pages: 11