Predicting protein secondary structure using a mixed-modal SVM method in a compound pyramid model

被引:32
|
作者
Yang, Bingru [1 ]
Wu, Qu [1 ]
Ying, Zhou [1 ]
Sui, Haifeng [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Informat Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein secondary structure prediction; Physicochemical properties; Mixed-modal SVM; Compound pyramid model; FOLD-RECOGNITION; SERVER; ACCURACY; MATRICES;
D O I
10.1016/j.knosys.2010.10.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate protein secondary structure prediction plays an important role in direct tertiary structure modeling, and can also significantly improve sequence analysis and sequence-structure threading for structure and function determination. Hence improving the accuracy of secondary structure prediction is essential for future developments throughout the field of protein research. In this article, we propose a mixed-modal support vector machine (SVM) method for predicting protein secondary structure. Using the evolutionary information contained in the physicochemical properties of each amino acid and a position-specific scoring matrix generated by a PSI-BLAST multiple sequence alignment as input for a mixed-modal SVM, secondary structure can be predicted at significantly increased accuracy. Using a Knowledge Discovery Theory based on the Inner Cognitive Mechanism (KDTICM) method, we have proposed a compound pyramid model, which is composed of three layers of intelligent interface that integrate a mixed-modal SVM (MMS) module, a modified Knowledge Discovery in Databases (KDD*) process, a mixed-modal back propagation neural network (MMBP) module and so on. Testing against data sets of non-redundant protein sequences returned values for the Q(3) accuracy measure that ranged from 84.0% to 85.6%,while values for the SOV99 segment overlap measure ranged from 79.8% to 80.6%. When compared using a blind test dataset from the CASP8 meeting against currently available secondary structure prediction methods, our new approach shows superior accuracy. Availability: http://www.kdd.ustb.edu.cn/protein_Web/. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:304 / 313
页数:10
相关论文
共 50 条
  • [21] Protein secondary structure prediction by using deep learning method
    Wang, Yangxu
    Mao, Hua
    Yi, Zhang
    KNOWLEDGE-BASED SYSTEMS, 2017, 118 : 115 - 123
  • [22] PREDICTING PROTEIN SECONDARY STRUCTURE USING NEURAL NET AND STATISTICAL-METHODS
    STOLORZ, P
    LAPEDES, A
    XIA, Y
    JOURNAL OF MOLECULAR BIOLOGY, 1992, 225 (02) : 363 - 377
  • [23] Prediction of Protein Secondary Structure using SVM-PSSM Classifier Combined by Sequence Features
    Chen, Yehong
    Liu, Yihui
    Cheng, Jinyong
    Wang, Yanchun
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 103 - 106
  • [24] A Novel Approach of Protein Secondary Structure Prediction by SVM Using PSSM Combined by Sequence Features
    Chen, Yehong
    Cheng, Jinyong
    Liu, Yihui
    Park, Pil Seong
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 1, 2018, 15 : 1074 - 1084
  • [25] Predicting the content of camelina protein using FT-IR spectroscopy coupled with SVM model
    Liu, Jun
    Wu, Mengting
    Wang, Mingqing
    Zou, Yuntao
    Tan, Zhenglin
    Wang, Donghai
    Sun, Xiuzhi Susan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S8401 - S8406
  • [26] Predicting the content of camelina protein using FT-IR spectroscopy coupled with SVM model
    Jun Liu
    Mengting Wu
    Mingqing Wang
    Yuntao Zou
    Zhenglin Tan
    Donghai Wang
    Xiuzhi Susan Sun
    Cluster Computing, 2019, 22 : 8401 - 8406
  • [27] Predicting protein secondary structure and solvent accessibility with an improved multiple linear regression method
    Qin, SB
    He, Y
    Pan, XM
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 (03) : 473 - 480
  • [28] HYBP_PSSP: a hybrid back propagation method for predicting protein secondary structure
    Qu, Wu
    Yang, Bingru
    Jiang, Wei
    Wang, Lijun
    NEURAL COMPUTING & APPLICATIONS, 2012, 21 (02): : 337 - 349
  • [29] DANGLE: A Bayesian inferential method for predicting protein backbone dihedral angles and secondary structure
    Cheung, Ming-Sin
    Maguire, Mahon L.
    Stevens, Tim J.
    Broadhurst, R. William
    JOURNAL OF MAGNETIC RESONANCE, 2010, 202 (02) : 223 - 233
  • [30] HYBP_PSSP: a hybrid back propagation method for predicting protein secondary structure
    Wu Qu
    Bingru Yang
    Wei Jiang
    Lijun Wang
    Neural Computing and Applications, 2012, 21 : 337 - 349