Protein secondary structure prediction using DWKF based on SVR-NSGAII

被引:21
|
作者
Zangooei, Mohammad Hossein [1 ]
Jalili, Saeed [1 ]
机构
[1] Tarbiat Modares Univ, Elect & Comp Engn Fac, Dept Comp Engn, SCS Lab,Sch Elect & Comp Engn, Tehran, Iran
关键词
Protein secondary structure prediction; Machine learning approach; Support vector regression; Multi-Objective Genetic Algorithm; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; ACCURACY; SUBSTITUTION; ALGORITHMS; ALIGNMENT; HELICES;
D O I
10.1016/j.neucom.2012.04.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prediction of protein secondary structure is an important step towards elucidating its three dimensional structure and its function. This is a challenging problem in bioinformatics. By introduction of machine learning for protein structure prediction, a solution has brought to this challenge to some extent. In the literature of Machine learning or data mining, regression and classification problems are typically viewed as two distinct problems differentiated by continuous or categorical dependent variable. There are endeavors to use regression methods to solve the classification problem and vice versa. To regard a classification problem as a regression one, we proposed a method which is based on Support Vector Regression (SVR) classification model as one of the powerful methods in the field of machine intelligence. We applied non-dominated Sorting Genetic Algorithm II (NSGAII) to find mapping points (MPs) for rounding a real-value to an integer one. Also NSGAII is used for finding out and tuning SVR kernel parameters optimally to enhance the performance of our model and achieve better results. At the other hand, using a suitable SVR kernel function for a particular problem can improve the prediction results remarkably but there is not a kernel which can predict all protein secondary structure classes with acceptable accuracy. Therefore we use a Dynamic Weighted Kernel Fusion (DWKF) method for fusing of three SVR kernels to achieve a supreme performance. Also to improve our method, Position Scoring Matrix (PSSM) profiles are used as the input information to it. The goals of this research are to regulate SVR parameters and fuse different SVR kernel outputs in order to determine protein secondary structure classes accurately. The obtained classification accuracies of our method are 85.79% and 84.94% on RS126 and CB513 datasets respectively and they are promising with regard to other classification methods in the literature. Moreover, for gauging our method behavior in comparison to other state of arts methods, an independent dataset is used and achieves 81.4% accuracy. Our method cannot achieve the best value for any considered performance metrics on an independent dataset but its values for whole metrics are quite acceptable. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:87 / 101
页数:15
相关论文
共 50 条
  • [21] PREDICTION OF PROTEIN SECONDARY STRUCTURE
    CHOU, PY
    FASMAN, GD
    BIOPHYSICAL JOURNAL, 1977, 17 (02) : A53 - A53
  • [22] PROTEIN SECONDARY STRUCTURE PREDICTION
    BARTON, GJ
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (03) : 372 - 376
  • [23] Label Sequence Learning Based Protein Secondary Structure Prediction Using Hydrophobicity Scales
    Vinodhini, R.
    Vijaya, M. S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2011), VOL 2, 2012, 131 : 611 - +
  • [24] Protein Secondary Structure Prediction Using Knowledge-Based Potentials and An Ensemble of Classifiers
    Sundararajan, Saraswathi
    Gniewek, Pawel
    Jernigan, Robert L.
    Kolinski, Andrzej
    Kloczkowski, Andrzej
    BIOPHYSICAL JOURNAL, 2010, 98 (03) : 52A - 52A
  • [25] Protein secondary structure prediction by using deep learning method
    Wang, Yangxu
    Mao, Hua
    Yi, Zhang
    KNOWLEDGE-BASED SYSTEMS, 2017, 118 : 115 - 123
  • [26] Protein Secondary Structure Prediction using Large Margin Methods
    Tang, Buzhou
    Wang, Xuan
    Wang, Xiaolong
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 142 - 146
  • [27] Prediction of protein secondary structure based on continuous wavelet transform
    Qiu, JD
    Liang, RP
    Zou, XY
    Mo, JY
    TALANTA, 2003, 61 (03) : 285 - 293
  • [28] A SEGMENT-BASED APPROACH TO PROTEIN SECONDARY STRUCTURE PREDICTION
    PRESNELL, SR
    COHEN, BI
    COHEN, FE
    BIOCHEMISTRY, 1992, 31 (04) : 983 - 993
  • [29] PREDICTION OF PROTEIN SECONDARY STRUCTURE BASED ON PHYSICAL THEORY - HISTONES
    PTITSYN, OB
    FINKELSTEIN, AV
    PROTEIN ENGINEERING, 1989, 2 (06): : 443 - 447
  • [30] The Research of Protein Secondary Structure Prediction System Based on KDTICM
    Yang, Bingru
    Hou, Wei
    Xie, Yonghong
    Wang, Lijun
    WCECS 2009: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 47 - 51