Protein secondary structure prediction using DWKF based on SVR-NSGAII

被引:21
|
作者
Zangooei, Mohammad Hossein [1 ]
Jalili, Saeed [1 ]
机构
[1] Tarbiat Modares Univ, Elect & Comp Engn Fac, Dept Comp Engn, SCS Lab,Sch Elect & Comp Engn, Tehran, Iran
关键词
Protein secondary structure prediction; Machine learning approach; Support vector regression; Multi-Objective Genetic Algorithm; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; ACCURACY; SUBSTITUTION; ALGORITHMS; ALIGNMENT; HELICES;
D O I
10.1016/j.neucom.2012.04.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prediction of protein secondary structure is an important step towards elucidating its three dimensional structure and its function. This is a challenging problem in bioinformatics. By introduction of machine learning for protein structure prediction, a solution has brought to this challenge to some extent. In the literature of Machine learning or data mining, regression and classification problems are typically viewed as two distinct problems differentiated by continuous or categorical dependent variable. There are endeavors to use regression methods to solve the classification problem and vice versa. To regard a classification problem as a regression one, we proposed a method which is based on Support Vector Regression (SVR) classification model as one of the powerful methods in the field of machine intelligence. We applied non-dominated Sorting Genetic Algorithm II (NSGAII) to find mapping points (MPs) for rounding a real-value to an integer one. Also NSGAII is used for finding out and tuning SVR kernel parameters optimally to enhance the performance of our model and achieve better results. At the other hand, using a suitable SVR kernel function for a particular problem can improve the prediction results remarkably but there is not a kernel which can predict all protein secondary structure classes with acceptable accuracy. Therefore we use a Dynamic Weighted Kernel Fusion (DWKF) method for fusing of three SVR kernels to achieve a supreme performance. Also to improve our method, Position Scoring Matrix (PSSM) profiles are used as the input information to it. The goals of this research are to regulate SVR parameters and fuse different SVR kernel outputs in order to determine protein secondary structure classes accurately. The obtained classification accuracies of our method are 85.79% and 84.94% on RS126 and CB513 datasets respectively and they are promising with regard to other classification methods in the literature. Moreover, for gauging our method behavior in comparison to other state of arts methods, an independent dataset is used and achieves 81.4% accuracy. Our method cannot achieve the best value for any considered performance metrics on an independent dataset but its values for whole metrics are quite acceptable. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:87 / 101
页数:15
相关论文
共 50 条
  • [41] Extraction of Prediction Rules: Protein Secondary Structure Prediction
    Muhamud, Ahmed I.
    Abdelhalim, M. B.
    Mabrouk, Mai S.
    2014 10TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2014, : 21 - 25
  • [42] Protein Secondary Structure Prediction Using Cascaded Feature Learning Model
    Geethu, S.
    Vimina, E. R.
    APPLIED SOFT COMPUTING, 2023, 140
  • [43] Improvement of protein secondary structure prediction using binary word encoding
    Kawabata, T
    Doi, J
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1997, 27 (01): : 36 - 46
  • [44] PROTEIN SECONDARY STRUCTURE PREDICTION USING A STATISTICAL-MECHANICAL METHOD
    KOBAYASHI, Y
    SAITO, N
    PROTEIN ENGINEERING, 1994, 7 (09): : 1164 - 1164
  • [45] Improving the accuracy of protein secondary structure prediction using structural alignment
    Scott Montgomerie
    Shan Sundararaj
    Warren J Gallin
    David S Wishart
    BMC Bioinformatics, 7
  • [46] Protein Secondary Structure Prediction Using Rule Induction from Coverings
    Lee, Leong
    Leopold, Jennifer L.
    Frank, Ronald L.
    Maglia, Anne M.
    CIBCB: 2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2009, : 79 - +
  • [47] Protein secondary structure prediction (PSSP) using different machine algorithms
    Heba M. Afify
    Mohamed B. Abdelhalim
    Mai S. Mabrouk
    Ahmed Y. Sayed
    Egyptian Journal of Medical Human Genetics, 22
  • [48] Protein Secondary Structure Prediction Using AutoEncoder Network and Bayes Classifier
    Wang, Leilei
    Cheng, Jinyong
    2017 INTERNATIONAL SYMPOSIUM ON APPLICATION OF MATERIALS SCIENCE AND ENERGY MATERIALS (SAMSE 2017), 2018, 322
  • [49] Prediction of Protein Secondary Structure Using Feature Selection and Analysis Approach
    Feng, Yonge
    Lin, Hao
    Luo, Liaofu
    ACTA BIOTHEORETICA, 2014, 62 (01) : 1 - 14
  • [50] Protein secondary structure prediction (PSSP) using different machine algorithms
    Afify, Heba M.
    Abdelhalim, Mohamed B.
    Mabrouk, Mai S.
    Sayed, Ahmed Y.
    EGYPTIAN JOURNAL OF MEDICAL HUMAN GENETICS, 2021, 22 (01)