Effect of simple ensemble methods on protein secondary structure prediction

被引:21
|
作者
Bouziane, Hafida [1 ]
Messabih, Belhadri [1 ]
Chouarfia, Abdallah [1 ]
机构
[1] USTO MB Univ, Dept Comp Sci, El Mnaouer, Oran, Algeria
关键词
Ensemble methods; Simple aggregation rules; Weighted opinions pooling; Protein secondary structure prediction; NEURAL-NETWORK; CLASSIFIER;
D O I
10.1007/s00500-014-1355-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble methods for building improved classifier models have been an important topic in machine learning, pattern recognition and data mining areas, where they have shown great promise. They boast a robustness that has spearheaded their application in many practical classification problems, especially when there is a significant diversity among the ensemble members. Actually, they replace traditional machine learning techniques in many applications and special attention has been devoted to them as a mean to improve the prediction accuracy for problems of high complexity. Several combination rules have been investigated in this context. However, it is claimed that no rule is always better than others for designing an optimal decision. The present study evaluates the performance of two different ensemble methods for protein secondary structure prediction. We focus on weighted opinions pooling and the most common aggregation rules for decisions inference. The ensemble members are accurate protein secondary structure single model predictors namely, Multi-Class Support Vector Machines and Artificial Neural Networks. Experiments are carried out using cross-validation tests on RS126 and CB513 benchmark datasets. Our results clearly confirm that ensembles are more accurate than a single model and the experimental comparison of the investigated ensemble schemes demonstrates that the newly introduced rule called Exponential Opinion Pool competes well against state-of-the-art fixed rules, especially the sum rule which in some cases is able to achieve better performance.
引用
收藏
页码:1663 / 1678
页数:16
相关论文
共 50 条
  • [21] Evaluation and improvement of multiple sequence methods for protein secondary structure prediction
    Cuff, JA
    Barton, GJ
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1999, 34 (04) : 508 - 519
  • [22] Protein secondary structure reduction methods significantly affect prediction accuracy
    Subair, Saad Osman Abdalla
    Deris, Safaai
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2006, : 296 - +
  • [23] Protein secondary structure prediction methods based on RBF neural networks
    Jing, N.
    Xia, B.
    Zhou, C. G.
    Wang, Y.
    [J]. COMPUTATIONAL METHODS, PTS 1 AND 2, 2006, : 1037 - +
  • [24] A comparison of two machine learning methods for protein secondary structure prediction
    Wang, LH
    Liu, J
    Zhou, HB
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2730 - 2735
  • [25] PROTEIN SECONDARY STRUCTURE PREDICTION USING NEAREST-NEIGHBOR METHODS
    YI, TM
    LANDER, ES
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1993, 232 (04) : 1117 - 1129
  • [26] Ensemble Learning for Protein Secondary Structure Analysis
    Iryanto, Syam B.
    Djatna, Taufik
    Haryanto, Toto
    [J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 409 - 414
  • [27] Prediction of Protein Structure Classes with Ensemble Classifiers
    Bao, Wenzheng
    Chen, Yuehui
    Wang, Dong
    Kong, Fanliang
    Yu, Gaoqiang
    [J]. INTELLIGENT COMPUTING IN BIOINFORMATICS, 2014, 8590 : 330 - 338
  • [28] Ensemble of Template-Free and Template-Based Classifiers for Protein Secondary Structure Prediction
    de Oliveira, Gabriel Bianchin
    Pedrini, Helio
    Dias, Zanoni
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (21)
  • [29] MEMBRANE-PROTEIN SECONDARY STRUCTURE PREDICTION - AN EVALUATION OF EMPIRICAL-METHODS
    CASCIO, M
    MIELKE, DL
    WALLACE, BA
    [J]. BIOPHYSICAL JOURNAL, 1986, 49 (02) : A293 - A293
  • [30] Training set reduction methods for single sequence protein secondary structure prediction
    Pakatci, Isa Kemal
    Aydin, Zafer
    Erdogan, Hakan
    Altunbasak, Yuecel
    [J]. 2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 1038 - +