Effect of simple ensemble methods on protein secondary structure prediction

Cited by: 21
Authors
Bouziane, Hafida [1]
Messabih, Belhadri [1]
Chouarfia, Abdallah [1]
Affiliations
[1] USTO MB Univ, Dept Comp Sci, El Mnaouer, Oran, Algeria
Keywords
Ensemble methods; Simple aggregation rules; Weighted opinions pooling; Protein secondary structure prediction; NEURAL-NETWORK; CLASSIFIER;
DOI
10.1007/s00500-014-1355-0
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Ensemble methods for building improved classifier models have been an important topic in machine learning, pattern recognition and data mining, where they have shown great promise. Their robustness has driven their application to many practical classification problems, especially when there is significant diversity among the ensemble members. They have replaced traditional machine learning techniques in many applications, and special attention has been devoted to them as a means of improving prediction accuracy for problems of high complexity. Several combination rules have been investigated in this context. However, it is claimed that no rule is always better than the others for reaching an optimal decision. The present study evaluates the performance of two different ensemble methods for protein secondary structure prediction. We focus on weighted opinion pooling and the most common aggregation rules for decision inference. The ensemble members are accurate single-model protein secondary structure predictors, namely Multi-Class Support Vector Machines and Artificial Neural Networks. Experiments are carried out using cross-validation tests on the RS126 and CB513 benchmark datasets. Our results clearly confirm that ensembles are more accurate than a single model. The experimental comparison of the investigated ensemble schemes demonstrates that the newly introduced rule, called Exponential Opinion Pool, competes well against state-of-the-art fixed rules, especially the sum rule, which in some cases is able to achieve better performance.
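The fixed aggregation rules and weighted pooling mentioned in the abstract can be illustrated with a minimal sketch. It assumes each ensemble member outputs a probability vector over the three secondary structure classes (helix H, strand E, coil C) for one residue. The function names, the example probabilities and the weights are hypothetical, chosen only for illustration. The weighted rule shown is a plain linear opinion pool, not the paper's Exponential Opinion Pool, whose definition is not given in this record.

```python
from math import prod

CLASSES = ("H", "E", "C")  # helix, strand, coil


def normalize(scores):
    """Rescale non-negative scores so they sum to 1."""
    total = sum(scores)
    return [s / total for s in scores]


def sum_rule(members):
    """Average the members' class probabilities."""
    n = len(members)
    return [sum(col) / n for col in zip(*members)]


def product_rule(members):
    """Multiply the members' class probabilities, then renormalize."""
    return normalize([prod(col) for col in zip(*members)])


def max_rule(members):
    """Keep the highest probability any member gives each class."""
    return normalize([max(col) for col in zip(*members)])


def majority_vote(members):
    """Each member votes for its top class; return vote shares."""
    votes = [0] * len(members[0])
    for probs in members:
        votes[probs.index(max(probs))] += 1
    return normalize(votes)


def linear_opinion_pool(members, weights):
    """Weighted average of member probabilities (weights are assumed)."""
    w = normalize(weights)
    return [sum(wi * p for wi, p in zip(w, col)) for col in zip(*members)]


def decide(pooled):
    """Return the class label with the highest pooled score."""
    return CLASSES[pooled.index(max(pooled))]


# Hypothetical outputs of three members for a single residue.
members = [
    [0.6, 0.3, 0.1],  # e.g. a multi-class SVM
    [0.2, 0.5, 0.3],  # e.g. a neural network
    [0.1, 0.5, 0.4],  # e.g. a second neural network
]

for rule in (sum_rule, product_rule, max_rule, majority_vote):
    print(rule.__name__, decide(rule(members)))
print("linear_opinion_pool", decide(linear_opinion_pool(members, [0.8, 0.1, 0.1])))
```

On this toy input the rules disagree: the sum, product and majority rules follow the two members that favor strand, while the max rule and a pool that heavily weights the first member follow its confident helix prediction. This is one way to see why no single rule is expected to be best in every case.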
Pages: 1663-1678
Page count: 16
Related papers
50 in total
  • [1] Effect of simple ensemble methods on protein secondary structure prediction
    Hafida Bouziane
    Belhadri Messabih
    Abdallah Chouarfia
    [J]. Soft Computing, 2015, 19 : 1663 - 1678
  • [2] Combining protein secondary structure prediction models with ensemble methods of optimal complexity
    Guermeur, Y
    Pollastri, G
    Elisseeff, A
    Zelus, D
    Paugam-Moisy, H
    Baldi, P
    [J]. NEUROCOMPUTING, 2004, 56 : 305 - 327
  • [3] Efficient ensemble schemes for protein secondary structure prediction
    Liu, Kun-Hong
    Xia, Jun-Feng
    Li, Xueling
    [J]. PROTEIN AND PEPTIDE LETTERS, 2008, 15 (05) : 488 - 493
  • [4] Fusion of BLAST and Ensemble of Classifiers for Protein Secondary Structure Prediction
    de Oliveira, Gabriel Bianchin
    Pedrini, Helio
    Dias, Zanoni
    [J]. 2020 33RD SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2020), 2020, : 308 - 315
  • [5] New methods for accurate prediction of protein secondary structure
    Chandonia, JM
    Karplus, M
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1999, 35 (03) : 293 - 306
  • [6] Validated accessment of protein secondary structure prediction methods
    Pappas, G
    Subramaniam, S
    [J]. BIOPHYSICAL JOURNAL, 1998, 74 (02) : A282 - A282
  • [7] Multi-layer ensemble classifiers on protein secondary structure prediction
    Li, Wei
    Chen, Yuehui
    Zhao, Yaou
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 79 - +
  • [8] Protein Ensemble Learning with Atrous Spatial Pyramid Networks for Secondary Structure Prediction
    Guo, Yuzhi
    Wu, Jiaxiang
    Ma, Hehuan
    Wang, Sheng
    Huang, Junzhou
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 17 - 22
  • [9] MMEC: Multi-Modal Ensemble Classifier for Protein Secondary Structure Prediction
    de Oliveira, Gabriel Bianchin
    Pedrini, Helio
    Dias, Zanoni
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2021, PT 1, 2021, 13052 : 175 - 184
  • [10] A Bi-LSTM Based Ensemble Algorithm for Prediction of Protein Secondary Structure
    Hu, Hailong
    Li, Zhong
    Elofsson, Arne
    Xie, Shangxin
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (17):