Effect of simple ensemble methods on protein secondary structure prediction

Cited by: 21
Authors
Bouziane, Hafida [1]
Messabih, Belhadri [1]
Chouarfia, Abdallah [1]
Affiliations
[1] USTO MB Univ, Dept Comp Sci, El Mnaouer, Oran, Algeria
Keywords
Ensemble methods; Simple aggregation rules; Weighted opinions pooling; Protein secondary structure prediction; Neural network; Classifier
DOI
10.1007/s00500-014-1355-0
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Ensemble methods for building improved classifier models have been an important topic in machine learning, pattern recognition and data mining, where they have shown great promise. Their robustness has driven their application to many practical classification problems, especially when there is significant diversity among the ensemble members. In many applications they replace traditional machine learning techniques, and special attention has been devoted to them as a means of improving prediction accuracy for highly complex problems. Several combination rules have been investigated in this context. However, it is claimed that no rule is always better than the others for designing an optimal decision. The present study evaluates the performance of two different ensemble methods for protein secondary structure prediction. We focus on weighted opinions pooling and the most common aggregation rules for decision inference. The ensemble members are accurate single-model protein secondary structure predictors, namely Multi-Class Support Vector Machines and Artificial Neural Networks. Experiments are carried out using cross-validation tests on the RS126 and CB513 benchmark datasets. Our results confirm that ensembles are more accurate than a single model. The experimental comparison of the investigated ensemble schemes shows that the newly introduced rule, called Exponential Opinion Pool, competes well against state-of-the-art fixed rules, especially the sum rule, which in some cases is able to achieve better performance.
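The fixed aggregation rules and weighted pooling mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the three-state labels (H, E, C), the example probabilities and the weights are all assumptions made for illustration. The weighted scheme shown is the standard linear opinion pool; the paper's Exponential Opinion Pool is not defined in this record, so it is not reproduced here.

```python
from math import prod

# Hypothetical three-state labels: helix, strand, coil.
CLASSES = ("H", "E", "C")


def normalize(scores):
    """Rescale non-negative scores so they sum to 1."""
    total = sum(scores)
    if total == 0:
        raise ValueError("all scores are zero; cannot normalize")
    return [s / total for s in scores]


def sum_rule(probs):
    """Sum rule: average each class probability over the members."""
    n = len(probs)
    return [sum(p[k] for p in probs) / n for k in range(len(CLASSES))]


def product_rule(probs):
    """Product rule: multiply class probabilities, then renormalize."""
    return normalize([prod(p[k] for p in probs) for k in range(len(CLASSES))])


def max_rule(probs):
    """Max rule: keep the highest probability any member gives each class."""
    return normalize([max(p[k] for p in probs) for k in range(len(CLASSES))])


def majority_vote(probs):
    """Majority vote: each member votes for its most probable class."""
    votes = [0] * len(CLASSES)
    for p in probs:
        votes[p.index(max(p))] += 1
    return normalize(votes)


def linear_opinion_pool(probs, weights):
    """Weighted sum of member probabilities (weights are assumed inputs)."""
    w = normalize(weights)
    return [
        sum(wi * p[k] for wi, p in zip(w, probs))
        for k in range(len(CLASSES))
    ]


def predict(pooled):
    """Return the label with the highest pooled score."""
    return CLASSES[pooled.index(max(pooled))]


# Made-up outputs of three ensemble members for one residue.
members = [
    [0.6, 0.3, 0.1],
    [0.2, 0.5, 0.3],
    [0.5, 0.1, 0.4],
]

for name, rule in [("sum", sum_rule), ("product", product_rule),
                   ("max", max_rule), ("vote", majority_vote)]:
    print(name, predict(rule(members)))

# Giving the second member three times the weight changes the decision.
print("weighted", predict(linear_opinion_pool(members, [1, 3, 1])))
```

With these made-up numbers the four fixed rules all select H, while the weighted pool selects E once the second member dominates, which shows why the choice of weights matters in opinion pooling.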
Pages: 1663-1678
Number of pages: 16