Stacked Ensemble for Bioactive Molecule Prediction

被引:11
|
作者
Petinrin, Olutomilayo Olayemi [1 ,2 ]
Saeed, Faisal [1 ,3 ]
机构
[1] Univ Teknol Malaysia Johor Bahru, Informat Syst Dept, Fac Comp, Skudai 81310, Malaysia
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Taibah Univ, Coll Comp Sci & Engn, Medina 42353, Saudi Arabia
关键词
Bioactive molecule prediction; chemoinformatics; drug discovery; ensemble; stacked ensemble; VOTING-BASED CLASSIFICATION; RANDOM FOREST; OPTIMIZATION; CLASSIFIERS; EXTRACTION;
D O I
10.1109/ACCESS.2019.2945422
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bioactive molecular compounds are essential for drug discovery. The biological activity of these compounds needs to be predicted as this is used to determine the drug-target ability. As ineffective drugs are discarded after production, leading to resource and time wastage, it is important to predict bioactive molecules with models having high predictive performance. This study utilizes the stacked ensemble which uses the prediction of multiple base classifiers as features, used to train a meta classifier which makes the final prediction. Using three datasets DS1, DS2, and DS3 gotten from MDL Drug Data Report (MDDR) database, the performance of stacked ensemble was compared to three other ensembles: adaboost, bagging, and vote ensemble, based on different evaluation criteria and also a statistical method, Kendall's W test. The accuracy of Stacked ensemble ranged from 96.7002%, 98.2260% and 94.9007% for the three datasets respectively, although Vote had the best accuracy using dataset DS2 which consist of structurally homogeneous bioactive molecules. Also, using Kendall's W test to rank the ensembles, Stacked ensemble was ranked best with datasets DS1 and DS3, with both having a mean average of 4.00 and an overall level of agreement, W, of 0.986 and 1.000 respectively. Using dataset DS2, it was ranked after Vote and Adaboost with mean average of 2.33 and an overall level of agreement, W of 0.857. Stacked ensemble is recommended for the prediction of heterogeneous bioactive molecules during drug discovery and can also be implemented in other research areas.
引用
收藏
页码:153952 / 153957
页数:6
相关论文
共 50 条
  • [21] Using an innovative stacked ensemble algorithm for the accurate prediction of preterm birth
    Ramalingam, Pari
    Sandhya, Maheshwari
    Sankar, Sharmila
    JOURNAL OF THE TURKISH-GERMAN GYNECOLOGICAL ASSOCIATION, 2019, 20 (02) : 70 - 78
  • [22] Sea surface temperature prediction by stacked generalization ensemble of deep learning
    Dai, Hao
    Lei, Famei
    Wei, Guomei
    Zhang, Xining
    Lin, Rui
    Zhang, Weijie
    Shang, Shaoping
    DEEP-SEA RESEARCH PART I-OCEANOGRAPHIC RESEARCH PAPERS, 2024, 209
  • [23] Diabetes Mellitus Prediction and Severity Calculation Using Stacked Ensemble Method
    G. Ananthi
    S. Santhiya
    V. Gokila
    SN Computer Science, 5 (8)
  • [24] DeepBP: Ensemble deep learning strategy for bioactive peptide prediction
    Zhang, Ming
    Zhou, Jianren
    Wang, Xiaohua
    Wang, Xun
    Ge, Fang
    BMC BIOINFORMATICS, 2024, 25 (01):
  • [25] Voting-Based Ensemble Method for Prediction of Bioactive Molecules
    Petinrin, Olutomilayo Olayemi
    Saeed, Faisal
    Al-Hadhrami, Tawfik
    PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND APPLICATIONS (ICKEA), 2017, : 118 - 122
  • [26] A stacked meta-ensemble for protein inter-residue distance prediction
    Rahman, Julia
    Newton, M. A. Hakim
    Hasan, Md Al Mehedi
    Sattar, Abdul
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 148
  • [27] A stacked ensemble model for automatic stroke prediction using only raw electrocardiogram
    Kunwar, Prashant
    Choudhary, Prakash
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 17
  • [28] Stacked ensemble modeling for improved tuberculosis treatment outcome prediction in pediatric cases
    Yilmaz, Yildiran
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (13):
  • [29] PreTP-Stack: Prediction of Therapeutic Peptides Based on the Stacked Ensemble Learing
    Yan, Ke
    Lv, Hongwu
    Wen, Jie
    Guo, Yichen
    Xu, Yong
    Liu, Bin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1337 - 1344
  • [30] Stacked ensemble model for optimized prediction of triangular side orifice discharge coefficient
    Elshaarawy, Mohamed Kamel
    Hamed, Abdelrahman Kamal
    ENGINEERING OPTIMIZATION, 2024,