Stacked Ensemble for Bioactive Molecule Prediction

被引:11
|
作者
Petinrin, Olutomilayo Olayemi [1 ,2 ]
Saeed, Faisal [1 ,3 ]
机构
[1] Univ Teknol Malaysia Johor Bahru, Informat Syst Dept, Fac Comp, Skudai 81310, Malaysia
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Taibah Univ, Coll Comp Sci & Engn, Medina 42353, Saudi Arabia
关键词
Bioactive molecule prediction; chemoinformatics; drug discovery; ensemble; stacked ensemble; VOTING-BASED CLASSIFICATION; RANDOM FOREST; OPTIMIZATION; CLASSIFIERS; EXTRACTION;
D O I
10.1109/ACCESS.2019.2945422
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bioactive molecular compounds are essential for drug discovery. The biological activity of these compounds needs to be predicted as this is used to determine the drug-target ability. As ineffective drugs are discarded after production, leading to resource and time wastage, it is important to predict bioactive molecules with models having high predictive performance. This study utilizes the stacked ensemble which uses the prediction of multiple base classifiers as features, used to train a meta classifier which makes the final prediction. Using three datasets DS1, DS2, and DS3 gotten from MDL Drug Data Report (MDDR) database, the performance of stacked ensemble was compared to three other ensembles: adaboost, bagging, and vote ensemble, based on different evaluation criteria and also a statistical method, Kendall's W test. The accuracy of Stacked ensemble ranged from 96.7002%, 98.2260% and 94.9007% for the three datasets respectively, although Vote had the best accuracy using dataset DS2 which consist of structurally homogeneous bioactive molecules. Also, using Kendall's W test to rank the ensembles, Stacked ensemble was ranked best with datasets DS1 and DS3, with both having a mean average of 4.00 and an overall level of agreement, W, of 0.986 and 1.000 respectively. Using dataset DS2, it was ranked after Vote and Adaboost with mean average of 2.33 and an overall level of agreement, W of 0.857. Stacked ensemble is recommended for the prediction of heterogeneous bioactive molecules during drug discovery and can also be implemented in other research areas.
引用
收藏
页码:153952 / 153957
页数:6
相关论文
共 50 条
  • [31] Prediction of mortality in sepsis patients using stacked ensemble machine learning algorithm
    Babu, M.
    Sappani, M.
    Joy, M.
    Chandiraseharan, V. K.
    Jeyaseelan, L.
    Sudarsanam, T. D.
    JOURNAL OF POSTGRADUATE MEDICINE, 2024, 70 (04) : 209 - 216
  • [32] Explainable prediction of daily hospitalizations for cerebrovascular disease using stacked ensemble learning
    Lu, Xiaoya
    Qiu, Hang
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [33] Explainable prediction of daily hospitalizations for cerebrovascular disease using stacked ensemble learning
    Xiaoya Lu
    Hang Qiu
    BMC Medical Informatics and Decision Making, 23
  • [34] Capreomycin resistance prediction in two species of Mycobacterium using a stacked ensemble method
    Chowdhury, A. S.
    Khaledian, E.
    Broschat, S. L.
    JOURNAL OF APPLIED MICROBIOLOGY, 2019, 127 (06) : 1656 - 1664
  • [35] Prediction of the Indian summer monsoon using a stacked autoencoder and ensemble regression model
    Saha, Moumita
    Santara, Anirban
    Mitra, Pabitra
    Chakraborty, Arun
    Nanjundiah, Ravi S.
    INTERNATIONAL JOURNAL OF FORECASTING, 2021, 37 (01) : 58 - 71
  • [36] Deep Stacked Ensemble Recommender
    Otunba, Rasaq
    Rufai, Raimi A.
    Lin, Jessica
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2019), 2019, : 197 - 200
  • [37] Smart grid stability prediction using Adaptive Aquila Optimizer and ensemble stacked BiLSTM
    Al-Selwi, Safwan Mahmood
    Hassan, Mohd Fadzil
    Abdulkadir, Said Jadid
    Ragab, Mohammed Gamal
    Alqushaibi, Alawi
    Sumiea, Ebrahim Hamid
    RESULTS IN ENGINEERING, 2024, 24
  • [38] Survival Prediction of Heart Failure Patients using Stacked Ensemble Machine Learning Algorithm
    Zaman, S. M. Mehedi
    Qureshi, Wasay Mahmood
    Raihan, Md Mohsin Sarker
    Bin Shams, Abdullah
    Sultana, Sharmin
    2021 IEEE INTERNATIONAL WOMEN IN ENGINEERING (WIE) CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2022, : 117 - 120
  • [39] Stacked ensemble model for accurate crop yield prediction using machine learning techniques
    Ramesh, V
    Kumaresan, P.
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2025, 7 (03):
  • [40] Ensemble Learning Stacked Generalization Algorithm for Type II/Gestational Diabetes Mellitus Prediction
    Conahap, A. D.
    Oguis, G. F.
    Ligue, K. D.
    Manliguez, C.
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2024, 209