Stacked Ensemble for Bioactive Molecule Prediction

Cited by: 11
Authors
Petinrin, Olutomilayo Olayemi [1, 2]
Saeed, Faisal [1, 3]
Affiliations
[1] Univ Teknol Malaysia Johor Bahru, Informat Syst Dept, Fac Comp, Skudai 81310, Malaysia
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Taibah Univ, Coll Comp Sci & Engn, Medina 42353, Saudi Arabia
Keywords
Bioactive molecule prediction; chemoinformatics; drug discovery; ensemble; stacked ensemble; VOTING-BASED CLASSIFICATION; RANDOM FOREST; OPTIMIZATION; CLASSIFIERS; EXTRACTION;
DOI
10.1109/ACCESS.2019.2945422
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Bioactive molecular compounds are essential for drug discovery. The biological activity of these compounds needs to be predicted because it is used to determine their drug-target ability. Because ineffective drugs are discarded after production, wasting resources and time, it is important to predict bioactive molecules with models that have high predictive performance. This study uses a stacked ensemble, in which the predictions of multiple base classifiers serve as features for training a meta classifier that makes the final prediction. Using three datasets, DS1, DS2, and DS3, obtained from the MDL Drug Data Report (MDDR) database, the performance of the stacked ensemble was compared with that of three other ensembles (AdaBoost, bagging, and vote) on several evaluation criteria and with a statistical method, Kendall's W test. The accuracy of the stacked ensemble was 96.7002%, 98.2260%, and 94.9007% on the three datasets, respectively, although Vote had the best accuracy on dataset DS2, which consists of structurally homogeneous bioactive molecules. When Kendall's W test was used to rank the ensembles, the stacked ensemble ranked best on datasets DS1 and DS3, with a mean rank of 4.00 on both and an overall level of agreement, W, of 0.986 and 1.000, respectively. On dataset DS2, it ranked after Vote and AdaBoost, with a mean rank of 2.33 and an overall level of agreement, W, of 0.857. The stacked ensemble is recommended for predicting heterogeneous bioactive molecules during drug discovery and can also be applied in other research areas.
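The stacking idea described in the abstract (base-classifier predictions used as features for a meta classifier) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' method: the toy two-feature data, the decision-stump base learners, the lookup-table meta learner, and the names `fit_stump`, `fit_meta`, and `fit_stack` are all hypothetical. For brevity the meta learner is trained on in-sample base predictions; a careful implementation would normally use out-of-fold predictions instead.

```python
from collections import Counter, defaultdict

def fit_stump(X, y, j):
    """Base learner (hypothetical): threshold feature j at the midpoint
    of the two class means. Assumes both classes 0 and 1 are present."""
    m0 = sum(x[j] for x, c in zip(X, y) if c == 0) / y.count(0)
    m1 = sum(x[j] for x, c in zip(X, y) if c == 1) / y.count(1)
    threshold = (m0 + m1) / 2
    high = 1 if m1 > m0 else 0  # label predicted above the threshold
    return lambda x: high if x[j] > threshold else 1 - high

def fit_meta(Z, y):
    """Meta learner (hypothetical): majority label for each pattern of
    base predictions, falling back to the overall majority label."""
    votes = defaultdict(Counter)
    for z, label in zip(Z, y):
        votes[z][label] += 1
    default = Counter(y).most_common(1)[0][0]
    table = {z: c.most_common(1)[0][0] for z, c in votes.items()}
    return lambda z: table.get(z, default)

def fit_stack(X, y):
    """Train one stump per feature, then train the meta learner on the
    stumps' predictions (in-sample here, for brevity)."""
    bases = [fit_stump(X, y, j) for j in range(len(X[0]))]
    Z = [tuple(b(x) for b in bases) for x in X]
    meta = fit_meta(Z, y)
    return lambda x: meta(tuple(b(x) for b in bases))

# Toy data (made up): feature 0 separates the classes, feature 1 does not.
X = [(0.1, 0.2), (0.2, 0.9), (0.8, 0.1), (0.9, 0.8)]
y = [0, 0, 1, 1]

predict = fit_stack(X, y)
print([predict(x) for x in X])
print(predict((0.7, 0.3)), predict((0.3, 0.6)))
```

In this toy case the meta learner ends up relying on the informative base classifier and ignoring the uninformative one, which is the behavior that distinguishes stacking from a plain majority vote.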
Pages: 153952 - 153957
Number of pages: 6
Related papers
50 in total
  • [41] A Stacked Ensemble Approach to Generalize the Classifier Prediction for the Detection of DDoS Attack in Cloud Network
    Verma, Priyanka
    Kowsik, A. Rama Krishna
    Pateriya, R. K.
    Bharot, Nitesh
    Vidyarthi, Ankit
    Gupta, Deepak
    MOBILE NETWORKS & APPLICATIONS, 2023, 29 (5) : 1618 - 1632
  • [42] Use of Ensemble Approach and Stacked Generalization for Neural Network Prediction of Geomagnetic Dst Index
    Shiroky, Vladimir
    Myagkova, Irina
    Dolenko, Sergey
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 541 - 541
  • [43] Stacked ensemble machine learning for porosity and absolute permeability prediction of carbonate rock plugs
    Ramanzani Kalule
    Hamid Ait Abderrahmane
    Waleed Alameri
    Mohamed Sassi
    Scientific Reports, 13
  • [44] Accurate Dissolved Oxygen Prediction for Aquaculture Using Stacked Ensemble Machine Learning Model
    Kozhiparamban, Rasheed Abdul Haq
    Swetha, P.
    Harigovindan, V. P.
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2023, 46 (03) : 203 - 207
  • [45] An integrated framework for rainfall prediction and analysis using a Stacked Heterogeneous Ensemble Model (SHEM)
    Umamaheswari, P.
    Ramaswamy, V.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [46] Electric vehicle energy consumption prediction using stacked generalization: an ensemble learning approach
    Ullah, Irfan
    Liu, Kai
    Yamamoto, Toshiyuki
    Zahid, Muhammad
    Jamal, Arshad
    INTERNATIONAL JOURNAL OF GREEN ENERGY, 2021, 18 (09) : 896 - 909
  • [47] StackDPPred: Multiclass prediction of defensin peptides using stacked ensemble learning with optimized features
    Arif, Muhammad
    Musleh, Saleh
    Ghulam, Ali
    Fida, Huma
    Alqahtani, Yasser
    Alam, Tanvir
    METHODS, 2024, 230 : 129 - 139
  • [48] Stacked ensemble machine learning for porosity and absolute permeability prediction of carbonate rock plugs
    Kalule, Ramanzani
    Abderrahmane, Hamid Ait
    Alameri, Waleed
    Sassi, Mohamed
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [49] Stacked ensemble learning on HaCaT cytotoxicity for skin irritation prediction: A case study on dipterocarpol
    Srisongkram, Tarapong
    Syahid, Nur Fadhilah
    Tookkane, Dheerapat
    Weerapreeyakul, Natthida
    Puthongking, Ploenthip
    FOOD AND CHEMICAL TOXICOLOGY, 2023, 181
  • [50] Prediction of Phishing Websites Using Stacked Ensemble Method and Hybrid Features Selection Method
    Pandey M.K.
    Singh M.K.
    Pal S.
    Tiwari B.B.
    SN Computer Science, 3 (6)