Development and rigorous validation of antimalarial predictive models using machine learning approaches

被引:11
|
作者
Danishuddin [1 ]
Madhukar, G. [1 ]
Malik, M. Z. [1 ]
Subbarao, N. [1 ]
机构
[1] Jawaharlal Nehru Univ, Sch Computat & Integrat Sci, New Delhi, India
关键词
Antimalarial; predictive models; machine learning; calibration; predictiveness curve; ARTEMISININ RESISTANCE; DISCOVERY; IDENTIFICATION; QSAR;
D O I
10.1080/1062936X.2019.1635526
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The large collection of known and experimentally verified compounds from the ChEMBL database was used to build different classification models for predicting the antimalarial activity against Plasmodium falciparum. Four different machine learning methods, namely the support vector machine (SVM), random forest (RF), k-nearest neighbour (kNN) and XGBoost have been used for the development of models using the diverse antimalarial dataset from ChEMBL. A well-established feature selection framework was used to select the best subset from a larger pool of descriptors. Performance of the models was rigorously evaluated by evaluation of the applicability domain, Y-scrambling and AUC-ROC curve. Additionally, the predictive power of the models was also assessed using probability calibration and predictiveness curves. SVM and XGBoost showed the best performances, yielding an accuracy of 85% on the independent test set. In term of probability prediction, SVM and XGBoost were well calibrated. Total gain (TG) from the predictiveness curve was more related to SVM (TG = 0.67) and XGBoost (TG = 0.75). These models also predict the high-affinity compounds from PubChem antimalarial bioassay (as external validation) with a high probability score. Our findings suggest that the selected models are robust and can be potentially useful for facilitating the discovery of antimalarial agents.
引用
收藏
页码:543 / 560
页数:18
相关论文
共 50 条
  • [1] DEVELOPMENT OF PREDICTIVE MODELS FOR INCIDENT FRAILTY IN OLDER ADULTS: MACHINE LEARNING APPROACHES
    Jung, Heeeun
    Kim, Miji
    Park, Mina
    Ryu, Jinhyung
    Mun, Kyung-Ryoul
    Won, Chang Won
    [J]. INNOVATION IN AGING, 2022, 6 : 836 - 836
  • [2] Predictive models for concrete properties using machine learning and deep learning approaches: A review
    Moein, Mohammad Mohtasham
    Saradar, Ashkan
    Rahmati, Komeil
    Mousavinejad, Seyed Hosein Ghasemzadeh
    Bristow, James
    Aramali, Vartenie
    Karakouzian, Moses
    [J]. JOURNAL OF BUILDING ENGINEERING, 2023, 63
  • [3] Development and validation of prediction models for papillary thyroid cancer structural recurrence using machine learning approaches
    Wang, Hongxi
    Zhang, Chao
    Li, Qianrui
    Tian, Tian
    Huang, Rui
    Qiu, Jiajun
    Tian, Rong
    [J]. BMC CANCER, 2024, 24 (01)
  • [4] High Accuracy of Predictive Models for SAH Using Different Machine Learning Approaches
    Litvak, Paul
    Medikonda, Jeevan
    Menon, Girish
    Mandava, Pitchaiah
    [J]. STROKE, 2020, 51
  • [5] Predictive classifier models built from natural products with antimalarial bioactivity using machine learning approach
    Egieyeh, Samuel
    Syce, James
    Malan, Sarel F.
    Christoffels, Alan
    [J]. PLOS ONE, 2018, 13 (09):
  • [6] Development of childhood asthma prediction models using machine learning approaches
    Kothalawala, Dilini M.
    Murray, Clare S.
    Simpson, Angela
    Custovic, Adnan
    Tapper, William J.
    Arshad, S. Hasan
    Holloway, John W.
    Rezwan, Faisal, I
    [J]. CLINICAL AND TRANSLATIONAL ALLERGY, 2021, 11 (09)
  • [7] Development and Validation of a Predictive Model for Coronary Artery Disease Using Machine Learning
    Wang, Chen
    Zhao, Yue
    Jin, Bingyu
    Gan, Xuedong
    Liang, Bin
    Xiang, Yang
    Zhang, Xiaokang
    Lu, Zhibing
    Zheng, Fang
    [J]. FRONTIERS IN CARDIOVASCULAR MEDICINE, 2021, 8
  • [8] Using Stacking Approaches for Machine Learning Models
    Pavlyshenko, Bohdan
    [J]. 2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 255 - 258
  • [9] Development of Predictive Models in Patients with Epiphora Using Lacrimal Scintigraphy and Machine Learning
    Park, Yong-Jin
    Bae, Ji Hoon
    Shin, Mu Heon
    Hyun, Seung Hyup
    Cho, Young Seok
    Choe, Yearn Seong
    Choi, Joon Young
    Lee, Kyung-Han
    Kim, Byung-Tae
    Moon, Seung Hwan
    [J]. NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2019, 53 (02) : 125 - 135
  • [10] Development of Predictive Models in Patients with Epiphora Using Lacrimal Scintigraphy and Machine Learning
    Yong-Jin Park
    Ji Hoon Bae
    Mu Heon Shin
    Seung Hyup Hyun
    Young Seok Cho
    Yearn Seong Choe
    Joon Young Choi
    Kyung-Han Lee
    Byung-Tae Kim
    Seung Hwan Moon
    [J]. Nuclear Medicine and Molecular Imaging, 2019, 53 : 125 - 135