Development and rigorous validation of antimalarial predictive models using machine learning approaches

被引:11
|
作者
Danishuddin [1 ]
Madhukar, G. [1 ]
Malik, M. Z. [1 ]
Subbarao, N. [1 ]
机构
[1] Jawaharlal Nehru Univ, Sch Computat & Integrat Sci, New Delhi, India
关键词
Antimalarial; predictive models; machine learning; calibration; predictiveness curve; ARTEMISININ RESISTANCE; DISCOVERY; IDENTIFICATION; QSAR;
D O I
10.1080/1062936X.2019.1635526
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The large collection of known and experimentally verified compounds from the ChEMBL database was used to build different classification models for predicting the antimalarial activity against Plasmodium falciparum. Four different machine learning methods, namely the support vector machine (SVM), random forest (RF), k-nearest neighbour (kNN) and XGBoost have been used for the development of models using the diverse antimalarial dataset from ChEMBL. A well-established feature selection framework was used to select the best subset from a larger pool of descriptors. Performance of the models was rigorously evaluated by evaluation of the applicability domain, Y-scrambling and AUC-ROC curve. Additionally, the predictive power of the models was also assessed using probability calibration and predictiveness curves. SVM and XGBoost showed the best performances, yielding an accuracy of 85% on the independent test set. In term of probability prediction, SVM and XGBoost were well calibrated. Total gain (TG) from the predictiveness curve was more related to SVM (TG = 0.67) and XGBoost (TG = 0.75). These models also predict the high-affinity compounds from PubChem antimalarial bioassay (as external validation) with a high probability score. Our findings suggest that the selected models are robust and can be potentially useful for facilitating the discovery of antimalarial agents.
引用
下载
收藏
页码:543 / 560
页数:18
相关论文
共 50 条
  • [21] Explaining and Integrating Machine Learning Models with Rigorous Simulation
    Schoeneberger, Jan C.
    Aker, Burcu
    Fricke, Armin
    CHEMIE INGENIEUR TECHNIK, 2021, 93 (12) : 1998 - 2009
  • [22] Development, comparison, and internal validation of prediction models to determine the visual prognosis of patients with open globe injuries using machine learning approaches
    Shariati, Mehrdad Motamed
    Eslami, Saeid
    Shoeibi, Nasser
    Eslampoor, Alireza
    Sedaghat, Mohammadreza
    Gharaei, Hamid
    Zarei-Ghanavati, Siamak
    Derakhshan, Akbar
    Abrishami, Majid
    Abrishami, Mojtaba
    Hosseini, Seyedeh Maryam
    Rad, Saeed Shokuhi
    Astaneh, Mohammadreza Ansari
    Farimani, Raheleh Mahboub
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [23] Machine learning-based predictive models for the occurrence of behavioral and psychological symptoms of dementia: model development and validation
    Cho, Eunhee
    Kim, Sujin
    Heo, Seok-Jae
    Shin, Jinhee
    Hwang, Sinwoo
    Kwon, Eunji
    Lee, SungHee
    Kim, SangGyun
    Kang, Bada
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [24] Machine learning-based predictive models for the occurrence of behavioral and psychological symptoms of dementia: model development and validation
    Eunhee Cho
    Sujin Kim
    Seok-Jae Heo
    Jinhee Shin
    Sinwoo Hwang
    Eunji Kwon
    SungHee Lee
    SangGyun Kim
    Bada Kang
    Scientific Reports, 13
  • [25] Predictive models for diabetes mellitus using machine learning techniques
    Lai, Hang
    Huang, Huaxiong
    Keshavjee, Karim
    Guergachi, Aziz
    Gao, Xin
    BMC ENDOCRINE DISORDERS, 2019, 19 (01)
  • [26] A predictive study on HCV using automated machine learning models
    Değer, Serbun Ufuk
    Can, Hakan
    Computers in Biology and Medicine, 2025, 188
  • [27] Predictive models for diabetes mellitus using machine learning techniques
    Hang Lai
    Huaxiong Huang
    Karim Keshavjee
    Aziz Guergachi
    Xin Gao
    BMC Endocrine Disorders, 19
  • [28] Predictive Maintenance using Machine Learning Based Classification Models
    Chazhoor, Anisha
    Mounika, Y.
    Sarobin, Vergin Raja M.
    Sanjana, M., V
    Yasashvini, R.
    5TH INTERNATIONAL CONFERENCE ON MATERIALS AND MANUFACTURING ENGINEERING-2020 (ICMME-2020), 2020, 954
  • [29] Predictive models for charitable giving using machine learning techniques
    Farrokhvar, Leily
    Ansari, Azadeh
    Kamali, Behrooz
    PLOS ONE, 2018, 13 (10):
  • [30] Evaluation of predictive models of aneurysm focal growth and bleb development using machine learning techniques
    Hadad, Sara
    Mut, Fernando
    Slawski, Martin
    Robertson, Anne M.
    Cebral, Juan R.
    JOURNAL OF NEUROINTERVENTIONAL SURGERY, 2024, 16 (04) : 392 - 397