Prediction and Interpretation Microglia Cytotoxicity by Machine Learning

被引:0
|
作者
Liu, Qing [1 ]
He, Dakuo [1 ]
Fan, Mengmeng [1 ]
Wang, Jinpeng [1 ]
Cui, Zeyu [1 ]
Wang, Hao [1 ]
Mi, Yan [2 ]
Li, Ning [3 ]
Meng, Qingqi [2 ]
Hou, Yue [2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Coll Life & Hlth Sci, Natl Frontiers Sci Ctr Ind Intelligence & Syst Opt, Key Lab Bioresource Res & Dev Liaoning Prov,Key La, Shenyang 110169, Peoples R China
[3] Shenyang Pharmaceut Univ, Sch Tradit Chinese Mat Med, Key Lab TCM Mat Basis Study & Innovat Drug Dev She, Shenyang 110016, Peoples R China
关键词
ADMET EVALUATION; TOXICITY; CLASSIFICATION; INFORMATION; MODEL; QSAR;
D O I
10.1021/acs.jcim.4c00366
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Ameliorating microglia-mediated neuroinflammation is a crucial strategy in developing new drugs for neurodegenerative diseases. Plant compounds are an important screening target for the discovery of drugs for the treatment of neurodegenerative diseases. However, due to the spatial complexity of phytochemicals, it becomes particularly important to evaluate the effectiveness of compounds while avoiding the mixing of cytotoxic substances in the early stages of compound screening. Traditional high-throughput screening methods suffer from high cost and low efficiency. A computational model based on machine learning provides a novel avenue for cytotoxicity determination. In this study, a microglia cytotoxicity classifier was developed using a machine learning approach. First, we proposed a data splitting strategy based on the molecule murcko generic scaffold, under this condition, three machine learning approaches were coupled with three kinds of molecular representation methods to construct microglia cytotoxicity classifier, which were then compared and assessed by the predictive accuracy, balanced accuracy, F-1-score, and Matthews Correlation Coefficient. Then, the recursive feature elimination integrated with support vector machine (RFE-SVC) dimension reduction method was introduced to molecular fingerprints with high dimensions to further improve the model performance. Among all the microglial cytotoxicity classifiers, the SVM coupled with ECFP4 fingerprint after feature selection (ECFP4-RFE-SVM) obtained the most accurate classification for the test set (ACC of 0.99, BA of 0.99, F-1-score of 0.99, MCC of 0.97). Finally, the Shapley additive explanations (SHAP) method was used in interpreting the microglia cytotoxicity classifier and key substructure smart identified as structural alerts. Experimental results show that ECFP4-RFE-SVM have reliable classification capability for microglia cytotoxicity, and SHAP can not only provide a rational explanation for microglia cytotoxicity predictions, but also offer a guideline for subsequent molecular cytotoxicity modifications.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Machine Learning for Energy Load Prediction and its Interpretation
    Charytanowicz, Malgorzata
    Olwert, Anna
    Radziszewska, Weronika
    Jarnicka, Jolanta
    Gajowniczek, Krzysztof
    Zabkowski, Tomasz
    Brozyna, Jacek
    Mentel, Grzegorz
    Matejko, Grzegorz
    [J]. 2022 IEEE 11th International Conference on Intelligent Systems, IS 2022, 2022,
  • [2] Machine learning prediction and interpretation of the impact of microplastics on soil properties
    Withana, Piumi Amasha
    Li, Jie
    Senadheera, Sachini Supunsala
    Fan, Chuanfang
    Wang, Yin
    Ok, Yong Sik
    [J]. ENVIRONMENTAL POLLUTION, 2024, 341
  • [3] Interpretation of machine learning methods for the prediction of breast and prostate cancer risk
    Klie, Adam
    Talwar, James
    Pagadala, Meghana
    Carter, Hannah
    [J]. CANCER RESEARCH, 2022, 82 (12)
  • [4] Validation of Machine Learning Models for Structural Dam Behaviour Interpretation and Prediction
    Mata, Juan
    Salazar, Fernando
    Barateiro, Jose
    Antunes, Antonio
    [J]. WATER, 2021, 13 (19)
  • [5] A Comparison and Interpretation of Machine Learning Algorithm for the Prediction of Online Purchase Conversion
    Lee, Jungwon
    Jung, Okkyung
    Lee, Yunhye
    Kim, Ohsung
    Park, Cheol
    [J]. JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2021, 16 (05): : 1472 - 1491
  • [6] CYTOTOXICITY OF MICROGLIA
    BANATI, RB
    GEHRMANN, J
    SCHUBERT, P
    KREUTZBERG, GW
    [J]. GLIA, 1993, 7 (01) : 111 - 118
  • [7] Machine learning-based prediction model and visual interpretation for prostate cancer
    Gang Chen
    Xuchao Dai
    Mengqi Zhang
    Zhujun Tian
    Xueke Jin
    Kun Mei
    Hong Huang
    Zhigang Wu
    [J]. BMC Urology, 23
  • [8] Prediction of Tropical Cyclogenesis Based on Machine Learning Methods and Its SHAP Interpretation
    Loi, Chi Lok
    Wu, Chun-Chieh
    Liang, Yu-Chiao
    [J]. JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2024, 16 (03)
  • [9] Construction and Interpretation of Prediction Model of Teicoplanin Trough Concentration via Machine Learning
    Ma, Pan
    Liu, Ruixiang
    Gu, Wenrui
    Dai, Qing
    Gan, Yu
    Cen, Jing
    Shang, Shenglan
    Liu, Fang
    Chen, Yongchuan
    [J]. FRONTIERS IN MEDICINE, 2022, 9
  • [10] Machine learning-based prediction and interpretation of decomposition temperatures of energetic materials
    Wu, Jun-nan
    Song, Si-wei
    Tian, Xiao-lan
    Wang, Yi
    Qi, Xiu-juan
    [J]. ENERGETIC MATERIALS FRONTIERS, 2023, 4 (04): : 254 - 261