Interpretable machine learning in predicting drug-induced liver injury among tuberculosis patients: model development and validation study

被引:4
|
作者
Xiao, Yue [1 ]
Chen, Yanfei [1 ]
Huang, Ruijian [1 ]
Jiang, Feng [1 ]
Zhou, Jifang [1 ]
Yang, Tianchi [2 ]
机构
[1] China Pharmaceut Univ, Sch Int Pharmaceut Business, Nanjing, Jiangsu, Peoples R China
[2] Ningbo Municipal Ctr Dis Control & Prevent, Inst TB Prevent & Control, 237 Yongfeng Rd, Ningbo, Zhejiang, Peoples R China
关键词
Machine learning; Logistic regression; Tuberculosis; Drug-induced liver injury; Retrospective study; HEALTH; HEPATOTOXICITY; GUIDELINES;
D O I
10.1186/s12874-024-02214-5
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background The objective of this research was to create and validate an interpretable prediction model for drug-induced liver injury (DILI) during tuberculosis (TB) treatment.Methods A dataset of TB patients from Ningbo City was used to develop models employing the eXtreme Gradient Boosting (XGBoost), random forest (RF), and the least absolute shrinkage and selection operator (LASSO) logistic algorithms. The model's performance was evaluated through various metrics, including the area under the receiver operating characteristic curve (AUROC) and the area under the precision recall curve (AUPR) alongside the decision curve. The Shapley Additive exPlanations (SHAP) method was used to interpret the variable contributions of the superior model.Results A total of 7,071 TB patients were identified from the regional healthcare dataset. The study cohort consisted of individuals with a median age of 47 years, 68.0% of whom were male, and 16.3% developed DILI. We utilized part of the high dimensional propensity score (HDPS) method to identify relevant variables and obtained a total of 424 variables. From these, 37 variables were selected for inclusion in a logistic model using LASSO. The dataset was then split into training and validation sets according to a 7:3 ratio. In the validation dataset, the XGBoost model displayed improved overall performance, with an AUROC of 0.89, an AUPR of 0.75, an F1 score of 0.57, and a Brier score of 0.07. Both SHAP analysis and XGBoost model highlighted the contribution of baseline liver-related ailments such as DILI, drug-induced hepatitis (DIH), and fatty liver disease (FLD). Age, alanine transaminase (ALT), and total bilirubin (Tbil) were also linked to DILI status.Conclusion XGBoost demonstrates improved predictive performance compared to RF and LASSO logistic in this study. Moreover, the introduction of the SHAP method enhances the clinical understanding and potential application of the model. For further research, external validation and more detailed feature integration are necessary.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Anti-Tuberculosis Drug-Induced Liver Injury in Shanghai: Validation of Hy’s Law
    Xin Shen
    Zheng’an Yuan
    Jian Mei
    Zurong Zhang
    Juntao Guo
    Zheyuan Wu
    Jie Wu
    Haihua Zhang
    Jieping Pan
    Wenming Huang
    Huili Gong
    Dong Yuan
    Ping Xiao
    Yanqin Wang
    Yi Shuai
    Senlin Lin
    Qichao Pan
    Tong Zhou
    Paul B. Watkins
    Fan Wu
    Drug Safety, 2014, 37 : 43 - 51
  • [32] Anti-Tuberculosis Drug-Induced Liver Injury in Shanghai: Validation of Hy's Law
    Shen, Xin
    Yuan, Zheng'an
    Mei, Jian
    Zhang, Zurong
    Guo, Juntao
    Wu, Zheyuan
    Wu, Jie
    Zhang, Haihua
    Pan, Jieping
    Huang, Wenming
    Gong, Huili
    Yuan, Dong
    Xiao, Ping
    Wang, Yanqin
    Shuai, Yi
    Lin, Senlin
    Pan, Qichao
    Zhou, Tong
    Watkins, Paul B.
    Wu, Fan
    DRUG SAFETY, 2014, 37 (01) : 43 - 51
  • [33] Predicting Drug-Induced Liver Injury Using Ensemble Learning Methods and Molecular Fingerprints
    Ai, Haixin
    Chen, Wen
    Zhang, Li
    Huang, Liangchao
    Yin, Zimo
    Hu, Huan
    Zhao, Qi
    Zhao, Jian
    Liu, Hongsheng
    TOXICOLOGICAL SCIENCES, 2018, 165 (01) : 100 - 107
  • [34] Predicting drug-induced liver injury: The pharmaceutical industry perspective
    Weaver, Richard
    TOXICOLOGY LETTERS, 2014, 229 : S37 - S38
  • [35] Predicting drug-induced liver injury: The importance of data curation
    Kotsampasakou, Eleni
    Montanari, Floriane
    Ecker, Gerhard F.
    TOXICOLOGY, 2017, 389 : 139 - 145
  • [36] Approaches to the Study of Drug-Induced Liver Injury
    Fontana, R. J.
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2010, 88 (03) : 416 - 419
  • [37] Drug-Induced Liver Injury and Drug Development: Industry Perspective
    Regev, Arie
    SEMINARS IN LIVER DISEASE, 2014, 34 (02) : 227 - 239
  • [38] Development and validation of an interpretable machine learning model for predicting the risk of distant metastasis in papillary thyroid cancer: a multicenter study
    Hou, Fei
    Zhu, Yun
    Zhao, Hongbo
    Cai, Haolin
    Wang, Yinghui
    Peng, Xiaoqi
    Lu, Lin
    He, Rongli
    Hou, Yan
    Li, Zhenhui
    Chen, Ting
    ECLINICALMEDICINE, 2024, 77
  • [39] Development and Validation of a Test to Identify Drugs That Cause Idiosyncratic Drug-Induced Liver Injury
    Benesic, Andreas
    Rotter, Isabelle
    Dragoi, Diana
    Weber, Sabine
    Buchholtz, Marie-Luise
    Gerbes, Alexander L.
    CLINICAL GASTROENTEROLOGY AND HEPATOLOGY, 2018, 16 (09) : 1488 - +
  • [40] AMALPHI: A Machine Learning Platform for Predicting Drug-Induced PhospholIpidosis
    Lomuscio, Maria Cristina
    Abate, Carmen
    Alberga, Domenico
    Laghezza, Antonio
    Corriero, Nicola
    Colabufo, Nicola Antonio
    Saviano, Michele
    Delre, Pietro
    Mangiatordi, Giuseppe Felice
    MOLECULAR PHARMACEUTICS, 2023, 21 (02) : 864 - 872