Interpretable machine learning in predicting drug-induced liver injury among tuberculosis patients: model development and validation study

被引:4
|
作者
Xiao, Yue [1 ]
Chen, Yanfei [1 ]
Huang, Ruijian [1 ]
Jiang, Feng [1 ]
Zhou, Jifang [1 ]
Yang, Tianchi [2 ]
机构
[1] China Pharmaceut Univ, Sch Int Pharmaceut Business, Nanjing, Jiangsu, Peoples R China
[2] Ningbo Municipal Ctr Dis Control & Prevent, Inst TB Prevent & Control, 237 Yongfeng Rd, Ningbo, Zhejiang, Peoples R China
关键词
Machine learning; Logistic regression; Tuberculosis; Drug-induced liver injury; Retrospective study; HEALTH; HEPATOTOXICITY; GUIDELINES;
D O I
10.1186/s12874-024-02214-5
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background The objective of this research was to create and validate an interpretable prediction model for drug-induced liver injury (DILI) during tuberculosis (TB) treatment.Methods A dataset of TB patients from Ningbo City was used to develop models employing the eXtreme Gradient Boosting (XGBoost), random forest (RF), and the least absolute shrinkage and selection operator (LASSO) logistic algorithms. The model's performance was evaluated through various metrics, including the area under the receiver operating characteristic curve (AUROC) and the area under the precision recall curve (AUPR) alongside the decision curve. The Shapley Additive exPlanations (SHAP) method was used to interpret the variable contributions of the superior model.Results A total of 7,071 TB patients were identified from the regional healthcare dataset. The study cohort consisted of individuals with a median age of 47 years, 68.0% of whom were male, and 16.3% developed DILI. We utilized part of the high dimensional propensity score (HDPS) method to identify relevant variables and obtained a total of 424 variables. From these, 37 variables were selected for inclusion in a logistic model using LASSO. The dataset was then split into training and validation sets according to a 7:3 ratio. In the validation dataset, the XGBoost model displayed improved overall performance, with an AUROC of 0.89, an AUPR of 0.75, an F1 score of 0.57, and a Brier score of 0.07. Both SHAP analysis and XGBoost model highlighted the contribution of baseline liver-related ailments such as DILI, drug-induced hepatitis (DIH), and fatty liver disease (FLD). Age, alanine transaminase (ALT), and total bilirubin (Tbil) were also linked to DILI status.Conclusion XGBoost demonstrates improved predictive performance compared to RF and LASSO logistic in this study. Moreover, the introduction of the SHAP method enhances the clinical understanding and potential application of the model. For further research, external validation and more detailed feature integration are necessary.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Predicting Antituberculosis Drug-Induced Liver Injury Using an Interpretable Machine Learning Method: Model Development and Validation Study
    Zhong, Tao
    Zhuang, Zian
    Dong, Xiaoli
    Wong, Ka Hing
    Wong, Wing Tak
    Wang, Jian
    He, Daihai
    Liu, Shengyuan
    JMIR MEDICAL INFORMATICS, 2021, 9 (07)
  • [2] Predicting Antituberculosis Drug-Induced Liver Injury Using an Interpretable Machine Learning Method: Model Development and Validation Study (vol 9, e29226, 2021)
    Zhong, Tao
    Zhuang, Zian
    Dong, Xiaoli
    Wong, Ka Hing
    Wong, Wing Tak
    Wang, Jian
    He, Daihai
    Liu, Shengyuan
    JMIR MEDICAL INFORMATICS, 2021, 9 (08)
  • [3] Predicting Drug-Induced Liver Injury with Bayesian Machine Learning
    Williams, Dominic P.
    Lazic, Stanley E.
    Foster, Alison J.
    Semenova, Elizaveta
    Morgan, Paul
    CHEMICAL RESEARCH IN TOXICOLOGY, 2020, 33 (01) : 239 - 248
  • [4] A NOVEL MACHINE LEARNING MODEL FOR DISTINGUISHING DRUG-INDUCED LIVER INJURY FROM AUTOIMMUNE HEPATITIS: DEVELOPMENT AND VALIDATION
    Wang, Yu
    HEPATOLOGY, 2024, 80 : S944 - S944
  • [5] Machine Learning to Predict Drug-Induced Liver Injury and Its Validation on Failed Drug Candidates in Development
    Mostafa, Fahad
    Howle, Victoria
    Chen, Minjun
    TOXICS, 2024, 12 (06)
  • [6] Comparing Machine Learning Algorithms for Predicting Drug-Induced Liver Injury (DILI)
    Minerali, Eni
    Foil, Daniel H.
    Zorn, Kimberley M.
    Lane, Thomas R.
    Ekins, Sean
    MOLECULAR PHARMACEUTICS, 2020, 17 (07) : 2628 - 2637
  • [7] Predictability of drug-induced liver injury by machine learning
    Chierici, Marco
    Francescatto, Margherita
    Bussola, Nicole
    Jurman, Giuseppe
    Furlanello, Cesare
    BIOLOGY DIRECT, 2020, 15 (01)
  • [8] Predictability of drug-induced liver injury by machine learning
    Marco Chierici
    Margherita Francescatto
    Nicole Bussola
    Giuseppe Jurman
    Cesare Furlanello
    Biology Direct, 15
  • [9] Predicting Drug-Induced Liver Injury Using Machine Learning on a Diverse Set of Predictors
    Adeluwa, Temidayo
    McGregor, Brett A.
    Guo, Kai
    Hur, Junguk
    FRONTIERS IN PHARMACOLOGY, 2021, 12
  • [10] Risk score of drug-induced liver injury among new tuberculosis patients
    Ivanova, Diana
    Borisov, Sergey
    EUROPEAN RESPIRATORY JOURNAL, 2018, 52