Using Item Response Theory for Explainable Machine Learning in Predicting Mortality in the Intensive Care Unit: Case-Based Approach

被引:5
|
作者
Kline, Adrienne [1 ,2 ,3 ]
Kline, Theresa [4 ]
Abad, Zahra Shakeri Hossein [3 ,5 ]
Lee, Joon [3 ,5 ,6 ]
机构
[1] Univ Calgary, Dept Biomed Engn, 2500 Univ Dr NW, Calgary, AB, Canada
[2] Univ Calgary, Cumming Sch Med, Undergrad Med Educ, Calgary, AB, Canada
[3] Univ Calgary, Cumming Sch Med, Data Intelligence Hlth Lab, Calgary, AB, Canada
[4] Univ Calgary, Dept Psychol, Calgary, AB, Canada
[5] Univ Calgary, Cumming Sch Med, Dept Community Hlth Sci, Calgary, AB, Canada
[6] Univ Calgary, Cumming Sch Med, Dept Cardiac Sci, Calgary, AB, Canada
关键词
item response theory; machine learning; statistical model; mortality; COMA;
D O I
10.2196/20268
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Supervised machine learning (ML) is being featured in the health care literature with study results frequently reported using metrics such as accuracy, sensitivity, specificity, recall, or F1 score. Although each metric provides a different perspective on the performance, they remain to be overall measures for the whole sample, discounting the uniqueness of each case or patient. Intuitively, we know that all cases are not equal, but the present evaluative approaches do not take case difficulty into account. Objective: A more case-based, comprehensive approach is warranted to assess supervised ML outcomes and forms the rationale for this study. This study aims to demonstrate how the item response theory (IRT) can be used to stratify the data based on how difficult each case is to classify, independent of the outcome measure of interest (eg, accuracy). This stratification allows the evaluation of ML classifiers to take the form of a distribution rather than a single scalar value. Methods: Two large, public intensive care unit data sets, Medical Information Mart for Intensive Care III and electronic intensive care unit, were used to showcase this method in predicting mortality. For each data set, a balanced sample (n=8078 and n=21,940, respectively) and an imbalanced sample (n=12,117 and n=32,910, respectively) were drawn. A 2-parameter logistic model was used to provide scores for each case. Several ML algorithms were used in the demonstration to classify cases based on their health-related features: logistic regression, linear discriminant analysis, K-nearest neighbors, decision tree, naive Bayes, and a neural network. Generalized linear mixed model analyses were used to assess the effects of case difficulty strata, ML algorithm, and the interaction between them in predicting accuracy. Results: The results showed significant effects (P<.001) for case difficulty strata, ML algorithm, and their interaction in predicting accuracy and illustrated that all classifiers performed better with easier-to-classify cases and that overall the neural network performed best. Significant interactions suggest that cases that fall in the most arduous strata should be handled by logistic regression, linear discriminant analysis, decision tree, or neural network but not by naive Bayes or K-nearest neighbors. Conventional metrics for ML classification have been reported for methodological comparison. Conclusions: This demonstration shows that using the IRT is a viable method for understanding the data that are provided to ML algorithms, independent of outcome measures, and highlights how well classifiers differentiate cases of varying difficulty. This method explains which features are indicative of healthy states and why. It enables end users to tailor the classifier that is appropriate to the difficulty level of the patient for personalized medicine.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Explainable Machine Learning Model for Predicting GI Bleed Mortality in the Intensive Care Unit
    Deshmukh, Farah
    Merchant, Shamel S.
    [J]. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2020, 115 (10): : 1657 - 1668
  • [2] Using Explainable Machine Learning to Improve Intensive Care Unit Alarm Systems
    Gonzalez-Novoa, Jose A.
    Busto, Laura
    Rodriguez-Andina, Juan J.
    Farina, Jose
    Segura, Marta
    Gomez, Vanesa
    Vila, Dolores
    Veiga, Cesar
    [J]. SENSORS, 2021, 21 (21)
  • [3] Prediction of in-hospital Mortality of Intensive Care Unit Patients with Acute Pancreatitis Based on an Explainable Machine Learning Algorithm
    Ren, Wensen
    Zou, Kang
    Huang, Shu
    Xu, Huan
    Zhang, Wei
    Shi, Xiaomin
    Shi, Lei
    Zhong, Xiaolin
    Peng, Yan
    Tang, Xiaowei
    Lu, Muhan
    [J]. JOURNAL OF CLINICAL GASTROENTEROLOGY, 2024, 58 (06) : 619 - 626
  • [4] Predicting mortality in the intensive care unit: Man against machine
    Shapiro, NI
    Talmor, D
    [J]. CRITICAL CARE MEDICINE, 2006, 34 (03) : 932 - 933
  • [5] PREDICTING CARDIAC ARREST IN THE PEDIATRIC INTENSIVE CARE UNIT USING MACHINE LEARNING
    Kenet, Adam
    Pemmaraju, Rahul
    Ghate, Sejal
    Raghunath, Shreeya
    Zhang, Yifan
    Yuan, Mordred
    Wei, Tony
    Desman, Jacob
    Greenstein, Joseph
    Taylor, Casey
    Ruchti, Timothy
    Fackler, Jim
    Bergmann, Jules
    [J]. CRITICAL CARE MEDICINE, 2023, 51 (01) : 30 - 30
  • [6] Predicting Decompensation Risk in Intensive Care Unit Patients Using Machine Learning
    Aikodon, Nosa
    Ortega-Martorell, Sandra
    Olier, Ivan
    [J]. ALGORITHMS, 2024, 17 (01)
  • [7] A New Risk Model based on the Machine Learning Approach for Prediction of Mortality in the Respiratory Intensive Care Unit
    Yan, Peng
    Huang, Siwan
    Li, Ye
    Chen, Tiange
    Li, Xiang
    Zhang, Yuan
    Wu, Huan
    Xu, Jianqiao
    Xie, Guotong
    Xie, Lixin
    Mo, Guoxin
    [J]. CURRENT PHARMACEUTICAL BIOTECHNOLOGY, 2023, 24 (13) : 1673 - 1681
  • [8] Predicting Length of Stay for Cardiovascular Hospitalizations in the Intensive Care Unit: Machine Learning Approach
    Alsinglawi, Belal
    Alnajjar, Fady
    Mubin, Omar
    Novoa, Mauricio
    Alorjani, Mohammed
    Karajeh, Ola
    Darwish, Omar
    [J]. 42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 5442 - 5445
  • [9] Reduction of false alarms in the intensive care unit using an optimized machine learning based approach
    Au-Yeung, Wan-Tai M.
    Sahani, Ashish K.
    Isselbacher, Eric M.
    Armoundas, Antonis A.
    [J]. NPJ DIGITAL MEDICINE, 2019, 2 (1)
  • [10] Reduction of false alarms in the intensive care unit using an optimized machine learning based approach
    Wan-Tai M. Au-Yeung
    Ashish K. Sahani
    Eric M. Isselbacher
    Antonis A. Armoundas
    [J]. npj Digital Medicine, 2