Using Item Response Theory for Explainable Machine Learning in Predicting Mortality in the Intensive Care Unit: Case-Based Approach

被引：5

作者：

Kline, Adrienne ^{[1
,2
,3
]}

Kline, Theresa ^{[4
]}

Abad, Zahra Shakeri Hossein ^{[3
,5
]}

Lee, Joon ^{[3
,5
,6
]}

机构：

[1] Univ Calgary, Dept Biomed Engn, 2500 Univ Dr NW, Calgary, AB, Canada

[2] Univ Calgary, Cumming Sch Med, Undergrad Med Educ, Calgary, AB, Canada

[3] Univ Calgary, Cumming Sch Med, Data Intelligence Hlth Lab, Calgary, AB, Canada

[4] Univ Calgary, Dept Psychol, Calgary, AB, Canada

[5] Univ Calgary, Cumming Sch Med, Dept Community Hlth Sci, Calgary, AB, Canada

[6] Univ Calgary, Cumming Sch Med, Dept Cardiac Sci, Calgary, AB, Canada

来源：

JOURNAL OF MEDICAL INTERNET RESEARCH | 2020年 / 22卷 / 09期

关键词：

item response theory; machine learning; statistical model; mortality; COMA;

D O I：

10.2196/20268

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Background: Supervised machine learning (ML) is being featured in the health care literature with study results frequently reported using metrics such as accuracy, sensitivity, specificity, recall, or F1 score. Although each metric provides a different perspective on the performance, they remain to be overall measures for the whole sample, discounting the uniqueness of each case or patient. Intuitively, we know that all cases are not equal, but the present evaluative approaches do not take case difficulty into account. Objective: A more case-based, comprehensive approach is warranted to assess supervised ML outcomes and forms the rationale for this study. This study aims to demonstrate how the item response theory (IRT) can be used to stratify the data based on how difficult each case is to classify, independent of the outcome measure of interest (eg, accuracy). This stratification allows the evaluation of ML classifiers to take the form of a distribution rather than a single scalar value. Methods: Two large, public intensive care unit data sets, Medical Information Mart for Intensive Care III and electronic intensive care unit, were used to showcase this method in predicting mortality. For each data set, a balanced sample (n=8078 and n=21,940, respectively) and an imbalanced sample (n=12,117 and n=32,910, respectively) were drawn. A 2-parameter logistic model was used to provide scores for each case. Several ML algorithms were used in the demonstration to classify cases based on their health-related features: logistic regression, linear discriminant analysis, K-nearest neighbors, decision tree, naive Bayes, and a neural network. Generalized linear mixed model analyses were used to assess the effects of case difficulty strata, ML algorithm, and the interaction between them in predicting accuracy. Results: The results showed significant effects (P<.001) for case difficulty strata, ML algorithm, and their interaction in predicting accuracy and illustrated that all classifiers performed better with easier-to-classify cases and that overall the neural network performed best. Significant interactions suggest that cases that fall in the most arduous strata should be handled by logistic regression, linear discriminant analysis, decision tree, or neural network but not by naive Bayes or K-nearest neighbors. Conventional metrics for ML classification have been reported for methodological comparison. Conclusions: This demonstration shows that using the IRT is a viable method for understanding the data that are provided to ML algorithms, independent of outcome measures, and highlights how well classifiers differentiate cases of varying difficulty. This method explains which features are indicative of healthy states and why. It enables end users to tailor the classifier that is appropriate to the difficulty level of the patient for personalized medicine.

引用

页数：21

共 50 条

[41] Predicting mortality in intensive care unit survivors using a subjective scoring system
Afessa, Bekele
Keegan, Mark T.
[J]. CRITICAL CARE, 2007, 11 (01):
[42] Predicting Mortality Rate based on Comprehensive Features of Intensive Care Unit Patients
Danda, Jagan Moahan Reddy
Priyansh, Kumar
Shahriar, Hossain
Haddad, Hisham
Cuzzocrea, Alfredo
Sakib, Nazmus
[J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1402 - 1407
[43] Predicting Hospital Length of Stay After Intensive Care Unit Discharge with Machine Learning
Rojas, J. C.
Venable, L. R.
Fahrenbach, J. P.
Carey, K. A.
Edelson, D. P.
Howell, M. D.
Churpek, M. M.
[J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2018, 197
[44] New model averaging approach in predicting mortality rate of intensive care unit patients
Padzil, Siti Aisyah Mohd
Pillay, Khuneswari Gopal
Rusiman, Mohd Saifullah
Salleh, Rohayu Mohd
[J]. 2ND INTERNATIONAL CONFERENCE ON APPLIED & INDUSTRIAL MATHEMATICS AND STATISTICS, 2019, 1366
[45] Maintaining case-based reasoning systems: A machine learning approach
Arshadi, N
Jurisica, I
[J]. ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2004, 3155 : 17 - 31
[46] Predicting the Need For Vasopressors in the Intensive Care Unit Using an Attention Based Deep Learning Model
Kwak, Gloria Hyunjung
Ling, Lowell
Hui, Pan
[J]. SHOCK, 2021, 56 (01): : 73 - 79
[47] Machine learning models to evaluate mortality in pediatric patients with pneumonia in the intensive care unit
Lin, Siang-Rong
Wu, Jeng-Hung
Liu, Yun-Chung
Chiu, Pei-Hsin
Chang, Tu-Hsuan
Wu, En-Ting
Chou, Chia-Ching
Chang, Luan-Yin
Lai, Fei-Pei
[J]. PEDIATRIC PULMONOLOGY, 2024, 59 (05) : 1256 - 1265
[48] Machine Learning Prediction Models for Mortality in Intensive Care Unit Patients with Lactic Acidosis
Pattharanitima, Pattharawin
Thongprayoon, Charat
Kaewput, Wisit
Qureshi, Fawad
Qureshi, Fahad
Petnak, Tananchai
Srivali, Narat
Gembillo, Guido
O'Corragain, Oisin A.
Chesdachai, Supavit
Vallabhajosyula, Saraschandra
Guru, Pramod K.
Mao, Michael A.
Garovic, Vesna D.
Dillon, John J.
Cheungpasitporn, Wisit
[J]. JOURNAL OF CLINICAL MEDICINE, 2021, 10 (21)
[49] Inter operator variability of machine learning researchers predicting all-cause mortality in patients admitted to intensive care unit
Jones, Y.
Cleland, J.
Li, C.
Pellicori, P.
Friday, J.
[J]. EUROPEAN HEART JOURNAL, 2021, 42 : 3052 - 3052
[50] Predicting the drift capacity of precast concrete columns using explainable machine learning approach
Wang, Zhen
Liu, Tongxu
Long, Zilin
Wang, Jingquan
Zhang, Jian
[J]. ENGINEERING STRUCTURES, 2023, 282

← 1 2 3 4 5 →