Prediction of sepsis mortality in ICU patients using machine learning methods

被引:0
|
作者
Gao, Jiayi [1 ]
Lu, Yuying [1 ]
Ashrafi, Negin [1 ]
Domingo, Ian [2 ]
Alaei, Kamiar [3 ]
Pishgar, Maryam [1 ]
机构
[1] Univ Southern Calif, Dept Ind Syst Engn, 3715 McClintock Ave, Los Angeles, CA 90089 USA
[2] Univ Calif Irvine, Dept Informat & Comp Sci, Inner Ring Rd, Irvine, CA 92697 USA
[3] Calif State Univ Long Beach, Dept Hlth Sci, 1250 Bellflower Blvd,HHS2-117, Long Beach, CA 90840 USA
关键词
Sepsis; Random Forest; Machine learning; MIMIC-IV; ICU mortality;
D O I
10.1186/s12911-024-02630-z
中图分类号
R-058 [];
学科分类号
摘要
ProblemSepsis, a life-threatening condition, accounts for the deaths of millions of people worldwide. Accurate prediction of sepsis outcomes is crucial for effective treatment and management. Previous studies have utilized machine learning for prognosis, but have limitations in feature sets and model interpretability.AimThis study aims to develop a machine learning model that enhances prediction accuracy for sepsis outcomes using a reduced set of features, thereby addressing the limitations of previous studies and enhancing model interpretability.MethodsThis study analyzes intensive care patient outcomes using the MIMIC-IV database, focusing on adult sepsis cases. Employing the latest data extraction tools, such as Google BigQuery, and following stringent selection criteria, we selected 38 features in this study. This selection is also informed by a comprehensive literature review and clinical expertise. Data preprocessing included handling missing values, regrouping categorical variables, and using the Synthetic Minority Over-sampling Technique (SMOTE) to balance the data. We evaluated several machine learning models: Decision Trees, Gradient Boosting, XGBoost, LightGBM, Multilayer Perceptrons (MLP), Support Vector Machines (SVM), and Random Forest. The Sequential Halving and Classification (SHAC) algorithm was used for hyperparameter tuning, and both train-test split and cross-validation methodologies were employed for performance and computational efficiency.ResultsThe Random Forest model was the most effective, achieving an area under the receiver operating characteristic curve (AUROC) of 0.94 with a confidence interval of +/- 0.01. This significantly outperformed other models and set a new benchmark in the literature. The model also provided detailed insights into the importance of various clinical features, with the Sequential Organ Failure Assessment (SOFA) score and average urine output being highly predictive. SHAP (Shapley Additive Explanations) analysis further enhanced the model's interpretability, offering a clearer understanding of feature impacts.ConclusionThis study demonstrates significant improvements in predicting sepsis outcomes using a Random Forest model, supported by advanced machine learning techniques and thorough data preprocessing. Our approach provided detailed insights into the key clinical features impacting sepsis mortality, making the model both highly accurate and interpretable. By enhancing the model's practical utility in clinical settings, we offer a valuable tool for healthcare professionals to make data-driven decisions, ultimately aiming to minimize sepsis-induced fatalities.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
    Kong, Guilan
    Lin, Ke
    Hu, Yonghua
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
  • [2] Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
    Guilan Kong
    Ke Lin
    Yonghua Hu
    [J]. BMC Medical Informatics and Decision Making, 20
  • [3] Mortality Prediction in ICU Patients Using Machine Learning Models
    Ahmad, Fawad
    Ayub, Huma
    Liaqat, Rehan
    Khan, Akhyar Ali
    Nawaz, Ali
    Younis, Babar
    [J]. PROCEEDINGS OF 2021 INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGIES (IBCAST), 2021, : 372 - 376
  • [4] Using Machine Learning Methods to Predict the Lactate Trend of Sepsis Patients in the ICU
    Arslantas, Mustafa Kemal
    Asuroglu, Tunc
    Arslantas, Reyhan
    Pashazade, Emin
    Dincer, Pelin Corman
    Altun, Gulbin Tore
    Kararmaz, Alper
    [J]. DIGITAL HEALTH AND WIRELESS SOLUTIONS, PT II, NCDHWS 2024, 2024, 2084 : 3 - 16
  • [5] A Machine Learning Model for Accurate Prediction of Sepsis in ICU Patients
    Wang, Dong
    Li, Jinbo
    Sun, Yali
    Ding, Xianfei
    Zhang, Xiaojuan
    Liu, Shaohua
    Han, Bing
    Wang, Haixu
    Duan, Xiaoguang
    Sun, Tongwen
    [J]. FRONTIERS IN PUBLIC HEALTH, 2021, 9
  • [6] Development and validation of a novel blending machine learning model for hospital mortality prediction in ICU patients with Sepsis
    Zeng, Zhixuan
    Yao, Shuo
    Zheng, Jianfei
    Gong, Xun
    [J]. BIODATA MINING, 2021, 14 (01)
  • [7] Development and validation of a novel blending machine learning model for hospital mortality prediction in ICU patients with Sepsis
    Zhixuan Zeng
    Shuo Yao
    Jianfei Zheng
    Xun Gong
    [J]. BioData Mining, 14
  • [8] Mortality Prediction for ICU Patients Using Just-in-time Learning and Extreme Learning Machine
    Ding, Yangyang
    Li, Xuejian
    Wang, Youqing
    [J]. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 939 - 944
  • [9] Machine-learning models for prediction of sepsis patients mortality
    Bao, C.
    Deng, F.
    Zhao, S.
    [J]. MEDICINA INTENSIVA, 2023, 47 (06) : 315 - 325
  • [10] Early Prediction of Sepsis in the ICU Using Machine Learning: A Systematic Review
    Moor, Michael
    Rieck, Bastian
    Horn, Max
    Jutzeler, Catherine R.
    Borgwardt, Karsten
    [J]. FRONTIERS IN MEDICINE, 2021, 8