A Hybrid Machine Learning Approach for Improving Mortality Risk Prediction on Imbalanced Data

被引:1
|
作者
Tashkandi, Araek [1 ,3 ]
Wiese, Lena [2 ]
机构
[1] Georg August Univ Goettingen, Inst Comp Sci, Gottingen, Germany
[2] Leibniz Univ Hannover, L3S Res Ctr, Knowledge Based Syst Grp, Hannover, Germany
[3] Univ Jeddah, Coll Comp Sci & Engn, Jeddah, Saudi Arabia
关键词
Machine Learning; Imbalanced Data; Risk of Mortality; Gradient Boosting Decision Tree; Under-sampling; Decision Support System;
D O I
10.1145/3366030.3366040
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The efficiency of Machine Learning (ML) models has widely been acknowledged in the healthcare area. However, the quality of the underlying medical data is a major challenge when applying ML in medical decision making. In particular, the imbalanced class distribution problem causes the ML model to be biased towards the majority class. Furthermore, the accuracy will be biased, too, which produces the Accuracy Paradox. In this paper, we identify an optimal ML model for predicting mortality risk for Intensive Care Units (ICU) patients. We comprehensively assess an approach that leverages the efficiency of ML ensemble learning (in particular, Gradient Boosting Decision Tree) and clustering-based data sampling to handle the imbalanced data problem that this model faces. We comprehensively compare different competitors (in terms of ML models as well as clustering methods) on a big real-world ICU dataset achieving a maximum area under the curve value of 0.956.
引用
收藏
页码:83 / 92
页数:10
相关论文
共 50 条
  • [1] Machine Learning and Synthetic Minority Oversampling Techniques for Imbalanced Data: Improving Machine Failure Prediction
    Wah, Yap Bee
    Ismail, Azlan
    Azid, Nur Niswah Naslina
    Jaafar, Jafreezal
    Aziz, Izzatdin Abdul
    Hasan, Mohd Hilmi
    Zain, Jasni Mohamad
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 4821 - 4841
  • [2] A hybrid machine learning approach for hypertension risk prediction
    Fang, Min
    Chen, Yingru
    Xue, Rui
    Wang, Huihui
    Chakraborty, Nilesh
    Su, Ting
    Dai, Yuyan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (20): : 14487 - 14497
  • [3] A hybrid machine learning approach for hypertension risk prediction
    Min Fang
    Yingru Chen
    Rui Xue
    Huihui Wang
    Nilesh Chakraborty
    Ting Su
    Yuyan Dai
    [J]. Neural Computing and Applications, 2023, 35 : 14487 - 14497
  • [4] Machine Learning on Imbalanced Data in Credit Risk
    Birla, Shiivong
    Kohli, Kashish
    Dutta, Akash
    [J]. 7TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE IEEE IEMCON-2016, 2016,
  • [5] Improving mortality prediction in Acute Pancreatitis by machine learning and data augmentation
    Bin Hameed, M. Asad
    Alamgir, Zareen
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [6] A hybrid machine learning approach to cerebral stroke prediction based on imbalanced medical dataset
    Liu, Tianyu
    Fan, Wenhui
    Wu, Cheng
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 101
  • [7] A hybrid machine learning approach for early mortality prediction of ICU patients
    Mansouri, Ardeshir
    Noei, Mohammadreza
    Abadeh, Mohammad Saniee
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2022, 11 (04) : 333 - 347
  • [8] A hybrid machine learning approach for early mortality prediction of ICU patients
    Ardeshir Mansouri
    Mohammadreza Noei
    Mohammad Saniee Abadeh
    [J]. Progress in Artificial Intelligence, 2022, 11 : 333 - 347
  • [9] Neonatal mortality prediction with routinely collected data: a machine learning approach
    André F. M. Batista
    Carmen S. G. Diniz
    Eliana A. Bonilha
    Ichiro Kawachi
    Alexandre D. P. Chiavegatto Filho
    [J]. BMC Pediatrics, 21
  • [10] Neonatal mortality prediction with routinely collected data: a machine learning approach
    Batista, Andre F. M.
    Diniz, Carmen S. G.
    Bonilha, Eliana A.
    Kawachi, Ichiro
    Chiavegatto Filho, Alexandre D. P.
    [J]. BMC PEDIATRICS, 2021, 21 (01)