Understanding Arteriosclerotic Heart Disease Patients Using Electronic Health Records: A Machine Learning and Shapley Additive exPlanations Approach

被引:3
|
作者
Miranda, Eka [1 ]
Adiarto, Suko [2 ]
Bhatti, Faqir M. [3 ]
Zakiyyah, Alfi Yusrotis [4 ]
Aryuni, Mediana [1 ]
Bernando, Charles [1 ]
机构
[1] Bina Nusantara Univ, Sch Informat Syst, Dept Informat Syst, Jakarta 11480, Indonesia
[2] Univ Indonesia, Fac Med, Natl Cardiovasc Ctr Harapan Kita, Dept Cardiol & Vasc Med, Jakarta, Indonesia
[3] Riphah Int Univ, Riphah Inst Comp & Appl Sci, Lahore, Pakistan
[4] Bina Nusantara Univ, Sch Comp Sci, Math Dept, Jakarta, Indonesia
关键词
Machine Learning; Coronary Artery Disease; Hematology; Supervised Machine Learning; PREDICTION;
D O I
10.4258/hir.2023.29.3.228
中图分类号
R-058 [];
学科分类号
摘要
Objectives: The number of deaths from cardiovascular disease is projected to reach 23.3 million by 2030. As a contribution to preventing this phenomenon, this paper proposed a machine learning (ML) model to predict patients with arteriosclerotic heart disease (AHD). We also interpreted the prediction model results based on the ML approach and deployed modelagnostic ML methods to identify informative features and their interpretations. Methods: We used a hematology Electronic Health Record (EHR) with information on erythrocytes, hematocrit, hemoglobin, mean corpuscular hemoglobin, mean corpuscular hemoglobin concentration, leukocytes, thrombocytes, age, and sex. To detect and predict AHD, we explored random forest (RF), XGBoost, and AdaBoost models. We examined the prediction model results based on the confusion matrix and accuracy measures. We used the Shapley Additive exPlanations (SHAP) framework to interpret the ML model and quantify the contribution of features to predictions. Results: Our study included data from 6,837 patients, with 4,702 records from patients diagnosed with AHD and 2,135 records from patients without an AHD diagnosis. AdaBoost outperformed RF and XGBoost, achieving an accuracy of 0.78, precision of 0.82, F1-score of 0.85, and recall of 0.88. According to the SHAP summary bar plot method, hemoglobin was the most important attribute for detecting and predicting AHD patients. The SHAP local interpretability bar plot revealed that hemoglobin and mean corpuscular hemoglobin concentration had positive impacts on AHD prediction based on a single observation. Conclusions: ML models based on real clinical data can be used to predict AHD.
引用
收藏
页码:228 / 238
页数:11
相关论文
共 50 条
  • [1] LEADING PREDICTORS OF INCIDENT HYPERTENSION AMONG PATIENTS WITH CANCER IN COMMUNITY HEALTH CENTERS: A MACHINE LEARNING APPROACH WITH SHAPLEY ADDITIVE EXPLANATIONS USING ELECTRONIC HEALTH RECORDS
    Park, C.
    Han, S.
    Sambamoorthi, U.
    [J]. VALUE IN HEALTH, 2024, 27 (06) : S17 - S17
  • [2] Bankruptcy prediction using machine learning and Shapley additive explanations
    Nguyen, Hoang Hiep
    Viviani, Jean-Laurent
    Ben Jabeur, Sami
    [J]. REVIEW OF QUANTITATIVE FINANCE AND ACCOUNTING, 2023,
  • [3] Interpretable Machine Learning in Damage Detection Using Shapley Additive Explanations
    Movsessian, Artur
    Cava, David Garcia
    Tcherniak, Dmitri
    [J]. ASCE-ASME JOURNAL OF RISK AND UNCERTAINTY IN ENGINEERING SYSTEMS PART B-MECHANICAL ENGINEERING, 2022, 8 (02):
  • [4] Estimation of Bone Mineral Density using Machine Learning and SHapley Additive exPlanations
    Bezerra, Gabriel M.
    Ohata, Elene F.
    Loureiro, Luiz L.
    Bittencourt, Victor Z.
    Capistrano Junior, Valden L. M.
    da Rochat, Atslands R.
    Reboucas Filho, Pedro P.
    [J]. 2024 IEEE 37TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS 2024, 2024, : 424 - 429
  • [5] Influence of metakaolin on pervious concrete strength: a machine learning approach with shapley additive explanations
    Sathiparan, Navaratnarajah
    Jeyananthan, Pratheeba
    Subramaniam, Daniel Niruban
    [J]. MULTISCALE AND MULTIDISCIPLINARY MODELING EXPERIMENTS AND DESIGN, 2024, 7 (04) : 3919 - 3946
  • [6] Electricity Consumption Forecasting: An Approach Using Cooperative Ensemble Learning with SHapley Additive exPlanations
    Alba, Eduardo Luiz
    Oliveira, Gilson Adamczuk
    Ribeiro, Matheus Henrique Dal Molin
    Rodrigues, erick Oliveira
    [J]. FORECASTING, 2024, 6 (03): : 839 - 863
  • [7] Landslide Modeling in a Tropical Mountain Basin Using Machine Learning Algorithms and Shapley Additive Explanations
    Vega, Johnny
    Sepulveda-Murillo, Fabio Humberto
    Parra, Melissa
    [J]. AIR SOIL AND WATER RESEARCH, 2023, 16
  • [8] Machine learning-based Shapley additive explanations approach for corroded pipeline failure mode identification
    Ben Seghier, Mohamed El Amine
    Mohamed, Osama Ahmed
    Ouaer, Hocine
    [J]. STRUCTURES, 2024, 65
  • [9] Comparison of Explainable Machine-Learning Models for Decision-Making in Health Intensive Care Using SHapley Additive exPlanations
    Vidal, Igor Pereira
    Pereira, Marluce Rodrigues
    Freire, Andre Pimenta
    Resende, Uanderson
    Maziero, Erick Galani
    [J]. PROCEEDINGS OF THE 19TH BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, 2023, : 300 - 307
  • [10] Predicting Cardiovascular Disease in Psychiatric Patients: Machine Learning with Electronic Health Records
    Bernstorff, M.
    Danielsen, A.
    Dinesen, S.
    [J]. EUROPEAN PSYCHIATRY, 2022, 65 : S678 - S678