Comparing Machine Learning to Regression Methods for Mortality Prediction Using Veterans Affairs Electronic Health Record Clinical Data

被引:7
|
作者
Jing, Bocheng [1 ,2 ,3 ]
Boscardin, W. John [1 ,3 ,4 ]
Deardorff, W. James [3 ]
Jeon, Sun Young [1 ,3 ]
Lee, Alexandra K. [1 ,3 ]
Donovan, Anne L. [5 ]
Lee, Sei J. [1 ,3 ]
机构
[1] San Francisco VA Hlth Care Syst, San Francisco, CA USA
[2] Northern Calif Inst Res & Educ, San Francisco, CA USA
[3] Univ Calif San Francisco, Div Geriatr, San Francisco, CA USA
[4] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
[5] Univ Calif San Francisco, Dept Anesthesia & Perioperat Med, San Francisco, CA USA
关键词
prediction; machine learning; electronic health record; Veterans Affairs; REGULARIZATION PATHS; LOGISTIC-REGRESSION; PERFORMANCE; MODELS; RISK; PREVENTION; DIAGNOSIS; SELECTION; SURVIVAL;
D O I
10.1097/MLR.0000000000001720
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: It is unclear whether machine learning methods yield more accurate electronic health record (EHR) prediction models compared with traditional regression methods. Objective: The objective of this study was to compare machine learning and traditional regression models for 10-year mortality prediction using EHR data. Design: This was a cohort study. Setting: Veterans Affairs (VA) EHR data. Participants: Veterans age above 50 with a primary care visit in 2005, divided into separate training and testing cohorts (n= 124,360 each). Measurements and Analytic Methods: The primary outcome was 10-year all-cause mortality. We considered 924 potential predictors across a wide range of EHR data elements including demographics (3), vital signs (9), medication classes (399), disease diagnoses (293), laboratory results (71), and health care utilization (149). We compared discrimination (c-statistics), calibration metrics, and diagnostic test characteristics (sensitivity, specificity, and positive and negative predictive values) of machine learning and regression models. Results: Our cohort mean age (SD) was 68.2 (10.5), 93.9% were male; 39.4% died within 10 years. Models yielded testing cohort c-statistics between 0.827 and 0.837. Utilizing all 924 predictors, the Gradient Boosting model yielded the highest c-statistic [0.837, 95% confidence interval (CI): 0.835-0.839]. The full (unselected) logistic regression model had the highest c-statistic of regression models (0.833, 95% CI: 0.830-0.835) but showed evidence of overfitting. The discrimination of the stepwise selection logistic model (101 predictors) was similar (0.832, 95% CI: 0.830-0.834) with minimal overfitting. All models were well-calibrated and had similar diagnostic test characteristics. Limitation: Our results should be confirmed in non-VA EHRs. Conclusion: The differences in c-statistic between the best machine learning model (924-predictor Gradient Boosting) and 101-predictor stepwise logistic models for 10-year mortality prediction were modest, suggesting stepwise regression methods continue to be a reasonable method for VA EHR mortality prediction model development.
引用
收藏
页码:470 / 479
页数:10
相关论文
共 50 条
  • [1] Development of a Hypoglycemia Prediction Model for Veterans With Diabetes Using Supervised Machine Learning Applied to Electronic Health Record Data
    Raghavan, Sridharan
    Liu, Wenhui
    Baron, Anna
    Saxon, David
    Plomondon, Meg
    Ho, Michael
    Caplan, Liron
    [J]. CIRCULATION, 2020, 141
  • [2] Prediction of Acute Kidney Injury in the Emergency Department Using Electronic Health Record Data and Machine Learning Methods
    Hinson, J. S.
    Martinez, D. A.
    Grams, M. S.
    Levin, S.
    [J]. ANNALS OF EMERGENCY MEDICINE, 2018, 72 (04) : S154 - S154
  • [3] Preoperative Prediction of Postoperative Infections Using Machine Learning and Electronic Health Record Data
    Zhuang, Yaxu
    Dyas, Adam
    Meguid, Robert A.
    Henderson, William G.
    Bronsert, Michael
    Madsen, Helen
    Colborn, Kathryn L.
    [J]. ANNALS OF SURGERY, 2024, 279 (04) : 720 - 726
  • [4] Postoperative delirium prediction using machine learning models and preoperative electronic health record data
    Andrew Bishara
    Catherine Chiu
    Elizabeth L. Whitlock
    Vanja C. Douglas
    Sei Lee
    Atul J. Butte
    Jacqueline M. Leung
    Anne L. Donovan
    [J]. BMC Anesthesiology, 22
  • [5] Prediction of Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
    Ward, Andrew
    Sarraju, Ashish
    Chung, Sukyung
    Palaniappan, Latha
    Scheinker, David
    Rodriguez, Fatima
    [J]. CIRCULATION, 2019, 140
  • [6] Postoperative delirium prediction using machine learning models and preoperative electronic health record data
    Bishara, Andrew
    Chiu, Catherine
    Whitlock, Elizabeth L.
    Douglas, Vanja C.
    Lee, Sei
    Butte, Atul J.
    Leung, Jacqueline M.
    Donovan, Anne L.
    [J]. BMC ANESTHESIOLOGY, 2022, 22 (01)
  • [7] Prediction of early childhood obesity with machine learning and electronic health record data
    Pang, Xueqin
    Forrest, Christopher B.
    Le-Scherban, Felice
    Masino, Aaron J.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 150
  • [8] Prediction of incident myocardial infarction using machine learning applied to harmonized electronic health record data
    Divneet Mandair
    Premanand Tiwari
    Steven Simon
    Kathryn L. Colborn
    Michael A. Rosenberg
    [J]. BMC Medical Informatics and Decision Making, 20
  • [9] Using Machine Learning Approaches for Emergency Room Visit Prediction Based on Electronic Health Record Data
    Qiao, Zhi
    Sun, Ning
    Li, Xiang
    Xia, Eryu
    Zhao, Shiwan
    Qin, Yong
    [J]. BUILDING CONTINENTS OF KNOWLEDGE IN OCEANS OF DATA: THE FUTURE OF CO-CREATED EHEALTH, 2018, 247 : 111 - 115
  • [10] Prediction of incident myocardial infarction using machine learning applied to harmonized electronic health record data
    Mandair, Divneet
    Tiwari, Premanand
    Simon, Steven
    Colborn, Kathryn L.
    Rosenberg, Michael A.
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)