Comparing Machine Learning to Regression Methods for Mortality Prediction Using Veterans Affairs Electronic Health Record Clinical Data

被引:7
|
作者
Jing, Bocheng [1 ,2 ,3 ]
Boscardin, W. John [1 ,3 ,4 ]
Deardorff, W. James [3 ]
Jeon, Sun Young [1 ,3 ]
Lee, Alexandra K. [1 ,3 ]
Donovan, Anne L. [5 ]
Lee, Sei J. [1 ,3 ]
机构
[1] San Francisco VA Hlth Care Syst, San Francisco, CA USA
[2] Northern Calif Inst Res & Educ, San Francisco, CA USA
[3] Univ Calif San Francisco, Div Geriatr, San Francisco, CA USA
[4] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
[5] Univ Calif San Francisco, Dept Anesthesia & Perioperat Med, San Francisco, CA USA
关键词
prediction; machine learning; electronic health record; Veterans Affairs; REGULARIZATION PATHS; LOGISTIC-REGRESSION; PERFORMANCE; MODELS; RISK; PREVENTION; DIAGNOSIS; SELECTION; SURVIVAL;
D O I
10.1097/MLR.0000000000001720
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: It is unclear whether machine learning methods yield more accurate electronic health record (EHR) prediction models compared with traditional regression methods. Objective: The objective of this study was to compare machine learning and traditional regression models for 10-year mortality prediction using EHR data. Design: This was a cohort study. Setting: Veterans Affairs (VA) EHR data. Participants: Veterans age above 50 with a primary care visit in 2005, divided into separate training and testing cohorts (n= 124,360 each). Measurements and Analytic Methods: The primary outcome was 10-year all-cause mortality. We considered 924 potential predictors across a wide range of EHR data elements including demographics (3), vital signs (9), medication classes (399), disease diagnoses (293), laboratory results (71), and health care utilization (149). We compared discrimination (c-statistics), calibration metrics, and diagnostic test characteristics (sensitivity, specificity, and positive and negative predictive values) of machine learning and regression models. Results: Our cohort mean age (SD) was 68.2 (10.5), 93.9% were male; 39.4% died within 10 years. Models yielded testing cohort c-statistics between 0.827 and 0.837. Utilizing all 924 predictors, the Gradient Boosting model yielded the highest c-statistic [0.837, 95% confidence interval (CI): 0.835-0.839]. The full (unselected) logistic regression model had the highest c-statistic of regression models (0.833, 95% CI: 0.830-0.835) but showed evidence of overfitting. The discrimination of the stepwise selection logistic model (101 predictors) was similar (0.832, 95% CI: 0.830-0.834) with minimal overfitting. All models were well-calibrated and had similar diagnostic test characteristics. Limitation: Our results should be confirmed in non-VA EHRs. Conclusion: The differences in c-statistic between the best machine learning model (924-predictor Gradient Boosting) and 101-predictor stepwise logistic models for 10-year mortality prediction were modest, suggesting stepwise regression methods continue to be a reasonable method for VA EHR mortality prediction model development.
引用
收藏
页码:470 / 479
页数:10
相关论文
共 50 条
  • [41] Using Natural Language Processing and Machine Learning to Identify Opioids in Electronic Health Record Data
    McDermott, Sean P.
    Wasan, Ajay D.
    [J]. JOURNAL OF PAIN RESEARCH, 2023, 16 : 2133 - 2140
  • [42] Predicting Intensive Care Unit Readmission with Machine Learning Using Electronic Health Record Data
    Rojas, J. C.
    Carey, K. A.
    Edelson, D. P.
    Venable, L. R.
    Howell, M. D.
    Churpek, M. M.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2018, 197
  • [43] Predicting Intensive Care Unit Readmission with Machine Learning Using Electronic Health Record Data
    Rojas, Juan C.
    Carey, Kyle A.
    Edelson, Dana P.
    Venable, Laura R.
    Howell, Michael D.
    Churpek, Matthew M.
    [J]. ANNALS OF THE AMERICAN THORACIC SOCIETY, 2018, 15 (07) : 846 - 853
  • [44] Machine Learning-Based Short-Term Mortality Prediction Models for Patients With Cancer Using Electronic Health Record Data: Systematic Review and Critical Appraisal
    Lu, Sheng-Chieh
    Xu, Cai
    Nguyen, Chandler H.
    Geng, Yimin
    Pfob, Andre
    Sidey-Gibbons, Chris
    [J]. JMIR MEDICAL INFORMATICS, 2022, 10 (03)
  • [45] Delirium Prediction using Machine Learning Models on Preoperative Electronic Health Records Data
    Davoudi, Anis
    Ebadi, Ashkan
    Rashidi, Parisa
    Ozrazgat-Baslanti, Tazcan
    Bihorac, Azra
    Bursian, Alberto C.
    [J]. 2017 IEEE 17TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2017, : 568 - 573
  • [46] Machine Learning Prediction of Kidney Stone Composition Using Electronic Health Record-Derived Features
    Abraham, Abin
    Kavoussi, Nicholas L.
    Sui, Wilson
    Bejan, Cosmin
    Capra, John A.
    Hsi, Ryan
    [J]. JOURNAL OF ENDOUROLOGY, 2022, 36 (02) : 243 - 250
  • [47] Learning About Machine Learning: The Promise and Pitfalls of Big Data and the Electronic Health Record
    Deo, Rahul C.
    Nallamothu, Brahmajee K.
    [J]. CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES, 2016, 9 (06): : 618 - 620
  • [48] Real-time prediction of mortality, readmission, and length of stay using electronic health record data
    Cai, Xiongcai
    Perez-Concha, Oscar
    Coiera, Enrico
    Martin-Sanchez, Fernando
    Day, Richard
    Roffe, David
    Gallego, Blanca
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (03) : 553 - 561
  • [49] Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation
    Tiwari, Premanand
    Colborn, Kathryn L.
    Smith, Derek E.
    Xing, Fuyong
    Ghosh, Debashis
    Rosenberg, Michael A.
    [J]. JAMA NETWORK OPEN, 2020, 3 (01)
  • [50] Electronic Medical Record based Machine Learning Methods for Adverse Pregnancy Outcome Prediction
    Hang, Yuwei
    Zhang, Yan
    Lv, Yan
    Yu, Wenbin
    Lin, Yi
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2021, 11719