Improved Cardiovascular Risk Prediction Using Nonparametric Regression and Electronic Health Record Data

被引:38
|
作者
Kennedy, Edward H. [1 ]
Wiitala, Wyndy L. [1 ]
Hayward, Rodney A. [1 ,2 ]
Sussman, Jeremy B. [1 ,2 ]
机构
[1] Ann Arbor VA Hlth Serv Res & Dev HSR&D Ctr Excell, VA Ctr Clin Management Res, Ann Arbor, MI USA
[2] Univ Michigan, Robert Wood Johnson Fdn Clin Scholars Program, Dept Internal Med, Ann Arbor, MI 48109 USA
关键词
cardiovascular disease; electronic health record; Framingham risk score; machine learning; nonparametric regression; risk prediction; HEART-DISEASE; INFORMATION-TECHNOLOGY; PREVENTION; STROKE; VALIDATION; ALGORITHMS; COMMITTEE; VETERANS; QUALITY; UPDATE;
D O I
10.1097/MLR.0b013e31827da594
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Use of the electronic health record (EHR) is expected to increase rapidly in the near future, yet little research exists on whether analyzing internal EHR data using flexible, adaptive statistical methods could improve clinical risk prediction. Extensive implementation of EHR in the Veterans Health Administration provides an opportunity for exploration. Objectives: To compare the performance of various approaches for predicting risk of cerebrovascular and cardiovascular (CCV) death, using traditional risk predictors versus more comprehensive EHR data. Research Design: Retrospective cohort study. We identified all Veterans Health Administration patients without recent CCV events treated at 12 facilities from 2003 to 2007, and predicted risk using the Framingham risk score, logistic regression, generalized additive modeling, and gradient tree boosting. Measures: The outcome was CCV-related death within 5 years. We assessed each method's predictive performance with the area under the receiver operating characteristic curve (AUC), the Hosmer-Lemeshow goodness-of-fit test, plots of estimated risk, and reclassification tables, using cross-validation to penalize overfitting. Results: Regression methods outperformed the Framingham risk score, even with the same predictors (AUC increased from 71% to 73% and calibration also improved). Even better performance was attained in models using additional EHR-derived predictor variables (AUC increased to 78% and net reclassification improvement was as large as 0.29). Nonparametric regression further improved calibration and discrimination compared with logistic regression. Conclusions: Despite the EHR lacking some risk factors and its imperfect data quality, health care systems may be able to substantially improve risk prediction for their patients by using internally developed EHR-derived models and flexible statistical methodology.
引用
收藏
页码:251 / 258
页数:8
相关论文
共 50 条
  • [21] Preoperative Prediction of Postoperative Infections Using Machine Learning and Electronic Health Record Data
    Zhuang, Yaxu
    Dyas, Adam
    Meguid, Robert A.
    Henderson, William G.
    Bronsert, Michael
    Madsen, Helen
    Colborn, Kathryn L.
    [J]. ANNALS OF SURGERY, 2024, 279 (04) : 720 - 726
  • [22] Development and validation of a dynamic inpatient risk prediction model for clinically significant hypokalemia using electronic health record data
    Li, Yan
    Staley, Benjamin
    Henriksen, Carl
    Xu, Dandan
    Lipori, Gloria
    Winterstein, Almut G.
    [J]. AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 2019, 76 (05) : 301 - 311
  • [23] Risk prediction for chronic kidney disease progression using heterogeneous electronic health record data and time series analysis
    Perotte, Adler
    Ranganath, Rajesh
    Hirsch, Jamie S.
    Blei, David
    Elhadad, Noemie
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2015, 22 (04) : 872 - 880
  • [24] Evaluation of Electronic Health Record-Based Suicide Risk Prediction Models on Contemporary Data
    Walker, Rod L.
    Shortreed, Susan M.
    Ziebell, Rebecca A.
    Johnson, Eric
    Boggs, Jennifer M.
    Lynch, Frances L.
    Daida, Yihe G.
    Ahmedani, Brian K.
    Rossom, Rebecca
    Coleman, Karen J.
    Simon, Gregory E.
    [J]. APPLIED CLINICAL INFORMATICS, 2021, 12 (04): : 778 - 787
  • [25] The Impact of Longitudinal Data-Completeness of Electronic Health Record Data on the Prediction Performance of Clinical Risk Scores
    Jin, Yinzhu
    Weberpals, Janick G.
    Wang, Shirley V.
    Desai, Rishi J.
    Merola, David
    Lin, Kueiyu Joshua
    [J]. CLINICAL PHARMACOLOGY & THERAPEUTICS, 2023, 113 (06) : 1359 - 1367
  • [26] Diabetic Retinopathy Risk Estimation Using Fuzzy Rules on Electronic Health Record Data
    Saleh, Emran
    Valls, Aida
    Moreno, Antonio
    Romero-Aroca, Pedro
    de la Riva-Fernandez, Sofia
    Sagarra-Alamo, Ramon
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, (MDAI 2016), 2016, 9880 : 263 - 274
  • [27] Identifying Racial/Ethnic Differences In Patterns Of Early Childhood Cardiovascular Health Using Electronic Health Record Data
    Pool, Lindsay
    Yang, Xiaoyun
    Krefman, Amy
    Perak, Amanda M.
    Davis, Matthew
    Rosenman, Marc
    Hou, Lifang
    Marino, Bradley S.
    Van Horn, Linda
    Labarthe, Darwin R.
    Lloyd-Jones, Donald M.
    Allen, Norrina B.
    [J]. CIRCULATION, 2021, 143
  • [28] PREDICTING BACTEREMIA USING ELECTRONIC HEALTH RECORD DATA
    Lonjers, Zachary
    Bhavani, Sivasubramanium
    Carey, Kyle
    Gilbert, Emily
    Afshar, Majid
    Churpek, Matthew
    [J]. CHEST, 2019, 156 (04) : 1607A - 1607A
  • [29] Machine Learning Models for Pancreatic Cancer Risk Prediction Using Electronic Health Record Data-A Systematic Review and Assessment
    Mishra, Anup Kumar
    Chong, Bradford
    Arunachalam, Shivaram P.
    Oberg, Ann L.
    Majumder, Shounak
    [J]. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2024, 119 (08): : 1466 - 1482
  • [30] Postoperative delirium prediction using machine learning models and preoperative electronic health record data
    Andrew Bishara
    Catherine Chiu
    Elizabeth L. Whitlock
    Vanja C. Douglas
    Sei Lee
    Atul J. Butte
    Jacqueline M. Leung
    Anne L. Donovan
    [J]. BMC Anesthesiology, 22