EVALUATING RISK-PREDICTION MODELS USING DATA FROM ELECTRONIC HEALTH RECORDS

被引:18
|
作者
Wang, Le [1 ]
Shaw, Pamela A. [1 ]
Mathelier, Hansie M. [2 ]
Kimmel, Stephen E. [1 ]
French, Benjamin [1 ]
机构
[1] Univ Penn, Dept Biostat & Epidemiol, 423 Guardian Dr, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Med, 51 N 39th St, Philadelphia, PA 19104 USA
来源
ANNALS OF APPLIED STATISTICS | 2016年 / 10卷 / 01期
关键词
Outcome misclassification; prediction accuracy; risk reclassification; ROC curves; HEART-FAILURE; LOGISTIC-REGRESSION; HOSPITAL READMISSION; 30-DAY READMISSION; BINARY REGRESSION; ECONOMIC BURDEN; VALIDATION DATA; MEDICAL-RECORD; RELATIVE RISK; ROC CURVES;
D O I
10.1214/15-AOAS891
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The availability of data from electronic health records facilitates the development and evaluation of risk-prediction models, but estimation of prediction accuracy could be limited by outcome misclassification, which can arise if events are not captured. We evaluate the robustness of prediction accuracy summaries, obtained from receiver operating characteristic curves and risk-reclassification methods, if events are not captured (i.e., "false negatives"). We derive estimators for sensitivity and specificity if misclassification is independent of marker values. In simulation studies, we quantify the potential for bias in prediction accuracy summaries if misclassification depends on marker values. We compare the accuracy of alternative prognostic models for 30-day all-cause hospital readmission among 4548 patients discharged from the University of Pennsylvania Health System with a primary diagnosis of heart failure. Simulation studies indicate that if misclassification depends on marker values, then the estimated accuracy improvement is also biased, but the direction of the bias depends on the direction of the association between markers and the probability of misclassification. In our application, 29% of the 1143 readmitted patients were readmitted to a hospital elsewhere in Pennsylvania, which reduced prediction accuracy. Outcome misclassification can result in erroneous conclusions regarding the accuracy of risk-prediction models.
引用
收藏
页码:286 / 304
页数:19
相关论文
共 50 条
  • [1] Area Deprivation Index and Cardiac Readmissions: Evaluating Risk-prediction in an Electronic Health Record
    Johnson, Amber E.
    Zhu, Jianhui
    Garrard, William
    Thoma, Floyd W.
    Mulukutla, Suresh R.
    Kershaw, Kiarri N.
    Magnani, Jared W.
    [J]. CIRCULATION, 2020, 142
  • [2] Area Deprivation Index and Cardiac Readmissions: Evaluating Risk-Prediction in an Electronic Health Record
    Johnson, Amber E.
    Zhu, Jianhui
    Garrard, William
    Thoma, Floyd W.
    Mulukutla, Suresh
    Kershaw, Kiarri N.
    Magnani, Jared W.
    [J]. JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2021, 10 (13):
  • [3] FLEXIBLE RISK PREDICTION MODELS FOR LEFT OR INTERVAL-CENSORED DATA FROM ELECTRONIC HEALTH RECORDS
    Hyun, Noorie
    Cheung, Li C.
    Pan, Qing
    Schiffman, Mark
    Katki, Hormuzd A.
    [J]. ANNALS OF APPLIED STATISTICS, 2017, 11 (02): : 1063 - 1084
  • [4] Deep and machine learning models to improve risk prediction of cardiovascular disease using data extraction from electronic health records
    Korsakov, I.
    Gusev, A.
    Kuznetsova, T.
    Gavrilov, D.
    Novitskiy, R.
    [J]. EUROPEAN HEART JOURNAL, 2019, 40 : 1213 - 1213
  • [5] Delirium Prediction using Machine Learning Models on Preoperative Electronic Health Records Data
    Davoudi, Anis
    Ebadi, Ashkan
    Rashidi, Parisa
    Ozrazgat-Baslanti, Tazcan
    Bihorac, Azra
    Bursian, Alberto C.
    [J]. 2017 IEEE 17TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2017, : 568 - 573
  • [6] Reinforced Risk Prediction With Budget Constraint Using Irregularly Measured Data From Electronic Health Records
    Pan, Yinghao
    Laber, Eric B.
    Smith, Maureen A.
    Zhao, Ying-Qi
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1090 - 1101
  • [7] Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review
    Goldstein, Benjamin A.
    Navar, Ann Marie
    Pencina, Michael J.
    Ioannidis, John P. A.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (01) : 198 - 208
  • [8] Risk Prediction With Electronic Health Records
    Goldstein, Benjamin A.
    Navar, Ann Marie
    Pencina, Michael J.
    [J]. JAMA CARDIOLOGY, 2016, 1 (09) : 976 - 977
  • [9] Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
    Carrasco-Ribelles, Lucia A.
    Llanes-Jurado, Jose
    Gallego-Moll, Carlos
    Cabrera-Bean, Margarita
    Monteagudo-Zaragoza, Monica
    Violan, Concepcion
    Zabaleta-del-Olmo, Edurne
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (12) : 2072 - 2082
  • [10] Breast Cancer Risk Prediction using Electronic Health Records
    Wu, Yirong
    Burnside, Elizabeth S.
    Cox, Jennifer
    Fan, Jun
    Yuan, Ming
    Yin, Jie
    Peissig, Peggy
    Cobian, Alexander
    Page, David
    Craven, Mark
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, : 224 - 228