EVALUATING RISK-PREDICTION MODELS USING DATA FROM ELECTRONIC HEALTH RECORDS

被引:18
|
作者
Wang, Le [1 ]
Shaw, Pamela A. [1 ]
Mathelier, Hansie M. [2 ]
Kimmel, Stephen E. [1 ]
French, Benjamin [1 ]
机构
[1] Univ Penn, Dept Biostat & Epidemiol, 423 Guardian Dr, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Med, 51 N 39th St, Philadelphia, PA 19104 USA
来源
ANNALS OF APPLIED STATISTICS | 2016年 / 10卷 / 01期
关键词
Outcome misclassification; prediction accuracy; risk reclassification; ROC curves; HEART-FAILURE; LOGISTIC-REGRESSION; HOSPITAL READMISSION; 30-DAY READMISSION; BINARY REGRESSION; ECONOMIC BURDEN; VALIDATION DATA; MEDICAL-RECORD; RELATIVE RISK; ROC CURVES;
D O I
10.1214/15-AOAS891
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The availability of data from electronic health records facilitates the development and evaluation of risk-prediction models, but estimation of prediction accuracy could be limited by outcome misclassification, which can arise if events are not captured. We evaluate the robustness of prediction accuracy summaries, obtained from receiver operating characteristic curves and risk-reclassification methods, if events are not captured (i.e., "false negatives"). We derive estimators for sensitivity and specificity if misclassification is independent of marker values. In simulation studies, we quantify the potential for bias in prediction accuracy summaries if misclassification depends on marker values. We compare the accuracy of alternative prognostic models for 30-day all-cause hospital readmission among 4548 patients discharged from the University of Pennsylvania Health System with a primary diagnosis of heart failure. Simulation studies indicate that if misclassification depends on marker values, then the estimated accuracy improvement is also biased, but the direction of the bias depends on the direction of the association between markers and the probability of misclassification. In our application, 29% of the 1143 readmitted patients were readmitted to a hospital elsewhere in Pennsylvania, which reduced prediction accuracy. Outcome misclassification can result in erroneous conclusions regarding the accuracy of risk-prediction models.
引用
收藏
页码:286 / 304
页数:19
相关论文
共 50 条
  • [41] RISK MANAGEMENT IN INTENSIVE CARE UNITS WITH ARTIFICIAL INTELLIGENCE TECHNOLOGIES: SYSTEMATIC REVIEW OF PREDICTION MODELS USING ELECTRONIC HEALTH RECORDS
    Cayirtepe, Zuhal
    Senel, Ahmet Can
    [J]. JOURNAL OF BASIC AND CLINICAL HEALTH SCIENCES, 2022, 6 (03): : 958 - 976
  • [42] Extracting antipsychotic polypharmacy data from electronic health records: developing and evaluating a novel process
    Giouliana Kadra
    Robert Stewart
    Hitesh Shetty
    Richard G. Jackson
    Mark A. Greenwood
    Angus Roberts
    Chin-Kuo Chang
    James H. MacCabe
    Richard D. Hayes
    [J]. BMC Psychiatry, 15
  • [43] Evaluating the comparability of patient-level social risk data extracted from electronic health records: A systematic scoping review
    Linfield, Gaia H.
    Patel, Shyam
    Ko, Hee Joo
    Lacar, Benjamin
    Gottlieb, Laura M.
    Adler-Milstein, Julia
    Singh, Nina, V
    Pantell, Matthew S.
    De Marchis, Emilia H.
    [J]. HEALTH INFORMATICS JOURNAL, 2023, 29 (03)
  • [44] Extracting antipsychotic polypharmacy data from electronic health records: developing and evaluating a novel process
    Kadra, Giouliana
    Stewart, Robert
    Shetty, Hitesh
    Jackson, Richard G.
    Greenwood, Mark A.
    Roberts, Angus
    Chang, Chin-Kuo
    MacCabe, James H.
    Hayes, Richard D.
    [J]. BMC PSYCHIATRY, 2015, 15
  • [45] A Stakeholder-Informed Ethical Framework to Guide Implementation of Suicide Risk Prediction Models Derived from Electronic Health Records
    Yarborough, Bobbi Jo H.
    Stumbo, Scott P.
    [J]. ARCHIVES OF SUICIDE RESEARCH, 2023, 27 (02) : 704 - 717
  • [46] Using electronic health records data for clinical research
    Wu, Li-Tzy T.
    Spratt, Susan
    Heidenfelder, Brooke
    Tai, Betty
    Ghitza, Udi
    [J]. DRUG AND ALCOHOL DEPENDENCE, 2017, 171 : E219 - E219
  • [47] Using Natural Language Processing on Electronic Health Records to Enhance Detection and Prediction of Psychosis Risk
    Irving, Jessica
    Patel, Rashmi
    Oliver, Dominic
    Colling, Craig
    Pritchard, Megan
    Broadbent, Matthew
    Baldwin, Helen
    Stahl, Daniel
    Stewart, Robert
    Fusar-Poli, Paolo
    [J]. SCHIZOPHRENIA BULLETIN, 2021, 47 (02) : 405 - 414
  • [48] A comparison of risk prediction methods using repeated observations: an application to electronic health records for hemodialysis
    Goldstein, Benjamin A.
    Pomann, Gina Maria
    Winkelmayer, Wolfgang C.
    Pencina, Michael J.
    [J]. STATISTICS IN MEDICINE, 2017, 36 (17) : 2750 - 2763
  • [49] Label Dependent Attention Model for Disease Risk Prediction Using Multimodal Electronic Health Records
    Niu, Shuai
    Yin, Qing
    Song, Yunya
    Guo, Yike
    Yang, Xian
    [J]. 2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 449 - 458
  • [50] Prediction and diagnosis of depression using machine learning with electronic health records data: a systematic review
    Nickson, David
    Meyer, Caroline
    Walasek, Lukasz
    Toro, Carla
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)