Evaluation of predictive model performance of an existing model in the presence of missing data

被引:2
|
作者
Li, Pin [1 ]
Taylor, Jeremy M. G. [1 ,2 ]
Spratt, Daniel E. [2 ]
Karnes, R. Jeffery [3 ]
Schipper, Matthew J. [1 ,2 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Radiat Oncol, Ann Arbor, MI 48109 USA
[3] Mayo Clin, Dept Urol, Rochester, MN USA
基金
美国国家卫生研究院;
关键词
area under the ROC curve; augmented inverse probability weighting; Brier score; inverse probability weighting; multiple imputation;
D O I
10.1002/sim.8978
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In medical research, the Brier score (BS) and the area under the receiver operating characteristic (ROC) curves (AUC) are two common metrics used to evaluate prediction models of a binary outcome, such as using biomarkers to predict the risk of developing a disease in the future. The assessment of an existing prediction models using data with missing covariate values is challenging. In this article, we propose inverse probability weighted (IPW) and augmented inverse probability weighted (AIPW) estimates of AUC and BS to handle the missing data. An alternative approach uses multiple imputation (MI), which requires a model for the distribution of the missing variable. We evaluated the performance of IPW and AIPW in comparison with MI in simulation studies under missing completely at random, missing at random, and missing not at random scenarios. When there are missing observations in the data, MI and IPW can be used to obtain unbiased estimates of BS and AUC if the imputation model for the missing variable or the model for the missingness is correctly specified. MI is more efficient than IPW. Our simulation results suggest that AIPW can be more efficient than IPW, and also achieves double robustness from miss-specification of either the missingness model or the imputation model. The outcome variable should be included in the model for the missing variable under all scenarios, while it only needs to be included in missingness model if the missingness depends on the outcome. We illustrate these methods using an example from prostate cancer.
引用
收藏
页码:3477 / 3498
页数:22
相关论文
共 50 条
  • [41] A data-driven approach for model predictive control performance monitoring
    Zhang, Guang-Ming
    Li, Ning
    Li, Shao-Yuan
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2011, 45 (08): : 1113 - 1118
  • [42] Experimental Evaluation of Model Predictive Control using Data Driven Models
    Paranjape, Pournima Vikas
    Patel, Nitinkumar, V
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1187 - 1191
  • [43] Evaluation of hyperspectral data for deep learning model performance
    Butler, Samantha J.
    Price, Stanton R.
    Carley, Samantha S.
    Land, Haley B.
    Price, Steven R.
    ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGING XXX, 2024, 13031
  • [44] On the Performance Evaluation Model Based on Data Envelopment Analysis
    Jin, Ying
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 250 - 253
  • [45] An evaluation of the predictive performance and mapping power of the BayesR model for genomic prediction
    Mollandin, Fanny
    Rau, Andrea
    Croiseau, Pascal
    G3-GENES GENOMES GENETICS, 2021, 11 (11):
  • [46] A Class of Model Predictive Safety Performance Metrics for Driving Behavior Evaluation
    Weng, Bowen
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 180 - 187
  • [47] Performance evaluation of a MPPT controller with model predictive control for a photovoltaic system
    Pradhan, Roshan
    Panda, Aurobinda
    INTERNATIONAL JOURNAL OF ELECTRONICS, 2020, 107 (10) : 1543 - 1558
  • [48] An interoceptive predictive coding model of conscious presence
    Sethi, Anil K.
    Suzuki, Keisuke
    Critchley, Hugo D.
    FRONTIERS IN PSYCHOLOGY, 2012, 3
  • [49] A Data-Driven Model Predictive Control for Alleviating Thermal Overloads in the Presence of Possible False Data
    Ma, Rui
    Basumallik, Sagnik
    Eftekharnejad, Sara
    Kong, Fanxin
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2021, 57 (02) : 1872 - 1881
  • [50] Envelope index evaluation model of existing buildings
    Rodrigues, M. Fernanda S.
    Cardoso Teixeira, J. M.
    Cardoso, J. Claudino P.
    Batel Anjos, A. J.
    CIVIL ENGINEERING AND ENVIRONMENTAL SYSTEMS, 2013, 30 (01) : 26 - 39