Evaluation of predictive model performance of an existing model in the presence of missing data

被引:2
|
作者
Li, Pin [1 ]
Taylor, Jeremy M. G. [1 ,2 ]
Spratt, Daniel E. [2 ]
Karnes, R. Jeffery [3 ]
Schipper, Matthew J. [1 ,2 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Radiat Oncol, Ann Arbor, MI 48109 USA
[3] Mayo Clin, Dept Urol, Rochester, MN USA
基金
美国国家卫生研究院;
关键词
area under the ROC curve; augmented inverse probability weighting; Brier score; inverse probability weighting; multiple imputation;
D O I
10.1002/sim.8978
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In medical research, the Brier score (BS) and the area under the receiver operating characteristic (ROC) curves (AUC) are two common metrics used to evaluate prediction models of a binary outcome, such as using biomarkers to predict the risk of developing a disease in the future. The assessment of an existing prediction models using data with missing covariate values is challenging. In this article, we propose inverse probability weighted (IPW) and augmented inverse probability weighted (AIPW) estimates of AUC and BS to handle the missing data. An alternative approach uses multiple imputation (MI), which requires a model for the distribution of the missing variable. We evaluated the performance of IPW and AIPW in comparison with MI in simulation studies under missing completely at random, missing at random, and missing not at random scenarios. When there are missing observations in the data, MI and IPW can be used to obtain unbiased estimates of BS and AUC if the imputation model for the missing variable or the model for the missingness is correctly specified. MI is more efficient than IPW. Our simulation results suggest that AIPW can be more efficient than IPW, and also achieves double robustness from miss-specification of either the missingness model or the imputation model. The outcome variable should be included in the model for the missing variable under all scenarios, while it only needs to be included in missingness model if the missingness depends on the outcome. We illustrate these methods using an example from prostate cancer.
引用
收藏
页码:3477 / 3498
页数:22
相关论文
共 50 条
  • [21] Extensions to the Visual Predictive Check to facilitate model performance evaluation
    Teun M. Post
    Jan I. Freijer
    Bart A. Ploeger
    Meindert Danhof
    Journal of Pharmacokinetics and Pharmacodynamics, 2008, 35 : 185 - 202
  • [22] Extensions to the Visual Predictive Check to facilitate model performance evaluation
    Post, Teun M.
    Freijer, Jan I.
    Ploeger, Bart A.
    Danhof, Meindert
    JOURNAL OF PHARMACOKINETICS AND PHARMACODYNAMICS, 2008, 35 (02) : 185 - 202
  • [23] Performance evaluation and relative predictive model of parallel file system
    Zhao T.-Z.
    Dong S.-B.
    Verdi M.
    See S.
    Ruan Jian Xue Bao/Journal of Software, 2011, 22 (09): : 2206 - 2221
  • [24] Comparative Performance Analysis of Coordinated Model Predictive Control Schemes in the Presence of Model-Plant Mismatch
    Anand, Abhay
    Samavedham, Lakshminarayanan
    Sundaramoorthy, Sitanandam
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2012, 51 (24) : 8273 - 8285
  • [25] Making correct statistical inferences in the presence of missing data using a wrong probability model
    Golden, R
    Henley, S
    White, H
    Kashner, TM
    Katz, R
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2005, 49 (01) : 102 - 102
  • [26] Modeling Change in the Presence of Nonrandomly Missing Data: Evaluating a Shared Parameter Mixture Model
    Gottfredson, Nisha C.
    Bauer, Daniel J.
    Baldwin, Scott A.
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2014, 21 (02) : 196 - 209
  • [27] A residuals-based transition model for longitudinal analysis with estimation in the presence of missing data
    Koru-Sengul, Tulay
    Stoffer, David S.
    Day, Nancy L.
    STATISTICS IN MEDICINE, 2007, 26 (17) : 3330 - 3341
  • [28] Least square estimator of the parameter of AR-ARCH model in the presence of missing data
    Hamaz, Abdelghani
    Altendji, Belkais
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2021, 60 (01): : 15 - 23
  • [29] Evaluation of orthogonal composite designs for second-order model in presence of missing observation
    Ezievuo, Chibuzo Solomon
    Oladugba, Abimibola Victoria
    Babatunde, Oluwagbenga Tobi
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2025, 41 (01) : 204 - 234
  • [30] Model Evaluation in the Presence of Categorical Data: Bayesian Model Checking as an Alternative to Traditional Methods
    Bonifay, Wes
    Depaoli, Sarah
    PREVENTION SCIENCE, 2023, 24 (03) : 467 - 479