Assessing risk model calibration with missing covariates

被引:1
|
作者
Shin, Yei Eun [1 ]
Gail, Mitchell H. [1 ]
Pfeiffer, Ruth M. [1 ]
机构
[1] NCI, Biostat Branch, Div Canc Epidemiol & Genet, 9609 Med Ctr Dr, Rockville, MD 20850 USA
关键词
Case-cohort study; External validation; Missing; Model calibration; Nested case-control study; Pseudo-risk model; Survey calibration; Weight adjustment; NESTED CASE-CONTROL; CASE-COHORT; PREDICTION; DESIGNS;
D O I
10.1093/biostatistics/kxaa060
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
When validating a risk model in an independent cohort, some predictors may be missing for some subjects. Missingness can be unplanned or by design, as in case-cohort or nested case-control studies, in which some covariates are measured only in subsampled subjects. Weighting methods and imputation are used to handle missing data. We propose methods to increase the efficiency of weighting to assess calibration of a risk model (i.e. bias in model predictions), which is quantified by the ratio of the number of observed events, O, to expected events, E, computed from the model. We adjust known inverse probability weights by incorporating auxiliary information available for all cohort members. We use survey calibration that requires the weighted sum of the auxiliary statistics in the complete data subset to equal their sum in the full cohort. We show that a pseudo-risk estimate that approximates the actual risk value but uses only variables available for the entire cohort is an excellent auxiliary statistic to estimate E. We derive analytic variance formulas for O/E with adjusted weights. In simulations, weight adjustment with pseudo-risk was much more efficient than inverse probability weighting and yielded consistent estimates even when the pseudo-risk was a poor approximation. Multiple imputation was often efficient but yielded biased estimates when the imputation model was misspecified. Using these methods, we assessed calibration of an absolute risk model for second primary thyroid cancer in an independent cohort.
引用
收藏
页码:875 / 890
页数:16
相关论文
共 50 条
  • [1] Regression Analysis with Covariates Missing at Random: A Piece-wise Nonparametric Model for Missing Covariates
    Zhao, Yang
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (20) : 3736 - 3744
  • [2] Model averaging with covariates that are missing completely at random
    Zhang, Xinyu
    [J]. ECONOMICS LETTERS, 2013, 121 (03) : 360 - 363
  • [3] Model checking for a general linear model with nonignorable missing covariates
    Sun, Zhi-hua
    Ip, Wai-Cheung
    Wong, Heung
    [J]. ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2012, 28 (01): : 99 - 110
  • [4] Model checking for a general linear model with nonignorable missing covariates
    Zhi-hua Sun
    Wai-Cheung Ip
    Heung Wong
    [J]. Acta Mathematicae Applicatae Sinica, English Series, 2012, 28 : 99 - 110
  • [5] Model Checking for a General Linear Model with Nonignorable Missing Covariates
    Zhihua SUN WaiCheung IP Heung WONG School of Mathematical sciences Graduate University of Chinese Academy of Sciences Beijing China Academy of Mathematics and Systems Science Chinese Academy of Sciences Beijing China Department of Applied Mathematics Hong Kong Polytechnic University Hung Hom Kowloon Hong Kong China
    [J]. Acta Mathematicae Applicatae Sinica(English Series)., 2012, 28 (01) - 110
  • [7] Reweighting estimators for the additive hazards model with missing covariates
    Hao, Meiling
    Song, Xinyuan
    Sun, Liuquan
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2014, 42 (02): : 285 - 307
  • [8] Estimation in a Markov chain regression model with missing covariates
    Dabrowska, DM
    Elashoff, RM
    Morton, DL
    [J]. PROBABILITY, STATISTICS AND MODELLING IN PUBLIC HEALTH, 2006, : 90 - +
  • [9] Diagnostic measures for the Cox regression model with missing covariates
    Zhu, Hongtu
    Ibrahim, Joseph G.
    Chen, Ming-Hui
    [J]. BIOMETRIKA, 2015, 102 (04) : 907 - 923
  • [10] On using the Cox proportional hazards model with missing covariates
    Paik, MC
    Tsai, WY
    [J]. BIOMETRIKA, 1997, 84 (03) : 579 - 593