Evaluating model-based imputation methods for missing covariates in regression models with interactions

被引:28
|
作者
Kim, Soeun [1 ]
Sugar, Catherine A. [2 ,3 ]
Belin, Thomas R. [2 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Biostat, Houston, TX 77030 USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Psychiat, Los Angeles, CA 90095 USA
基金
美国国家卫生研究院;
关键词
interaction; missing covariate; multiple imputation; multivariate normal; regression; MULTIPLE IMPUTATION; VALUES;
D O I
10.1002/sim.6435
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Imputation strategies are widely used in settings that involve inference with incomplete data. However, implementation of a particular approach always rests on assumptions, and subtle distinctions between methods can have an impact on subsequent analyses. In this research article, we are concerned with regression models in which the true underlying relationship includes interaction terms. We focus in particular on a linear model with one fully observed continuous predictor, a second partially observed continuous predictor, and their interaction. We derive the conditional distribution of the missing covariate and interaction term given the observed covariate and the outcome variable, and examine the performance of a multiple imputation procedure based on this distribution. We also investigate several alternative procedures that can be implemented by adapting multivariate normal multiple imputation software in ways that might be expected to perform well despite incompatibilities between model assumptions and true underlying relationships among the variables. The methods are compared in terms of bias, coverage, and CI width. As expected, the procedure based on the correct conditional distribution performs well across all scenarios. Just as importantly for general practitioners, several of the approaches based on multivariate normality perform comparably with the correct conditional distribution in a number of circumstances, although interestingly, procedures that seek to preserve the multiplicative relationship between the interaction term and the main-effects are found to be substantially less reliable. For illustration, the various procedures are applied to an analysis of post-traumatic stress disorder symptoms in a study of childhood trauma. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:1876 / 1888
页数:13
相关论文
共 50 条
  • [41] Model-based regression adjustment with model-free covariates for network interference
    Han, Kevin
    Ugander, Johan
    [J]. JOURNAL OF CAUSAL INFERENCE, 2023, 11 (01)
  • [42] Diagnostic measures for the Cox regression model with missing covariates
    Zhu, Hongtu
    Ibrahim, Joseph G.
    Chen, Ming-Hui
    [J]. BIOMETRIKA, 2015, 102 (04) : 907 - 923
  • [43] Estimation in a Markov chain regression model with missing covariates
    Dabrowska, DM
    Elashoff, RM
    Morton, DL
    [J]. PROBABILITY, STATISTICS AND MODELLING IN PUBLIC HEALTH, 2006, : 90 - +
  • [44] A Model-Based Imputation Procedure for Multilevel Regression Models With Random Coefficients, Interaction Effects, and Nonlinear Terms
    Enders, Craig K.
    Du, Han
    Keller, Brian T.
    [J]. PSYCHOLOGICAL METHODS, 2020, 25 (01) : 88 - 112
  • [45] Comparing methods for handling missing covariates in meta-regression
    Lee, Jihyun
    Beretvas, S. Natasha
    [J]. RESEARCH SYNTHESIS METHODS, 2023, 14 (01) : 117 - 136
  • [46] Multi-index regression models with missing covariates at random
    Guo, Xu
    Xu, Wangli
    Zhu, Lixing
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 123 : 345 - 363
  • [47] Monte Carlo EM for missing covariates in parametric regression models
    Ibrahim, JG
    Chen, MH
    Lipsitz, SR
    [J]. BIOMETRICS, 1999, 55 (02) : 591 - 596
  • [48] Performance Evaluation of Imputation Methods for Missing Data in Logistic Regression Model: Simulation and Application
    Mohamed, Salah M.
    Abonazel, Mohamed R.
    Ghallab, Mohamed G.
    [J]. THAILAND STATISTICIAN, 2023, 21 (04): : 926 - 942
  • [49] Bayesian Methods for Missing Covariates in Cure Rate Models
    Ming-Hui Chen
    Joseph G. Ibrahim
    Stuart R. Lipsitz
    [J]. Lifetime Data Analysis, 2002, 8 : 117 - 146
  • [50] Bayesian methods for missing covariates in cure rate models
    Chen, MH
    Ibrahim, JG
    Lipsitz, SR
    [J]. LIFETIME DATA ANALYSIS, 2002, 8 (02) : 117 - 146