Evaluating model-based imputation methods for missing covariates in regression models with interactions

Cited by: 28
Authors
Kim, Soeun [1]
Sugar, Catherine A. [2, 3]
Belin, Thomas R. [2]
Affiliations
[1] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Biostat, Houston, TX 77030 USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Psychiat, Los Angeles, CA 90095 USA
Funding
US National Institutes of Health
Keywords
interaction; missing covariate; multiple imputation; multivariate normal; regression; VALUES
DOI
10.1002/sim.6435
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Imputation strategies are widely used in settings that involve inference with incomplete data. However, implementation of a particular approach always rests on assumptions, and subtle distinctions between methods can have an impact on subsequent analyses. In this research article, we are concerned with regression models in which the true underlying relationship includes interaction terms. We focus in particular on a linear model with one fully observed continuous predictor, a second partially observed continuous predictor, and their interaction. We derive the conditional distribution of the missing covariate and interaction term given the observed covariate and the outcome variable, and examine the performance of a multiple imputation procedure based on this distribution. We also investigate several alternative procedures that can be implemented by adapting multivariate normal multiple imputation software in ways that might be expected to perform well despite incompatibilities between model assumptions and true underlying relationships among the variables. The methods are compared in terms of bias, coverage, and confidence interval (CI) width. As expected, the procedure based on the correct conditional distribution performs well across all scenarios. Just as importantly for general practitioners, several of the approaches based on multivariate normality perform comparably with the correct conditional distribution in a number of circumstances, although interestingly, procedures that seek to preserve the multiplicative relationship between the interaction term and the main effects are found to be substantially less reliable. For illustration, the various procedures are applied to an analysis of post-traumatic stress disorder symptoms in a study of childhood trauma. Copyright © 2015 John Wiley & Sons, Ltd.
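The conditional draw described in the abstract can be illustrated with a minimal sketch. It is not taken from the article, and the abstract does not confirm that it matches the authors' derivation. It assumes the outcome model Y = b0 + b1*X1 + b2*X2 + b3*X1*X2 + e with e ~ N(0, sigma2), assumes X2 given X1 is normal with mean a0 + a1*X1 and variance tau2, and treats all parameters as known. In a real multiple imputation procedure the parameters would themselves be drawn or estimated. The function names and argument layout are hypothetical. Under these assumptions, X2 given (X1, Y) is normal with a variance that depends on X1, which is why the joint distribution is not multivariate normal.

```python
import math
import random


def conditional_x2_params(y, x1, beta, sigma2, alpha, tau2):
    """Mean and variance of X2 given (X1 = x1, Y = y) under the assumed model.

    beta = (b0, b1, b2, b3) are the outcome-model coefficients (assumed known),
    alpha = (a0, a1) defines E[X2 | X1] = a0 + a1 * x1 (an assumption).
    """
    b0, b1, b2, b3 = beta
    a0, a1 = alpha
    prior_mean = a0 + a1 * x1    # E[X2 | X1 = x1]
    intercept = b0 + b1 * x1     # part of E[Y | X1, X2] that does not involve X2
    slope = b2 + b3 * x1         # coefficient on X2; it changes with x1
    # Standard normal-normal update: precisions add, means are precision-weighted.
    precision = 1.0 / tau2 + slope ** 2 / sigma2
    mean = (prior_mean / tau2 + slope * (y - intercept) / sigma2) / precision
    return mean, 1.0 / precision


def impute_x2_and_interaction(y, x1, beta, sigma2, alpha, tau2, rng):
    """Draw one imputed X2 and form the interaction term from that same draw."""
    mean, var = conditional_x2_params(y, x1, beta, sigma2, alpha, tau2)
    x2 = rng.gauss(mean, math.sqrt(var))
    return x2, x1 * x2


# Hypothetical parameter values, chosen only for illustration.
rng = random.Random(0)
x2_draw, interaction_draw = impute_x2_and_interaction(
    y=1.0, x1=2.0, beta=(0.0, 1.0, 1.0, 0.5), sigma2=1.0,
    alpha=(0.0, 0.3), tau2=1.0, rng=rng,
)
```

Here the interaction is the product of X1 and the drawn X2, so the multiplicative relationship holds by construction while the draw still uses the conditional distribution implied by the assumed model. This differs from the procedures the abstract reports as less reliable, which impute under a multivariate normal model and then enforce the product relationship.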
Pages: 1876-1888
Number of pages: 13