Multiple imputation with missing indicators as proxies for unmeasured variables: simulation study

被引:22
|
作者
Sperrin, Matthew [1 ]
Martin, Glen P. [1 ]
机构
[1] Univ Manchester, Fac Biol Med & Hlth, Vaughan House, Manchester M13 9PL, Lancs, England
关键词
Missing data; Missing indicator; Multiple imputation; Simulation study; REGRESSION; BIAS;
D O I
10.1186/s12874-020-01068-x
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background Within routinely collected health data, missing data for an individual might provide useful information in itself. This occurs, for example, in the case of electronic health records, where the presence or absence of data is informative. While the naive use of missing indicators to try to exploit such information can introduce bias, its use in conjunction with multiple imputation may unlock the potential value of missingness to reduce bias in causal effect estimation, particularly in missing not at random scenarios and where missingness might be associated with unmeasured confounders. Methods We conducted a simulation study to determine when the use of a missing indicator, combined with multiple imputation, would reduce bias for causal effect estimation, under a range of scenarios including unmeasured variables, missing not at random, and missing at random mechanisms. We use directed acyclic graphs and structural models to elucidate a variety of causal structures of interest. We handled missing data using complete case analysis, and multiple imputation with and without missing indicator terms. Results We find that multiple imputation combined with a missing indicator gives minimal bias for causal effect estimation in most scenarios. In particular the approach: 1) does not introduce bias in missing (completely) at random scenarios; 2) reduces bias in missing not at random scenarios where the missing mechanism depends on the missing variable itself; and 3) may reduce or increase bias when unmeasured confounding is present. Conclusion In the presence of missing data, careful use of missing indicators, combined with multiple imputation, can improve causal effect estimation when missingness is informative, and is not detrimental when missingness is at random.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Multiple imputation with missing indicators as proxies for unmeasured variables: simulation study
    Matthew Sperrin
    Glen P. Martin
    [J]. BMC Medical Research Methodology, 20
  • [2] Multiple imputation with missing data indicators
    Beesley, Lauren J.
    Bondarenko, Irina
    Elliot, Michael R.
    Kurian, Allison W.
    Katz, Steven J.
    Taylor, Jeremy M. G.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (12) : 2685 - 2700
  • [3] Imputation and missing indicators for handling missing data in the development and deployment of clinical prediction models: A simulation study
    Sisk, Rose
    Sperrin, Matthew
    Peek, Niels
    van Smeden, Maarten
    Martin, Glen Philip
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2023, 32 (08) : 1461 - 1477
  • [4] Auxiliary Variables in Multiple Imputation When Data Are Missing Not at Random
    Mustillo, Sarah
    Kwon, Soyoung
    [J]. JOURNAL OF MATHEMATICAL SOCIOLOGY, 2015, 39 (02): : 73 - 91
  • [5] A simulation study on missing data imputation for dichotomous variables using statistical and machine learning methods
    Ge, Yingfeng
    Li, Zhiwei
    Zhang, Jinxin
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [6] A simulation study on missing data imputation for dichotomous variables using statistical and machine learning methods
    Yingfeng Ge
    Zhiwei Li
    Jinxin Zhang
    [J]. Scientific Reports, 13
  • [7] Multiple imputation validation study: addressing unmeasured survey data in a longitudinal design
    Claire A. Kolaja
    Ben Porter
    Teresa M. Powell
    Rudolph P. Rull
    [J]. BMC Medical Research Methodology, 21
  • [8] Multiple imputation in practice: Comparison of software packages for regression models with missing variables
    Horton, NJ
    Lipsitz, SR
    [J]. AMERICAN STATISTICIAN, 2001, 55 (03): : 244 - 254
  • [9] Multiple imputation validation study: addressing unmeasured survey data in a longitudinal design
    Kolaja, Claire A.
    Porter, Ben
    Powell, Teresa M.
    Rull, Rudolph P.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2021, 21 (01)
  • [10] Multiple imputation of missing values: Further update of ice, with an emphasis an categorical variables
    Royston, Patrick
    [J]. STATA JOURNAL, 2009, 9 (03): : 466 - 477