A bias-corrected estimator in multiple imputation for missing data

被引:10
|
作者
Tomita, Hiroaki [1 ]
Fujisawa, Hironori [1 ,2 ,3 ]
Henmi, Masayuki [1 ,2 ]
机构
[1] SOKENDAI, Sch Multidisciplinary Sci, Dept Stat Sci, Tokyo, Japan
[2] Inst Stat Math, 10-3 Midori Cho, Tachikawa, Tokyo 1908562, Japan
[3] Nagoya Univ, Grad Sch Med, Dept Math Stat, Nagoya, Aichi, Japan
关键词
density ratio; kernel density estimation; multiple imputation; weighted likelihood; COVARIATE DATA; VALUES; DISTRIBUTIONS; REGRESSION; INFERENCE;
D O I
10.1002/sim.7833
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multiple imputation (MI) is one of the most popular methods to deal with missing data, and its use has been rapidly increasing in medical studies. Although MI is rather appealing in practice since it is possible to use ordinary statistical methods for a complete data set once the missing values are fully imputed, the method of imputation is still problematic. If the missing values are imputed from some parametric model, the validity of imputation is not necessarily ensured, and the final estimate for a parameter of interest can be biased unless the parametric model is correctly specified. Nonparametric methods have been also proposed for MI, but it is not so straightforward as to produce imputation values from nonparametrically estimated distributions. In this paper, we propose a new method for MI to obtain a consistent (or asymptotically unbiased) final estimate even if the imputation model is misspecified. The key idea is to use an imputation model from which the imputation values are easily produced and to make a proper correction in the likelihood function after the imputation by using the density ratio between the imputation model and the true conditional density function for the missing variable as a weight. Although the conditional density must be nonparametrically estimated, it is not used for the imputation. The performance of our method is evaluated by both theory and simulation studies. A real data analysis is also conducted to illustrate our method by using the Duke Cardiac Catheterization Coronary Artery Disease Diagnostic Dataset.
引用
收藏
页码:3373 / 3386
页数:14
相关论文
共 50 条
  • [31] MULTIPLE IMPUTATION AS A MISSING DATA MACHINE
    BRAND, J
    VANBUUREN, S
    VANMULLIGEN, EM
    TIMMERS, T
    GELSEMA, E
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1994, : 303 - 306
  • [32] Multiple imputation with missing data indicators
    Beesley, Lauren J.
    Bondarenko, Irina
    Elliot, Michael R.
    Kurian, Allison W.
    Katz, Steven J.
    Taylor, Jeremy M. G.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (12) : 2685 - 2700
  • [33] Multiple imputation: dealing with missing data
    de Goeij, Moniek C. M.
    van Diepen, Merel
    Jager, Kitty J.
    Tripepi, Giovanni
    Zoccali, Carmine
    Dekker, Friedo W.
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2013, 28 (10) : 2415 - 2420
  • [34] Multiple imputation for nonignorable missing data
    Im, Jongho
    Kim, Soeun
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2017, 46 (04) : 583 - 592
  • [35] Multiple imputation for nonignorable missing data
    Jongho Im
    Soeun Kim
    Journal of the Korean Statistical Society, 2017, 46 : 583 - 592
  • [36] Bias-corrected bootstrap and model uncertainty
    Steck, H
    Jaakkola, TS
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 521 - 528
  • [37] A bias-corrected precipitation climatology for China
    Ye, BS
    Yang, DQ
    Ding, YJ
    Han, TD
    Koike, T
    JOURNAL OF HYDROMETEOROLOGY, 2004, 5 (06) : 1147 - 1160
  • [38] On the bias of the multiple-imputation variance estimator in survey sampling
    Kim, Jae Kwang
    Brick, J. Michael
    Fuller, Wayne A.
    Kalton, Graham
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2006, 68 : 509 - 521
  • [39] Bias-corrected random forests in regression
    Zhang, Guoyi
    Lu, Yan
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (01) : 151 - 160
  • [40] Bias-corrected estimates of GED returns
    Heckman, James J.
    LaFontaine, Paul A.
    JOURNAL OF LABOR ECONOMICS, 2006, 24 (03) : 661 - 700