A hot deck imputation procedure for multiply imputing nonignorable missing data: The proxy pattern-mixture hot deck

被引:12
|
作者
Sullivan, Danielle [1 ]
Andridge, Rebecca [1 ]
机构
[1] Ohio State Univ, Coll Publ Hlth, Div Biostat, Columbus, OH 43210 USA
关键词
Hot deck; Nonignorable missingness; Donor selection; Sensitivity analysis; VARIANCE-ESTIMATION; BOOTSTRAP; MODELS;
D O I
10.1016/j.csda.2014.09.008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Hot deck imputation is a common method for handling item nonresponse in surveys, but most implementations assume data are missing at random (MAR). A new hot deck method for imputation of a continuous partially missing outcome variable that harnesses the power of available covariates but does not assume data are MAR is proposed. A parametric model is used to create predicted means for both donors and donees under varying assumptions on the missing data mechanism, ranging from MAR to missing not at random (MNAR). For a given assumption on the missingness mechanism, the predicted means are used to define distances between donors and donees and probabilities of selection proportional to those distances. Multiple imputation using the hot deck is performed to create a set of completed data sets, using an approximate Bayesian bootstrap to ensure "proper" imputations. This new hot deck method creates an intuitive sensitivity analysis where imputations may be performed under MAR and under varying MNAR mechanisms, and the resulting impact on inference can be evaluated. In addition, a donor quality metric is proposed to help identify situations where close matches of donor to donee are not available, which can occur under strong MNAR assumptions. Bias and coverage of estimates from the proposed method are investigated through simulation and the method is applied to estimation of income in the Ohio Medicaid Assessment Survey. Results show that the method performs best when covariates are at least moderately predictive of the partially missing outcome, and without such covariates it effectively reduces to a simple random hot deck for all missingness assumptions. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:173 / 185
页数:13
相关论文
共 26 条
  • [1] Hot Deck Multiple Imputation for Handling Missing Accelerometer Data
    Butera, Nicole M.
    Li, Siying
    Evenson, Kelly R.
    Di, Chongzhi
    Buchner, David M.
    LaMonte, Michael J.
    LaCroix, Andrea Z.
    Herring, Amy
    [J]. STATISTICS IN BIOSCIENCES, 2019, 11 (02) : 422 - 448
  • [2] Hot Deck Multiple Imputation for Handling Missing Accelerometer Data
    Nicole M. Butera
    Siying Li
    Kelly R. Evenson
    Chongzhi Di
    David M. Buchner
    Michael J. LaMonte
    Andrea Z. LaCroix
    Amy Herring
    [J]. Statistics in Biosciences, 2019, 11 : 422 - 448
  • [3] A Comparison of Hot Deck Imputation and Substitution Methods in The Estimation of Missing Data
    Yesilova, Abdullah
    Kaya, Yilmaz
    Almali, M. Nuri
    [J]. GAZI UNIVERSITY JOURNAL OF SCIENCE, 2011, 24 (01): : 69 - 75
  • [4] Missing Value Analysis of Numerical Data using Fractional Hot Deck Imputation
    Christopher, Samuel Zico
    Siswantining, Titin
    Sarwinda, Devvi
    Bustaman, Alhadi
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
  • [5] On Limiting Donor Usage for Imputation of Missing Data via Hot Deck Methods
    Bankhofer, Udo
    Joenssen, Dieter William
    [J]. DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 3 - 11
  • [6] A global Water Quality Index and hot-deck imputation of missing data
    Srebotnjak, Tanja
    Carr, Genevieve
    de Sherbinin, Alexander
    Rickwood, Carrie
    [J]. ECOLOGICAL INDICATORS, 2012, 17 : 108 - 119
  • [7] A hot-deck multiple imputation procedure for gaps in longitudinal data on recurrent events
    Little, Roderick J.
    Yosef, Matheos
    Cain, Kevin C.
    Nan, Bin
    Harlow, Sioban D.
    [J]. STATISTICS IN MEDICINE, 2008, 27 (01) : 103 - 120
  • [8] DATA-ANALYSIS USING HOT DECK MULTIPLE IMPUTATION
    REILLY, M
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1993, 42 (03) : 307 - 313
  • [9] Goodbye, Listwise Deletion: Presenting Hot Deck Imputation as an Easy and Effective Tool for Handling Missing Data
    Myers, Teresa A.
    [J]. COMMUNICATION METHODS AND MEASURES, 2011, 5 (04) : 297 - 310
  • [10] FINDING A FLEXIBLE HOT-DECK IMPUTATION METHOD FOR MULTINOMIAL DATA
    Andridge, Rebecca
    Bechtel, Laura
    Thompson, Katherine Jenny
    [J]. JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2021, 9 (04) : 789 - 809