The relationship between hot-deck multiple imputation and weighted likelihood

被引:0
|
作者
Reilly, M [1 ]
Pepe, M [1 ]
机构
[1] FRED HUTCHINSON CANC RES CTR,SEATTLE,WA 98104
关键词
D O I
10.1002/(SICI)1097-0258(19970115)16:1<5::AID-SIM469>3.0.CO;2-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Hot-deck imputation is an intuitively simple and popular method of accommodating incomplete data, Users of the method will often use the usual multiple imputation variance estimator which is not appropriate in this case. However, no variance expression has yet been derived for this easily implemented method applied to missing covariates in regression models, The simple hot-deck method is in fact asymptotically equivalent to the mean-score method for the estimation of a regression model parameter, so that hot-deck can be understood in the context of likelihood methods. Both of these methods accommodate data where missingness may depend on the observed variables but not on the unobserved value of the incomplete covariate, that is, missing at random (MAR). The asymptotic properties of hot-deck are derived here for the case where the fully observed variables are categorical, though the incomplete covariate(s) may be continuous. Simulation studies indicate that the two methods compare well in small samples and for small numbers of imputations. Current users of hot-deck may now conduct their analysis using mean-score, which is a weighted likelihood method and can thus be implemented by a single pass through the data using any standard package which accommodates weighted regression models. Valid inference is now straightforward using the variance expression provided here, The equivalence of mean-score and hot-deck is illustrated using three clinical data sets where an important covariate is missing for a large number of study subjects.
引用
收藏
页码:5 / 19
页数:15
相关论文
共 50 条
  • [1] Weighted Hot-Deck Imputation in Farm and Fishery Household Economy Surveys
    Kim, Kyu-Seong
    Lee, Kee-Jae
    Kim, Jin
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2005, 18 (02) : 311 - 328
  • [2] A hot-deck multiple imputation procedure for gaps in longitudinal data on recurrent events
    Little, Roderick J.
    Yosef, Matheos
    Cain, Kevin C.
    Nan, Bin
    Harlow, Sioban D.
    [J]. STATISTICS IN MEDICINE, 2008, 27 (01) : 103 - 120
  • [3] Comparison of hot-deck and neural-network imputation
    Wilmot, CG
    Shivananjappa, S
    [J]. TRANSPORT SURVEY QUALITY AND INNOVATION, 2003, : 543 - 554
  • [4] Multiple hot-deck imputation for network inference from RNA sequencing data
    Imbert, Alyssa
    Valsesia, Armand
    Le Gall, Caroline
    Armenise, Claudia
    Lefebvre, Gregory
    Gourraud, Pierre-Antoine
    Viguerie, Nathalie
    Villa-Vialaneix, Nathalie
    [J]. BIOINFORMATICS, 2018, 34 (10) : 1726 - 1732
  • [5] A Hot-Deck Multiple Imputation Procedure for Gaps in Longitudinal Recurrent Event Histories
    Wang, Chia-Ning
    Little, Roderick
    Nan, Bin
    Harlow, Sioban D.
    [J]. BIOMETRICS, 2011, 67 (04) : 1573 - 1582
  • [6] FINDING A FLEXIBLE HOT-DECK IMPUTATION METHOD FOR MULTINOMIAL DATA
    Andridge, Rebecca
    Bechtel, Laura
    Thompson, Katherine Jenny
    [J]. JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2021, 9 (04) : 789 - 809
  • [7] Hot-deck imputation with SAS® arrays and macros for large surveys
    Stiller, J
    Dalzell, DR
    [J]. PROCEEDINGS OF THE TWENTY-THIRD ANNUAL SAS USERS GROUP INTERNATIONAL CONFERENCE, 1998, : 1378 - 1383
  • [8] Calibrated Hot-Deck Donor Imputation Subject to Edit Restrictions
    Coutinho, Wieger
    de Waal, Ton
    Shlomo, Natalie
    [J]. JOURNAL OF OFFICIAL STATISTICS, 2013, 29 (02) : 299 - 321
  • [9] Multiple imputation using an iterative hot-deck with distance-based donor selection
    Siddique, Juned
    Belin, Thomas R.
    [J]. STATISTICS IN MEDICINE, 2008, 27 (01) : 83 - 102
  • [10] Impacts of Fractional Hot-Deck Imputation on Learning and Prediction of Engineering Data
    Song, Ikkyun
    Yang, Yicheng
    Im, Jongho
    Tong, Tong
    Ceylan, Halil
    Cho, In Ho
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (12) : 2363 - 2373