Donor Limited Hot Deck Imputation: A Constrained Optimization Problem

被引:1
|
作者
Joenssen, Dieter William [1 ]
机构
[1] Ilmenau Univ Technol, Helmholtzpl 3, D-98693 Ilmenau, Germany
关键词
MISSING DATA;
D O I
10.1007/978-3-662-44983-7_28
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Hot deck methods impute missing data by matching records that are complete to those that are missing values. Observations absent within the recipient are then replaced by replicating the values from the matched donor. Some hot deck procedures constrain the frequency with which any donor may be matched to increase the precision of post-imputation parameter-estimates. This constraint, called a donor limit, also mitigates risks of exclusively using one donor for all imputations or using one donor with an extreme value or values "too often." Despite these desirable properties, imputation results of a donor limited hot deck are dependent on the recipients' order of imputation, an undesirable property. For nearest neighbor type hot deck procedures, the implementation of a constraint on donor usage causes the stepwise matching between each recipient and its closest donor to no longer minimize the sum of all donor-recipient distances. Thus, imputation results may further be improved by procedures that minimize the total donor-recipient distance-sum. The discrete optimization problem is formulated and a simulation detailing possible improvements when solving this integer program is presented.
引用
收藏
页码:319 / 328
页数:10
相关论文
共 50 条
  • [1] Calibrated Hot-Deck Donor Imputation Subject to Edit Restrictions
    Coutinho, Wieger
    de Waal, Ton
    Shlomo, Natalie
    [J]. JOURNAL OF OFFICIAL STATISTICS, 2013, 29 (02) : 299 - 321
  • [2] Fractional hot deck imputation
    Kim, JK
    Fuller, W
    [J]. BIOMETRIKA, 2004, 91 (03) : 559 - 578
  • [3] On Limiting Donor Usage for Imputation of Missing Data via Hot Deck Methods
    Bankhofer, Udo
    Joenssen, Dieter William
    [J]. DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 3 - 11
  • [4] Regression fractional hot deck imputation
    Kim, Jae Kwang
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2007, 36 (03) : 423 - 434
  • [5] The Use of Sample Weights in Hot Deck Imputation
    Andridge, Rebecca R.
    Little, Roderick J.
    [J]. JOURNAL OF OFFICIAL STATISTICS, 2009, 25 (01) : 21 - 36
  • [6] Multiple imputation using an iterative hot-deck with distance-based donor selection
    Siddique, Juned
    Belin, Thomas R.
    [J]. STATISTICS IN MEDICINE, 2008, 27 (01) : 83 - 102
  • [7] Fractional Regression Hot Deck Imputation Weight Adjustment
    Paik, Minhui
    Larsen, Michael D.
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2013, 42 (07) : 1514 - 1532
  • [8] FHDI: An R Package for Fractional Hot Deck Imputation
    Im, Jongho
    Cho, In Ho
    Kim, Jae Kwang
    [J]. R JOURNAL, 2018, 10 (01): : 140 - 154
  • [9] DATA-ANALYSIS USING HOT DECK MULTIPLE IMPUTATION
    REILLY, M
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1993, 42 (03) : 307 - 313
  • [10] Empirical likelihood-based hot deck imputation methods
    Xue, Yijie
    Lazar, Nicole A.
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2012, 24 (03) : 629 - 646