Obtaining Predictions from Models Fit to Multiply Imputed Data

被引:19
|
作者
Miles, Andrew [1 ]
机构
[1] Univ Toronto, Sociol, William G Davis Bldg,Room DV-3217, Mississauga, ON L5L 1C6, Canada
关键词
multiple imputation; prediction; missing data; linear transformation; nonlinear transformation; IMPUTATION;
D O I
10.1177/0049124115610345
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Obtaining predictions from regression models fit to multiply imputed data can be challenging because treatments of multiple imputation seldom give clear guidance on how predictions can be calculated, and because available software often does not have built-in routines for performing the necessary calculations. This research note reviews how predictions can be obtained using Rubin's rules, that is, by being estimated separately in each imputed data set and then combined. It then demonstrates that predictions can also be calculated directly from the final analysis model. Both approaches yield identical results when predictions rely solely on linear transformations of the coefficients and calculate standard errors using the delta method and diverge only slightly when using nonlinear transformations. However, calculation from the final model is faster, easier to implement, and generates predictions with a clearer relationship to model coefficients. These principles are illustrated using data from the General Social Survey and with a simulation.
引用
收藏
页码:175 / 185
页数:11
相关论文
共 50 条
  • [1] Assessing the Fit of Structural Equation Models With Multiply Imputed Data
    Enders, Craig K.
    Mansolf, Maxwell
    [J]. PSYCHOLOGICAL METHODS, 2018, 23 (01) : 76 - 93
  • [2] Evaluating Close Fit in Ordinal Factor Analysis Models With Multiply Imputed Data
    Shi, Dexin
    Zhang, Bo
    Liu, Ren
    Jiang, Zhehan
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2024, 84 (01) : 171 - 189
  • [3] Multiply Imputed Synthetic Data: Evaluation of Hierarchical Bayesian Imputation Models
    Graham, Patrick
    Young, Jim
    Penny, Richard
    [J]. JOURNAL OF OFFICIAL STATISTICS, 2009, 25 (02) : 245 - 268
  • [4] Validation of prediction models based on lasso regression with multiply imputed data
    Musoro, Jammbe Z.
    Zwinderman, Aeilko H.
    Puhan, Milo A.
    ter Riet, Gerben
    Geskus, Ronald B.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2014, 14
  • [5] Validation of prediction models based on lasso regression with multiply imputed data
    Jammbe Z Musoro
    Aeilko H Zwinderman
    Milo A Puhan
    Gerben ter Riet
    Ronald B Geskus
    [J]. BMC Medical Research Methodology, 14
  • [6] Power calculation in multiply imputed data
    Ruochen Zha
    Ofer Harel
    [J]. Statistical Papers, 2021, 62 : 533 - 559
  • [7] Power calculation in multiply imputed data
    Zha, Ruochen
    Harel, Ofer
    [J]. STATISTICAL PAPERS, 2021, 62 (01) : 533 - 559
  • [8] Analysis of Variance of Multiply Imputed Data
    van Ginkel, Joost R.
    Kroonenberg, Pieter M.
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2014, 49 (01) : 78 - 91
  • [9] The estimation and use of predictions for the assessment of model performance using large samples with multiply imputed data
    Wood, Angela M.
    Royston, Patrick
    White, Ian R.
    [J]. BIOMETRICAL JOURNAL, 2015, 57 (04) : 614 - 632
  • [10] Order selection tests with multiply imputed data
    Consentino, Fabrizio
    Claeskens, Gerda
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (10) : 2284 - 2295