Multiple imputation in the presence of non-normal data

被引:52
|
作者
Lee, Katherine J. [1 ,2 ]
Carlin, John B. [1 ,2 ]
机构
[1] Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Flemington Rd, Melbourne, Vic, Australia
[2] Univ Melbourne, Dept Paediat, Melbourne, Vic, Australia
基金
英国医学研究理事会;
关键词
multiple imputation; missing data; non-normal data; transformation; predictive mean matching;
D O I
10.1002/sim.7173
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multiple imputation (MI) is becoming increasingly popular for handling missing data. Standard approaches for MI assume normality for continuous variables (conditionally on the other variables in the imputation model). However, it is unclear how to impute non-normally distributed continuous variables. Using simulation and a case study, we compared various transformations applied prior to imputation, including a novel non-parametric transformation, to imputation on the raw scale and using predictive mean matching (PMM) when imputing non-normal data. We generated data from a range of non-normal distributions, and set 50% to missing completely at random or missing at random. We then imputed missing values on the raw scale, following a zero-skewness log, Box-Cox or non-parametric transformation and using PMM with both type 1 and 2 matching. We compared inferences regarding the marginal mean of the incomplete variable and the association with a fully observed outcome. We also compared results from these approaches in the analysis of depression and anxiety symptoms in parents of very preterm compared with term-born infants. The results provide novel empirical evidence that the decision regarding how to impute a non-normal variable should be based on the nature of the relationship between the variables of interest. If the relationship is linear in the untransformed scale, transformation can introduce bias irrespective of the transformation used. However, if the relationship is non-linear, it may be important to transform the variable to accurately capture this relationship. A useful alternative is to impute the variable using PMM with type 1 matching. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:606 / 617
页数:12
相关论文
共 50 条
  • [41] Mortgage default decisions in the presence of non-normal, spatially dependent disturbances
    Calabrese, Raffaella
    McCollum, Meagan
    Pace, R. Kelley
    REGIONAL SCIENCE AND URBAN ECONOMICS, 2019, 76 : 103 - 114
  • [42] Efficiency of multiple imputation to test for association in the presence of missing data
    Pascal Croiseau
    Claire Bardel
    Emmanuelle Génin
    BMC Proceedings, 1 (Suppl 1)
  • [43] Multiple Imputation for Missing Data: Fully Conditional Specification Versus Multivariate Normal Imputation
    Lee, Katherine J.
    Carlin, John B.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 171 (05) : 624 - 632
  • [44] A comparison of two methods for transforming non-normal manufacturing data
    Chung, S. H.
    Pearn, W. L.
    Yang, Y. S.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 31 (9-10): : 957 - 968
  • [45] Transforming non-normal data to normality in statistical process control
    Chou, YM
    Polansky, AM
    Mason, RL
    JOURNAL OF QUALITY TECHNOLOGY, 1998, 30 (02) : 133 - 141
  • [46] Using Johnson Curves to Describe Non-normal Process Data
    Farnum, N. R.
    Quality Engineering, 9 (02):
  • [47] Modelling an energy market with Bayesian networks for non-normal data
    Vincenzina Vitale
    Flaminia Musella
    Paola Vicard
    Valentina Guizzi
    Computational Management Science, 2020, 17 : 47 - 64
  • [48] Expansions for the risk of Stein type estimates for non-normal data
    Withers, Christopher S.
    Nadarajah, Saralees
    STATISTICS & RISK MODELING, 2011, 28 (02) : 81 - 95
  • [49] A comparison of two methods for transforming non-normal manufacturing data
    S. H. Chung
    W. L. Pearn
    Y. S. Yang
    The International Journal of Advanced Manufacturing Technology, 2007, 31 : 957 - 968
  • [50] Non-normal Data Simulation using Piecewise Linear Transforms
    Foldnes, Njal
    Gronneberg, Steffen
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2022, 29 (01) : 36 - 46