A comparison of multiple imputation strategies to deal with missing nonnormal data in structural equation modeling

被引:7
|
作者
Jia, Fan [1 ]
Wu, Wei [2 ]
机构
[1] Univ Calif Merced, Psychol Sci, 5200 N Lake Rd, Merced, CA 95343 USA
[2] Indiana Univ Purdue Univ, Indianapolis, IN 46202 USA
关键词
Missing data; Nonnormality; Multiple imputation; Full information maximum likelihood; Predictive mean matching; Classification and regression trees; Random forest; INFORMATION MAXIMUM-LIKELIHOOD; FULLY CONDITIONAL SPECIFICATION; COVARIANCE STRUCTURE-ANALYSIS; 2-STAGE APPROACH; STANDARD ERRORS; TEST STATISTICS; PERFORMANCE; ROBUSTNESS; SAMPLE; IMPUTE;
D O I
10.3758/s13428-022-01936-y
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
Missing data and nonnormality are two common factors that can affect analysis results from structural equation modeling (SEM). The current study aims to address a challenging situation in which the two factors coexist (i.e., missing nonnormal data). Using Monte Carlo simulation, we evaluated the performance of four multiple imputation (MI) strategies with respect to parameter and standard error estimation. These strategies include MI with normality-based model (MI-NORM), predictive mean matching (MI-PMM), classification and regression trees (MI-CART), and random forest (MI-RF). We also compared these MI strategies with robust full information maximum likelihood (RFIML), a popular (non-imputation) method to deal with missing nonnormal data in SEM. The results suggest that MI-NORM had similar performance to RFIML. MI-PMM outperformed the other methods when data were not missing on the heavy tail of a skewed distribution. Although MI-CART and MI-RF do not require any distribution assumption, they did not perform well compared with the others. Based on the results, practical guidance is provided.
引用
收藏
页码:3100 / 3119
页数:20
相关论文
共 50 条
  • [41] Multiple imputation for missing data: a brief introduction
    Baccini, Michela
    [J]. EPIDEMIOLOGIA & PREVENZIONE, 2008, 32 (03): : 162 - 163
  • [42] Multiple imputation for missing data - A cautionary tale
    Allison, PD
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2000, 28 (03) : 301 - 309
  • [43] Introduction to multiple imputation for dealing with missing data
    Lee, Katherine J.
    Simpson, Julie A.
    [J]. RESPIROLOGY, 2014, 19 (02) : 162 - 167
  • [44] The use of multiple imputation for the analysis of missing data
    Sinharay, S
    Stern, HS
    Russell, D
    [J]. PSYCHOLOGICAL METHODS, 2001, 6 (04) : 317 - 329
  • [45] Multiple imputation of ordinal missing not at random data
    Hammon, Angelina
    [J]. ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2023, 107 (04) : 671 - 692
  • [46] Regression multiple imputation for missing data analysis
    Yu, Lili
    Liu, Liang
    Peace, Karl E.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (09) : 2647 - 2664
  • [47] Multiple imputation of ordinal missing not at random data
    Angelina Hammon
    [J]. AStA Advances in Statistical Analysis, 2023, 107 : 671 - 692
  • [48] A comparison of imputation techniques for handling missing data
    Musil, CM
    Warner, CB
    Yobas, PK
    Jones, SL
    [J]. WESTERN JOURNAL OF NURSING RESEARCH, 2002, 24 (07) : 815 - 829
  • [49] Imputation of missing longitudinal data: a comparison of methods
    Engels, JM
    Diehr, P
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (10) : 968 - 976
  • [50] Missing traffic data: comparison of imputation methods
    Li, Yuebiao
    Li, Zhiheng
    Li, Li
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2014, 8 (01) : 51 - 57