Multiple Imputation for Incomplete Data in Environmental Epidemiology Research

被引:13
|
作者
Allotey, Prince Addo [1 ]
Harel, Ofer [1 ]
机构
[1] Univ Connecticut, Coll Liberal Arts & Sci, Dept Stat, 215 Glenbrook Rd Unit, Storrs, CT 06269 USA
关键词
Complete case analysis; Complete data; Missing data; Multiple imputation; Traditional statistical methods; Spontaneous abortion; FULLY CONDITIONAL SPECIFICATION; SMALL-SAMPLE DEGREES; MISSING-DATA; CHAINED EQUATIONS; FREEDOM; VALUES; IMPLEMENTATION;
D O I
10.1007/s40572-019-00230-y
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose of ReviewIncomplete data are a common problem in statistical analysis of environmental epidemiological research. However, many researchers still ignore this complication. We evaluate the performance of two commonly used multiple imputation (MI) methods (fully conditional specification and multivariate normal) for handling missing data and compare them to complete case analysis (CCA) method. We further discuss issues that arise when these methods are being used.Recent FindingsMI is a simulation-based approach to deal with incomplete data. In general, MI will perform better then ad hoc techniques such as CCA. MI is an approach which replaces the missing data with plausible values and allows for additional uncertainty due to the missing information caused by the incomplete data. To illustrate this, we use data of 944 women from the Collaborative Perinatal Project and compare estimates between these methods. The goal is to examine if each of two outcomes, birth-weight and spontaneous abortion, in the data set are associated with mothers' smoking status during pregnancy adjusting for baseline covariates in the model.SummaryResults indicate that MI is better suited for handling incomplete data and led to a significant improvement in parameter estimates compared to CCA. The two MI methods produced similar point estimates, but slightly different standard errors.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 50 条
  • [31] Special issue: Incomplete data: multiple imputation and model-based analysis
    van Buuren, S
    Eisinga, R
    STATISTICA NEERLANDICA, 2003, 57 (01) : 1 - 2
  • [32] Multiple imputation for high-dimensional mixed incomplete continuous and binary data
    He, Ren
    Belin, Thomas
    STATISTICS IN MEDICINE, 2014, 33 (13) : 2251 - 2262
  • [33] Missing Data in Clinical Research: A Tutorial on Multiple Imputation
    Austin, Peter C.
    White, Ian R.
    Lee, Douglas S.
    van Buuren, Stef
    CANADIAN JOURNAL OF CARDIOLOGY, 2021, 37 (09) : 1322 - 1331
  • [34] A Simulation Study Comparing Multiple Imputation Methods for Incomplete Longitudinal Ordinal Data
    Donneau, A. F.
    Mauer, M.
    Molenberghs, G.
    Albert, A.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (05) : 1311 - 1338
  • [35] Missing data and multiple imputation in clinical epidemiological research
    Pedersen, Alma B.
    Mikkelsen, Ellen M.
    Cronin-Fenton, Deirdre
    Kristensen, Nickolaj R.
    Tra My Pham
    Pedersen, Lars
    Petersen, Irene
    CLINICAL EPIDEMIOLOGY, 2017, 9 : 157 - 165
  • [36] Handling missing data in nursing research with multiple imputation
    Kneipp, SM
    McIntosh, M
    NURSING RESEARCH, 2001, 50 (06) : 384 - 389
  • [37] Incomplete clustering analysis via multiple imputation
    Lee, Jung Wun
    Harel, Ofer
    JOURNAL OF APPLIED STATISTICS, 2023, 50 (09) : 1962 - 1979
  • [38] Multiple imputation for the analysis of incomplete compound variables
    Zhao, Jiwei
    Cook, Richard J.
    Wu, Changbao
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2015, 43 (02): : 240 - 264
  • [39] Multivariable data imputation for the analysis of incomplete credit data
    Lan, Qiujun
    Xu, Xuqing
    Ma, Haojie
    Li, Gang
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 141 (141)
  • [40] Erratum to: Assessment of predictive performance in incomplete data by combining internal validation and multiple imputation
    Simone Wahl
    Anne-Laure Boulesteix
    Astrid Zierer
    Barbara Thorand
    Mark A. van de Wiel
    BMC Medical Research Methodology, 16