Multiple Imputation for Incomplete Data in Environmental Epidemiology Research

被引:12
|
作者
Allotey, Prince Addo [1 ]
Harel, Ofer [1 ]
机构
[1] Univ Connecticut, Coll Liberal Arts & Sci, Dept Stat, 215 Glenbrook Rd Unit, Storrs, CT 06269 USA
关键词
Complete case analysis; Complete data; Missing data; Multiple imputation; Traditional statistical methods; Spontaneous abortion; FULLY CONDITIONAL SPECIFICATION; SMALL-SAMPLE DEGREES; MISSING-DATA; CHAINED EQUATIONS; FREEDOM; VALUES; IMPLEMENTATION;
D O I
10.1007/s40572-019-00230-y
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose of ReviewIncomplete data are a common problem in statistical analysis of environmental epidemiological research. However, many researchers still ignore this complication. We evaluate the performance of two commonly used multiple imputation (MI) methods (fully conditional specification and multivariate normal) for handling missing data and compare them to complete case analysis (CCA) method. We further discuss issues that arise when these methods are being used.Recent FindingsMI is a simulation-based approach to deal with incomplete data. In general, MI will perform better then ad hoc techniques such as CCA. MI is an approach which replaces the missing data with plausible values and allows for additional uncertainty due to the missing information caused by the incomplete data. To illustrate this, we use data of 944 women from the Collaborative Perinatal Project and compare estimates between these methods. The goal is to examine if each of two outcomes, birth-weight and spontaneous abortion, in the data set are associated with mothers' smoking status during pregnancy adjusting for baseline covariates in the model.SummaryResults indicate that MI is better suited for handling incomplete data and led to a significant improvement in parameter estimates compared to CCA. The two MI methods produced similar point estimates, but slightly different standard errors.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 50 条
  • [1] Multiple Imputation for Incomplete Data in Environmental Epidemiology Research
    Prince Addo Allotey
    Ofer Harel
    [J]. Current Environmental Health Reports, 2019, 6 : 62 - 71
  • [2] Using multiple imputation for analysis of incomplete data in clinical research
    McCleary, L
    [J]. NURSING RESEARCH, 2002, 51 (05) : 339 - 343
  • [3] Multiple imputation for incomplete data with semicontinuous variables
    Javaras, KN
    Van Dyk, DA
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (463) : 703 - 715
  • [4] A multiple imputation strategy for incomplete longitudinal data
    Landrum, MB
    Becker, MP
    [J]. STATISTICS IN MEDICINE, 2001, 20 (17-18) : 2741 - 2760
  • [5] Multiple Imputation for Incomplete Data in Epidemiologic Studies
    Harel, Ofer
    Mitchell, Emily M.
    Perkins, Neil J.
    Cole, Stephen R.
    Tchetgen, Eric J. Tchetgen
    Sun, BaoLuo
    Schisterman, Enrique F.
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2018, 187 (03) : 576 - 584
  • [6] Multiple Imputation and Genetic Programming for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    [J]. PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 521 - 528
  • [7] Multiple Imputation and Ensemble Learning for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    Lam Thu Bui
    [J]. INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2016, 2017, 8 : 401 - 415
  • [8] A functional multiple imputation approach to incomplete longitudinal data
    He, Yulei
    Yucel, Recai
    Raghunathan, Trivellore E.
    [J]. STATISTICS IN MEDICINE, 2011, 30 (10) : 1137 - 1156
  • [9] Multiple imputation for analysis of incomplete data in distributed health data networks
    Changgee Chang
    Yi Deng
    Xiaoqian Jiang
    Qi Long
    [J]. Nature Communications, 11
  • [10] Multiple imputation for analysis of incomplete data in distributed health data networks
    Chang, Changgee
    Deng, Yi
    Jiang, Xiaoqian
    Long, Qi
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)