Multiple Imputation in a Longitudinal Cohort Study: A Case Study of Sensitivity to Imputation Methods

被引:20
|
作者
Romaniuk, Helena [1 ,2 ,3 ]
Patton, George C. [1 ,3 ,4 ]
Carlin, John B. [2 ,3 ,5 ]
机构
[1] Murdoch Childrens Res Inst, Parkville, Vic 3052, Australia
[2] Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Parkville, Vic 3052, Australia
[3] Univ Melbourne, Dept Paediat, Parkville, Vic 3052, Australia
[4] Royal Childrens Hosp, Ctr Adolescent Hlth, Parkville, Vic 3052, Australia
[5] Univ Melbourne, Ctr Epidemiol & Biostat, Melbourne Sch Populat & Global Hlth, Melbourne, Vic, Australia
基金
澳大利亚国家健康与医学研究理事会;
关键词
longitudinal cohort study; missing data; multiple imputation; sensitivity analysis; MISSING-DATA; CANNABIS USE; MENTAL-HEALTH; SUBSTANCE USE; STRATEGIES; DISORDERS; EQUATIONS; SMOKING;
D O I
10.1093/aje/kwu224
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Multiple imputation has entered mainstream practice for the analysis of incomplete data. We have used it extensively in a large Australian longitudinal cohort study, the Victorian Adolescent Health Cohort Study (1992-2008). Although we have endeavored to follow best practices, there is little published advice on this, and we have not previously examined the extent to which variations in our approach might lead to different results. Here, we examined sensitivity of analytical results to imputation decisions, investigating choice of imputation method, inclusion of auxiliary variables, omission of cases with excessive missing data, and approaches for imputing highly skewed continuous distributions that are analyzed as dichotomous variables. Overall, we found that decisions made about imputation approach had a discernible but rarely dramatic impact for some types of estimates. For model-based estimates of association, the choice of imputation method and decisions made to build the imputation model had little effect on results, whereas estimates of overall prevalence and prevalence stratified by subgroup were more sensitive to imputation method and settings. Multiple imputation by chained equations gave more plausible results than multivariate normal imputation for prevalence estimates but appeared to be more susceptible to numerical instability related to a highly skewed variable.
引用
收藏
页码:920 / 932
页数:13
相关论文
共 50 条
  • [1] Multiple imputation for missing data in a longitudinal cohort study: a tutorial based on a detailed case study involving imputation of missing outcome data
    Lee, Katherine J.
    Roberts, Gehan
    Doyle, Lex W.
    Anderson, Peter J.
    Carlin, John B.
    [J]. INTERNATIONAL JOURNAL OF SOCIAL RESEARCH METHODOLOGY, 2016, 19 (05) : 575 - 591
  • [2] Multiple imputation methods: a case study of daily gold price
    Alrawajfi, Ala
    Ismail, Mohd Tahir
    Al Wadi, Sadam
    Atiewi, Saleh
    Awajan, Ahmad
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [3] Multiple Imputation in the Context of Case-Cohort Studies: Simulation and Case Study
    Middleton, Melissa
    Moreno-Betancur, Margarita
    Carlin, John
    Lee, Katherine J.
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2021, 50 : 155 - 155
  • [4] A Simulation Study Comparing Multiple Imputation Methods for Incomplete Longitudinal Ordinal Data
    Donneau, A. F.
    Mauer, M.
    Molenberghs, G.
    Albert, A.
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (05) : 1311 - 1338
  • [5] Longitudinal Wealth Data and Multiple Imputation An Evaluation Study
    Westermeier, Christian
    Grabka, Markus M.
    [J]. SURVEY RESEARCH METHODS, 2016, 10 (03): : 237 - 252
  • [6] A CASE-STUDY ON THE USE OF MULTIPLE IMPUTATION
    FREEDMAN, VA
    WOLF, DA
    [J]. DEMOGRAPHY, 1995, 32 (03) : 459 - 470
  • [7] Multiple imputation methods for longitudinal blood pressure measurements from the Framingham Heart Study
    Kang, T
    Kraft, P
    Gauderman, WJ
    Thomas, D
    [J]. BMC GENETICS, 2003, 4 (Suppl 1)
  • [8] Multiple imputation methods for longitudinal blood pressure measurements from the Framingham Heart Study
    Terri Kang
    Peter Kraft
    W James Gauderman
    Duncan Thomas
    [J]. BMC Genetics, 4
  • [9] Model checking in multiple imputation: An overview and case study
    Nguyen C.D.
    Carlin J.B.
    Lee K.J.
    [J]. Emerging Themes in Epidemiology, 14 (1):
  • [10] Multiple imputation analysis of case-cohort studies
    Marti, Helena
    Chavance, Michel
    [J]. STATISTICS IN MEDICINE, 2011, 30 (13) : 1595 - 1607