Multiple imputation of baseline data in the cardiovascular health study

被引:94
|
作者
Arnold, AM [1 ]
Kronmal, RA [1 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
关键词
biometry; epidemiologic methods; imputation; missing data; regression analysis;
D O I
10.1093/aje/kwf156
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Most epidemiologic studies will encounter missing covariate data. Software packages typically used for analyzing data delete any cases with a missing covariate to perform a complete case analysis. The deletion of cases complicates variable selection when different variables are missing on different cases, reduces power, and creates the potential for bias in the resulting estimates. Recently, software has become available for producing multiple imputations of missing data that account for the between-imputation variability. The implementation of the software to impute missing baseline data in the setting of the Cardiovascular Health Study, a large, observational study, is described. Results of exploratory analyses using the imputed data were largely consistent with results using only complete cases, even in a situation where one third of the cases were excluded from the complete case analysis. There were few differences in the exploratory results across three imputations, and the combined results from the multiple imputations were very similar to results from a single imputation. An increase in power was evident and variable selection simplified when using the imputed data sets.
引用
收藏
页码:74 / 84
页数:11
相关论文
共 50 条
  • [1] Results Differ by Applying Distinctive Multiple Imputation Approaches on the Longitudinal Cardiovascular Health Study Data
    Ning, Yuming
    McAvay, Gail
    Chaudhry, Sarwat I.
    Arnold, Alice M.
    Allore, Heather G.
    [J]. EXPERIMENTAL AGING RESEARCH, 2013, 39 (01) : 27 - 43
  • [2] Multiple imputation for analysis of incomplete data in distributed health data networks
    Changgee Chang
    Yi Deng
    Xiaoqian Jiang
    Qi Long
    [J]. Nature Communications, 11
  • [3] Multiple imputation for analysis of incomplete data in distributed health data networks
    Chang, Changgee
    Deng, Yi
    Jiang, Xiaoqian
    Long, Qi
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [4] Multiple Imputation of Missing Data in Longitudinal Electronic Health Records
    Petersen, Irene
    Welch, Catherine
    Bartlett, Jonathan
    Morris, Richard
    Walters, Kate
    Nazareth, Irwin
    Marston, Louise
    White, Ian
    Carpenter, James
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2013, 22 : 302 - 302
  • [5] Longitudinal Wealth Data and Multiple Imputation An Evaluation Study
    Westermeier, Christian
    Grabka, Markus M.
    [J]. SURVEY RESEARCH METHODS, 2016, 10 (03): : 237 - 252
  • [6] Multiple Imputation With Large Data Sets: A Case Study of the Children's Mental Health Initiative
    Stuart, Elizabeth A.
    Azur, Melissa
    Frangakis, Constantine
    Leaf, Philip
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2009, 169 (09) : 1133 - 1139
  • [7] Multiple imputation and synthetic data
    Kim, Joungyoun
    Park, Min-Jeong
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (01) : 83 - 97
  • [8] Missing Data and Multiple Imputation
    Cummings, Peter
    [J]. JAMA PEDIATRICS, 2013, 167 (07) : 656 - 661
  • [9] Multiple imputation for missing data
    Patrician, PA
    [J]. RESEARCH IN NURSING & HEALTH, 2002, 25 (01) : 76 - 84
  • [10] Multiple imputation of missing data
    Lydersen, Stian
    [J]. TIDSSKRIFT FOR DEN NORSKE LAEGEFORENING, 2022, 142 (02) : 151 - 151