Missing value imputation in longitudinal measures of alcohol consumption

被引:25
|
作者
Grittner, Ulrike [1 ]
Gmel, Gerhard [2 ,3 ,4 ,5 ]
Ripatti, Samuli [6 ]
Bloomfield, Kim [1 ,7 ]
Wicki, Matthias [2 ]
机构
[1] Charite, Inst Biometr & Clin Epidemiol, D-10098 Berlin, Germany
[2] Swiss Inst Prevent Alcohol & Drug Problems SIPA, Lausanne, Switzerland
[3] Univ Lausanne Hosp CHUV, Alcohol Treatment Ctr, Lausanne, Switzerland
[4] Ctr Addict & Mental Hlth, Toronto, ON, Canada
[5] Univ W England, Bristol BS16 1QY, Avon, England
[6] Univ Helsinki, Inst Mol Med Finland FIMM, Helsinki, Finland
[7] Aarhus Univ, Ctr Alcohol & Drug Res, Copenhagen, Denmark
基金
英国医学研究理事会;
关键词
panel surveys; missing data; multiple imputation; Bayesian models; alcohol consumption; MULTIPLE IMPUTATION; HOT-DECK; SPECIFICATION; STABILITY; ATTRITION; SELECTION; DISCRETE; DRINKING; OUTCOMES; MODEL;
D O I
10.1002/mpr.330
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Attrition in longitudinal studies can lead to biased results. The study is motivated by the unexpected observation that alcohol consumption decreased despite increased availability, which may be due to sample attrition of heavy drinkers. Several imputation methods have been proposed, but rarely compared in longitudinal studies of alcohol consumption. The imputation of consumption level measurements is computationally particularly challenging due to alcohol consumption being a semi-continuous variable (dichotomous drinking status and continuous volume among drinkers), and the non-normality of data in the continuous part. Data come from a longitudinal study in Denmark with four waves (2003-2006) and 1771 individuals at baseline. Five techniques for missing data are compared: Last value carried forward (LVCF) was used as a single, and Hotdeck, Heckman modelling, multivariate imputation by chained equations (MICE), and a Bayesian approach as multiple imputation methods. Predictive mean matching was used to account for non-normality, where instead of imputing regression estimates, "real" observed values from similar cases are imputed. Methods were also compared by means of a simulated dataset. The simulation showed that the Bayesian approach yielded the most unbiased estimates for imputation. The finding of no increase in consumption levels despite a higher availability remained unaltered. Copyright (C) 2011 John Wiley & Sons, Ltd.
引用
收藏
页码:50 / 61
页数:12
相关论文
共 50 条
  • [1] A data-driven missing value imputation approach for longitudinal datasets
    Ribeiro, Caio
    Freitas, Alex A.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (08) : 6277 - 6307
  • [2] A data-driven missing value imputation approach for longitudinal datasets
    Caio Ribeiro
    Alex A. Freitas
    [J]. Artificial Intelligence Review, 2021, 54 : 6277 - 6307
  • [3] Imputation of missing values in the case of a multiple item instrument measuring alcohol consumption
    Gmel, G
    [J]. STATISTICS IN MEDICINE, 2001, 20 (15) : 2369 - 2381
  • [4] Gaussian processes for missing value imputation
    Jafrasteh, Bahram
    Hernandez-Lobato, Daniel
    Lubian-Lopez, Simon Pedro
    Benavente-Fernandez, Isabel
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 273
  • [5] A hybrid method for missing value imputation
    Karanikola, Aikaterini
    Kotsiantis, Sotiris
    [J]. PROCEEDINGS OF THE 23RD PAN-HELLENIC CONFERENCE OF INFORMATICS (PCI 2019), 2019, : 74 - 79
  • [6] Missing value imputation for epistatic MAPs
    Ryan, Colm
    Greene, Derek
    Cagney, Gerard
    Cunningham, Padraig
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [7] Missing value imputation for epistatic MAPs
    Colm Ryan
    Derek Greene
    Gerard Cagney
    Pádraig Cunningham
    [J]. BMC Bioinformatics, 11
  • [8] DataWig: Missing value imputation for tables
    Bießmann, Felix
    Rukat, Tammo
    Schmidt, Phillipp
    Naidu, Prathik
    Schelter, Sebastian
    Taptunov, Andrey
    Lange, Dustin
    Salinas, David
    [J]. Journal of Machine Learning Research, 2019, 20
  • [9] Missing Value Imputation for Diabetes Prediction
    Luo, Fei
    Qian, Hangwei
    Wang, Di
    Guo, Xu
    Sun, Yan
    Lee, Eng Sing
    Teong, Hui Hwang
    Lai, Ray Tian Rui
    Miao, Chunyan
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] DataWig: Missing Value Imputation for Tables
    Biessmann, Felix
    Rukat, Tammo
    Schmidt, Phillipp
    Naidu, Prathik
    Schelter, Sebastian
    Taptunov, Andrey
    Lange, Dustin
    Salinas, David
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20