Multiple imputation of semi-continuous exposure variables that are categorized for analysis

被引:3
|
作者
Nguyen, Cattram D. [1 ,2 ]
Moreno-Betancur, Margarita [1 ,2 ]
Rodwell, Laura [1 ,2 ,3 ]
Romaniuk, Helena [4 ]
Carlin, John B. [1 ,2 ]
Lee, Katherine J. [1 ,2 ]
机构
[1] Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Parkville, Vic, Australia
[2] Univ Melbourne, Fac Med Dent & Hlth Sci, Dept Paediat, Melbourne, Vic, Australia
[3] Radboud Univ Nijmegen, Radboud Inst Hlth Sci, Dept Hlth Evidence, Med Ctr, Nijmegen, Netherlands
[4] Deakin Univ, Fac Hlth, Biostat Unit, Geelong, Vic, Australia
基金
澳大利亚研究理事会; 英国医学研究理事会;
关键词
missing data; multiple imputation; ordinal categorical variable; semi-continuous; zero inflated data; FULLY CONDITIONAL SPECIFICATION; CHAINED EQUATIONS; OUTCOMES;
D O I
10.1002/sim.9172
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Semi-continuous variables are characterized by a point mass at one value and a continuous range of values for remaining observations. An example is alcohol consumption quantity, with a spike of zeros representing non-drinkers and positive values for drinkers. If multiple imputation is used to handle missing values for semi-continuous variables, it is unclear how this should be implemented within the standard approaches of fully conditional specification (FCS) and multivariate normal imputation (MVNI). This question is brought into focus by the use of categorized versions of semi-continuous exposure variables in analyses (eg, no drinking, drinking below binge level, binge drinking, heavy binge drinking), raising the question of how best to achieve congeniality between imputation and analysis models. We performed a simulation study comparing nine approaches for imputing semi-continuous exposures requiring categorization for analysis. Three methods imputed the categories directly: ordinal logistic regression, and imputation of binary indicator variables representing the categories using MVNI (with two variants). Six methods (predictive mean matching, zero-inflated binomial imputation, and two-part imputation methods with variants in FCS and MVNI) imputed the semi-continuous variable, with categories derived after imputation. The ordinal and zero-inflated binomial methods had good performance across most scenarios, while MVNI methods requiring rounding after imputation did not perform well. There were mixed results for predictive mean matching and the two-part methods, depending on whether the estimands were proportions or regression coefficients. The results highlight the need to consider the parameter of interest when selecting an imputation procedure.
引用
收藏
页码:6093 / 6106
页数:14
相关论文
共 50 条
  • [1] Evaluation of software for multiple imputation of semi-continuous data
    Yu, L-M
    Burton, Andrea
    Rivero-Arias, Oliver
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (03) : 243 - 258
  • [2] Multiple imputation for continuous variables using a Bayesian principal component analysis
    Audigier, Vincent
    Husson, Francois
    Josse, Julie
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (11) : 2140 - 2156
  • [3] Multiple Imputation for Multilevel Data with Continuous and Binary Variables
    Audigier, Vincent
    White, Ian R.
    Jolani, Shahab
    Debray, Thomas P. A.
    Quartagno, Matteo
    Carpenter, James
    van Buuren, Stef
    Resche-Rigon, Matthieu
    [J]. STATISTICAL SCIENCE, 2018, 33 (02) : 160 - 183
  • [4] Computability on continuous, lower semi-continuous and upper semi-continuous real functions
    Weihrauch, K
    Zheng, XH
    [J]. THEORETICAL COMPUTER SCIENCE, 2000, 234 (1-2) : 109 - 133
  • [5] Integral inclusions of upper semi-continuous or lower semi-continuous type
    ORegan, D
    [J]. PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 1996, 124 (08) : 2391 - 2399
  • [6] CONTINUOUS AND SEMI-CONTINUOUS UTILITY
    RICHTER, MK
    [J]. INTERNATIONAL ECONOMIC REVIEW, 1980, 21 (02) : 293 - 299
  • [7] Propensity score analysis for a semi-continuous exposure variable: a study of gestational alcohol exposure and childhood cognition
    Hocagil, Tugba Akkaya
    Cook, Richard J.
    Jacobson, Sandra W.
    Jacobson, Joseph L.
    Ryan, Louise M.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2021, 184 (04) : 1390 - 1413
  • [8] SEMI-CONTINUOUS MULTIFUNCTIONS
    POPA, V
    [J]. REVUE ROUMAINE DE MATHEMATIQUES PURES ET APPLIQUEES, 1982, 27 (07): : 807 - 815
  • [9] Recent Advances in Mathematical Programming with Semi-continuous Variables and Cardinality Constraint
    Sun X.
    Zheng X.
    Li D.
    [J]. Sun, X. (xls@fudan.edu.cn), 1600, Springer Science and Business Media Deutschland GmbH (01): : 55 - 77
  • [10] SEMI-CONTINUOUS MAPPINGS
    NOIRI, T
    [J]. ATTI DELLA ACCADEMIA NAZIONALE DEI LINCEI RENDICONTI-CLASSE DI SCIENZE FISICHE-MATEMATICHE & NATURALI, 1974, 54 (02): : 210 - 214