Evaluation of software for multiple imputation of semi-continuous data

被引:68
|
作者
Yu, L-M
Burton, Andrea
Rivero-Arias, Oliver
机构
[1] Canc Res UK NHS Ctr Stat Med, Oxford, England
[2] Univ Warwick, Warwick Clin Trials Unit, Coventry CV4 7AL, W Midlands, England
[3] Univ Oxford, Hlth Econ Res Ctr, Oxford OX1 2JD, England
关键词
D O I
10.1177/0962280206074464
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
It is now widely accepted that multiple imputation (MI) methods properly handle the uncertainty of missing data over single imputation methods. Several standard statistical software packages, such as SAS, R and STATA, have standard procedures or user-written programs to perform MI. The performance of these packages is generally acceptable for most types of data. However, it is unclear whether these applications are appropriate for imputing data with a large proportion of zero values resulting in a semi-continuous distribution. In addition, it is not clear whether the use of these applications is suitable when the distribution of the data needs to be preserved for Subsequent analysis. This article reports the findings of a simulation study carried out to evaluate the performance of the MI procedures for handling semi-continuous data within these statistical packages. Complete resource use data on 1060 participants from a large randomized clinical trial were used as the simulation population from which 500 bootstrap samples were obtained and missing data imposed. The findings of this study showed differences in the performance of the MI programs when imputing semi-continuous data. Caution should be exercised when deciding which program should perform MI on this type of data.
引用
收藏
页码:243 / 258
页数:16
相关论文
共 50 条
  • [1] Multiple imputation of semi-continuous exposure variables that are categorized for analysis
    Nguyen, Cattram D.
    Moreno-Betancur, Margarita
    Rodwell, Laura
    Romaniuk, Helena
    Carlin, John B.
    Lee, Katherine J.
    [J]. STATISTICS IN MEDICINE, 2021, 40 (27) : 6093 - 6106
  • [2] Transformations for semi-continuous data
    Shmueli, Galit
    Jank, Wolfgang
    Hyde, Valerie
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (08) : 4000 - 4020
  • [3] An imputation approach for fitting two-part mixed effects models for longitudinal semi-continuous data
    Choo-Wosoba, Hyoyoung
    Kundu, Debamita
    Albert, Paul S.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (11) : 3351 - 3361
  • [4] Computability on continuous, lower semi-continuous and upper semi-continuous real functions
    Weihrauch, K
    Zheng, XH
    [J]. THEORETICAL COMPUTER SCIENCE, 2000, 234 (1-2) : 109 - 133
  • [5] Integral inclusions of upper semi-continuous or lower semi-continuous type
    ORegan, D
    [J]. PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 1996, 124 (08) : 2391 - 2399
  • [6] CONTINUOUS AND SEMI-CONTINUOUS UTILITY
    RICHTER, MK
    [J]. INTERNATIONAL ECONOMIC REVIEW, 1980, 21 (02) : 293 - 299
  • [7] SEMI-CONTINUOUS MULTIFUNCTIONS
    POPA, V
    [J]. REVUE ROUMAINE DE MATHEMATIQUES PURES ET APPLIQUEES, 1982, 27 (07): : 807 - 815
  • [8] A semi-parametric Bayesian model for semi-continuous longitudinal data
    Ren, Junting
    Tapert, Susan
    Fan, Chun Chieh
    Thompson, Wesley K.
    [J]. STATISTICS IN MEDICINE, 2022, 41 (13) : 2354 - 2374
  • [9] Multiple edit/multiple imputation for multivariate continuous data
    Ghosh-Dastidar, B
    Schafer, JL
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (464) : 807 - 817
  • [10] SEMI-CONTINUOUS MAPPINGS
    NOIRI, T
    [J]. ATTI DELLA ACCADEMIA NAZIONALE DEI LINCEI RENDICONTI-CLASSE DI SCIENZE FISICHE-MATEMATICHE & NATURALI, 1974, 54 (02): : 210 - 214