Missing Data Imputation for a Multivariate Outcome of Mixed Variable Types

被引:2
|
作者
Wang, Tuo [1 ]
Zilinskas, Rachel [2 ]
Li, Ying [3 ]
Qu, Yongming [3 ,4 ]
机构
[1] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
[2] Stat & Data Corp, Tempe, AZ USA
[3] Eli Lilly & Co, Dept Stat Data & Analyt, Indianapolis, IN USA
[4] Eli Lilly & Co, Lilly Corp Ctr, Dept Global Stat Sci, Indianapolis, IN 46285 USA
来源
关键词
Fully conditional specification; Missing at random; Piecewise hazard model; TO-EVENT DATA; SENSITIVITY-ANALYSIS; MULTIPLE IMPUTATION;
D O I
10.1080/19466315.2023.2169753
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Data collected in clinical trials are often composed of multiple types of variables. For example, laboratory measurements and vital signs are longitudinal data of continuous or categorical variables, adverse events may be recurrent events, and death is a time-to-event variable. Missing data due to patients' discontinuation from the study or as a result of handling intercurrent events using a hypothetical strategy almost always occur during any clinical trial. Imputing these data with mixed types of variables simultaneously is a challenge that has not been studied extensively. In this article, we propose using an approximate fully conditional specification to impute the missing data. Simulation shows the proposed method provides satisfactory results under the assumption of missing at random. Finally, real data from a clinical trial evaluating treatments for diabetes are analyzed to illustrate the potential benefit of the proposed method.
引用
收藏
页码:826 / 837
页数:12
相关论文
共 50 条
  • [1] Mixed intelligent- multivariate missing imputation
    Gibert, K.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2014, 91 (01) : 85 - 96
  • [2] Clustering and variable selection in the presence of mixed variable types and missing data
    Storlie, C. B.
    Myers, S. M.
    Katusic, S. K.
    Weaver, A. L.
    Voigt, R. G.
    Croarkin, P. E.
    Stoeckel, R. E.
    Port, J. D.
    [J]. STATISTICS IN MEDICINE, 2018, 37 (19) : 2884 - 2899
  • [3] Missing data imputation in multivariate data by evolutionary algorithms
    Figueroa Garcia, Juan C.
    Kalenatic, Dusko
    Lopez Bello, Cesar Amilcar
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2011, 27 (05) : 1468 - 1474
  • [4] A genetic algorithm for multivariate missing data imputation
    Carlos Figueroa-Garcia, Juan
    Neruda, Roman
    Hernandez-Perez, German
    [J]. INFORMATION SCIENCES, 2023, 619 : 947 - 967
  • [5] Variable selection in the presence of missing data: resampling and imputation
    Long, Qi
    Johnson, Brent A.
    [J]. BIOSTATISTICS, 2015, 16 (03) : 596 - 610
  • [6] Application of Sequential Regression Multivariate Imputation Method on Multivariate Normal Missing Data
    Nurzaman
    Siswantining, Titin
    Soemartojo, Saskya Mary
    Sarwinda, Devvi
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
  • [7] On multivariate imputation and forecasting of decadal wind speed missing data
    Wesonga, Ronald
    [J]. SPRINGERPLUS, 2015, 4
  • [8] Missing Data and Multiple Imputation in the Context of Multivariate Analysis of Variance
    Finch, W. Holmes
    [J]. JOURNAL OF EXPERIMENTAL EDUCATION, 2016, 84 (02): : 356 - 372
  • [10] Multivariate imputation of qualitative missing data using Bayesian networks
    Romero, V
    Salmerón, A
    [J]. SOFT METHODOLOGY AND RANDOM INFORMATION SYSTEMS, 2004, : 605 - 612