Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data

Cited by: 0
Authors
Yi Deng
Changgee Chang
Moges Seyoum Ido
Qi Long
Affiliations
[1] Emory University, Department of Biostatistics and Bioinformatics
[2] Georgia Department of Public Health
DOI: not available
Abstract
Multiple imputation (MI) has been widely used for handling missing data in biomedical research. In the presence of high-dimensional data, regularized regression has been used as a natural strategy for building imputation models, but limited research has been conducted on handling general missing data patterns, where multiple variables have missing values. Using the idea of multiple imputation by chained equations (MICE), we investigate two approaches to using regularized regression to impute missing values of high-dimensional data that can handle general missing data patterns. We compare our MICE methods with several existing imputation methods in simulation studies. Our simulation results demonstrate the superiority, in terms of bias, of the proposed MICE approach based on an indirect use of regularized regression. We further illustrate the proposed methods using two data examples.
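As a rough illustration of the chained-equations idea mentioned in the abstract, the following is a minimal sketch and not the authors' method. It assumes ridge regression as a stand-in for whichever regularized regressions the paper studies, uses residual-noise draws rather than proper posterior draws, and the function name `chained_ridge_impute` and its parameters (`lam`, `n_iter`, `seed`) are hypothetical.

```python
import numpy as np

def chained_ridge_impute(X, lam=1.0, n_iter=10, seed=0):
    """Return one stochastic imputation of X (NaN = missing).

    Assumption: ridge regression stands in for the regularized
    regression; this is an illustrative sketch, not the paper's algorithm.
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    miss = np.isnan(X)
    filled = X.copy()
    # Initialize missing entries with the observed column means.
    col_means = np.nanmean(X, axis=0)
    filled[miss] = np.take(col_means, np.where(miss)[1])
    n, p = X.shape
    for _ in range(n_iter):
        # Cycle over every variable that has missing values.
        for j in np.where(miss.any(axis=0))[0]:
            obs = ~miss[:, j]
            others = np.delete(np.arange(p), j)
            # Design matrix: intercept plus all other (currently filled) variables.
            A = np.column_stack([np.ones(n), filled[:, others]])
            # Ridge penalty on the slopes only, not on the intercept.
            P = lam * np.eye(p)
            P[0, 0] = 0.0
            beta = np.linalg.solve(A[obs].T @ A[obs] + P, A[obs].T @ filled[obs, j])
            resid = filled[obs, j] - A[obs] @ beta
            sigma = resid.std(ddof=1) if obs.sum() > 1 else 0.0
            # Replace missing entries with prediction plus residual noise.
            filled[~obs, j] = A[~obs] @ beta + rng.normal(0.0, sigma, size=(~obs).sum())
    return filled

# Synthetic example with more variables than observations (p > n).
rng = np.random.default_rng(1)
X_full = rng.normal(size=(40, 60))
X_miss = X_full.copy()
X_miss[rng.random(X_full.shape) < 0.1] = np.nan

# Five imputed data sets, one per seed.
imputations = [chained_ridge_impute(X_miss, lam=5.0, n_iter=5, seed=s) for s in range(5)]
```

Each call yields one completed data set. In a full MI analysis, the analysis model would be fitted on each completed data set and the estimates pooled, a step this sketch leaves out.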
Related papers
50 in total
  • [41] Introduction to multiple imputation for dealing with missing data
    Lee, Katherine J.
    Simpson, Julie A.
    [J]. RESPIROLOGY, 2014, 19 (02) : 162 - 167
  • [42] Regression multiple imputation for missing data analysis
    Yu, Lili
    Liu, Liang
    Peace, Karl E.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (09) : 2647 - 2664
  • [43] Multiple imputation of ordinal missing not at random data
    Angelina Hammon
    [J]. AStA Advances in Statistical Analysis, 2023, 107 : 671 - 692
  • [44] Analysing Mark-Recapture-Recovery Data in the Presence of Missing Covariate Data Via Multiple Imputation
    Worthington, Hannah
    King, Ruth
    Buckland, Stephen T.
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2015, 20 (01) : 28 - 46
  • [45] Variable selection in the presence of missing data: resampling and imputation
    Long, Qi
    Johnson, Brent A.
    [J]. BIOSTATISTICS, 2015, 16 (03) : 596 - 610
  • [46] High-Dimensional Matched Subspace Detection When Data are Missing
    Balzano, Laura
    Recht, Benjamin
    Nowak, Robert
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2010, : 1638 - 1642
  • [47] High-dimensional variable selection in regression and classification with missing data
    Gao, Qi
    Lee, Thomas C. M.
    [J]. SIGNAL PROCESSING, 2017, 131 : 1 - 7
  • [48] A General Spatiotemporal Imputation Framework for Missing Sensor Data
    Tharzeen, Aahila
    Munikoti, Sai
    Prakash, Punit
    Kim, Jungkwun
    Natarajan, Balasubramaniam
    [J]. 2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 55 - 58
  • [49] An Efficient Multiple Imputation Approach for Estimating Equations with Response Missing at Random and High-Dimensional Covariates
    Lei Wang
    Siying Sun
    Zheng Xia
    [J]. Journal of Systems Science and Complexity, 2021, 34 : 440 - 464
  • [50] Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems
    Dai, Zongyu
    Bu, Zhiqi
    Long, Qi
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 791 - 798