Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data

被引:0
|
作者
Yi Deng
Changgee Chang
Moges Seyoum Ido
Qi Long
机构
[1] Emory University,Department of Biostatistics and Bioinformatics
[2] Georgia Department of Public Health,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Multiple imputation (MI) has been widely used for handling missing data in biomedical research. In the presence of high-dimensional data, regularized regression has been used as a natural strategy for building imputation models, but limited research has been conducted for handling general missing data patterns where multiple variables have missing values. Using the idea of multiple imputation by chained equations (MICE), we investigate two approaches of using regularized regression to impute missing values of high-dimensional data that can handle general missing data patterns. We compare our MICE methods with several existing imputation methods in simulation studies. Our simulation results demonstrate the superiority of the proposed MICE approach based on an indirect use of regularized regression in terms of bias. We further illustrate the proposed methods using two data examples.
引用
收藏
相关论文
共 50 条
  • [1] Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data
    Deng, Yi
    Chang, Changgee
    Ido, Moges Seyoum
    Long, Qi
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [2] Multiple imputation in the presence of high-dimensional data
    Zhao, Yize
    Long, Qi
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (05) : 2021 - 2035
  • [3] Missing Data Imputation with High-Dimensional Data
    Brini, Alberto
    van den Heuvel, Edwin R.
    [J]. AMERICAN STATISTICIAN, 2024, 78 (02): : 240 - 252
  • [4] Bootstrap-multiple-imputation; high-dimensional model validation with missing data
    Chang, Billy
    Demetrashvili, Nino
    Kowgier, Matthew
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2011, 39 (02): : 202 - 204
  • [5] Multiple imputation with compatibility for high-dimensional data
    Zahid, Faisal Maqbool
    Faisal, Shahla
    Heumann, Christian
    [J]. PLOS ONE, 2021, 16 (07):
  • [6] Weighted multiple blockwise imputation method for high-dimensional regression with blockwise missing data
    Li, Jingmao
    Zhang, Qingzhao
    Chen, Song
    Fang, Kuangnan
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (03) : 459 - 474
  • [7] Missing value imputation in high-dimensional phenomic data: imputable or not, and how?
    Serena G Liao
    Yan Lin
    Dongwan D Kang
    Divay Chandra
    Jessica Bon
    Naftali Kaminski
    Frank C Sciurba
    George C Tseng
    [J]. BMC Bioinformatics, 15
  • [8] High-dimensional missing data imputation via undirected graphical model
    Lee, Yoonah
    Park, Seongoh
    [J]. STATISTICS AND COMPUTING, 2024, 34 (05)
  • [9] Missing value imputation in high-dimensional phenomic data: imputable or not, and how?
    Liao, Serena G.
    Lin, Yan
    Kang, Dongwan D.
    Chandra, Divay
    Bon, Jessica
    Kaminski, Naftali
    Sciurba, Frank C.
    Tseng, George C.
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [10] Multiple imputation and analysis for high-dimensional incomplete proteomics data
    Yin, Xiaoyan
    Levy, Daniel
    Willinger, Christine
    Adourian, Aram
    Larson, Martin G.
    [J]. STATISTICS IN MEDICINE, 2016, 35 (08) : 1315 - 1326