Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data

被引:0
|
作者
Yi Deng
Changgee Chang
Moges Seyoum Ido
Qi Long
机构
[1] Emory University,Department of Biostatistics and Bioinformatics
[2] Georgia Department of Public Health,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Multiple imputation (MI) has been widely used for handling missing data in biomedical research. In the presence of high-dimensional data, regularized regression has been used as a natural strategy for building imputation models, but limited research has been conducted for handling general missing data patterns where multiple variables have missing values. Using the idea of multiple imputation by chained equations (MICE), we investigate two approaches of using regularized regression to impute missing values of high-dimensional data that can handle general missing data patterns. We compare our MICE methods with several existing imputation methods in simulation studies. Our simulation results demonstrate the superiority of the proposed MICE approach based on an indirect use of regularized regression in terms of bias. We further illustrate the proposed methods using two data examples.
引用
收藏
相关论文
共 50 条
  • [31] High-dimensional large-scale mixed-type data imputation under missing at random
    Wei Liu
    Guizhen Li
    Ling Zhou
    Lan Luo
    [J]. Science China(Mathematics), 2025, 68 (04) : 969 - 1000
  • [32] The Pairwise Gaussian Random Field for High-Dimensional Data Imputation
    Cai, Zhuhua
    Jermaine, Christopher
    Vagena, Zografoula
    Logothetis, Dionysios
    Perez, Luis L.
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 61 - 70
  • [33] A method for learning a sparse classifier in the presence of missing data for high-dimensional biological datasets
    Severson, Kristen A.
    Monian, Brinda
    Love, J. Christopher
    Braatz, Richard D.
    [J]. BIOINFORMATICS, 2017, 33 (18) : 2897 - 2905
  • [34] Analysing Mark–Recapture–Recovery Data in the Presence of Missing Covariate Data Via Multiple Imputation
    Hannah Worthington
    Ruth King
    Stephen T. Buckland
    [J]. Journal of Agricultural, Biological, and Environmental Statistics, 2015, 20 : 28 - 46
  • [35] Flexible High-Dimensional Unsupervised Learning with Missing Data
    Wei, Yuhong
    Tang, Yang
    McNicholas, Paul D.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (03) : 610 - 621
  • [36] Online missing value imputation for high-dimensional mixed-type data via generalized factor models
    Liu, Wei
    Luo, Lan
    Zhou, Ling
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 187
  • [37] Multiple imputation for missing data: a brief introduction
    Baccini, Michela
    [J]. EPIDEMIOLOGIA & PREVENZIONE, 2008, 32 (03): : 162 - 163
  • [38] Multiple imputation for missing data - A cautionary tale
    Allison, PD
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2000, 28 (03) : 301 - 309
  • [39] Introduction to multiple imputation for dealing with missing data
    Lee, Katherine J.
    Simpson, Julie A.
    [J]. RESPIROLOGY, 2014, 19 (02) : 162 - 167
  • [40] The use of multiple imputation for the analysis of missing data
    Sinharay, S
    Stern, HS
    Russell, D
    [J]. PSYCHOLOGICAL METHODS, 2001, 6 (04) : 317 - 329