Unsupervised data imputation with multiple importance sampling variational autoencoders

被引:0
|
作者
Kuang, Shenfen [1 ]
Huang, Yewen [2 ]
Song, Jie [1 ]
机构
[1] Shaoguan Univ, Sch Math & Stat, Shaoguan 512005, Peoples R China
[2] Guangdong Polytech Normal Univ, Sch Elect & Informat, Guangzhou 510665, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
关键词
Missing data; Variational autoencoders; Multiple importance sampling; Resampling; MISSING DATA IMPUTATION;
D O I
10.1038/s41598-025-87641-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recently, deep latent variable models have made significant progress in dealing with missing data problems, benefiting from their ability to capture intricate and non-linear relationships within the data. In this work, we further investigate the potential of Variational Autoencoders (VAEs) in addressing the uncertainty associated with missing data via a multiple importance sampling strategy. We propose a Missing data Multiple Importance Sampling Variational Auto-Encoder (MMISVAE) method to effectively model incomplete data. Our approach consists of a learning step and an imputation step. During the learning step, the mixture components are represented by multiple separate encoder networks, which are later combined through simple averaging to enhance the latent representation capabilities of the VAEs when dealing with incomplete data. The statistical model and variational distributions are iteratively updated by maximizing the Multiple Importance Sampling Evidence Lower Bound (MISELBO) on the joint log-likelihood. In the imputation step, missing data is estimated using conditional expectation through multiple importance resampling. We propose an efficient imputation algorithm that broadens the scope of Missing data Importance Weighted Auto-Encoder (MIWAE) by incorporating multiple proposal probability distributions and the resampling schema. One notable characteristic of our method is the complete unsupervised nature of both the learning and imputation processes. Through comprehensive experimental analysis, we present evidence of the effectiveness of our method in improving the imputation accuracy of incomplete data when compared to current state-of-the-art VAEs-based methods.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Variational approximation for importance sampling
    Xiao Su
    Yuguo Chen
    Computational Statistics, 2021, 36 : 1901 - 1930
  • [22] Importance sampling as a variational approximation
    Nott, David J.
    Li Jialiang
    Fielding, Mark
    STATISTICS & PROBABILITY LETTERS, 2011, 81 (08) : 1052 - 1055
  • [23] MIDIA: exploring denoising autoencoders for missing data imputation
    Ma, Qian
    Lee, Wang-Chien
    Fu, Tao-Yang
    Gu, Yu
    Yu, Ge
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (06) : 1859 - 1897
  • [24] Variational approximation for importance sampling
    Su, Xiao
    Chen, Yuguo
    COMPUTATIONAL STATISTICS, 2021, 36 (03) : 1901 - 1930
  • [25] Parallel Variational Autoencoders for Multiple Responses Generation
    Li, Miaojin
    Fu, Peng
    Lin, Zheng
    Wang, Weiping
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 128 - 136
  • [26] Sparse Autoencoders for Unsupervised Netflow Data Classification
    Kozik, Rafal
    Pawlicki, Marek
    Choras, Michal
    IMAGE PROCESSING AND COMMUNICATIONS CHALLENGES 10, 2019, 892 : 192 - 199
  • [27] Unsupervised pathology detection in medical images using conditional variational autoencoders
    Hristina Uzunova
    Sandra Schultz
    Heinz Handels
    Jan Ehrhardt
    International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 451 - 461
  • [28] Robust Variational Autoencoders and Normalizing Flows for Unsupervised Network Anomaly Detection
    Najari, Naji
    Berlemont, Samuel
    Lefebvre, Gregoire
    Duffner, Stefan
    Garcia, Christophe
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 2, 2022, 450 : 281 - 292
  • [29] Unsupervised Linear and Nonlinear Channel Equalization and Decoding Using Variational Autoencoders
    Caciularu, Avi
    Burshtein, David
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2020, 6 (03) : 1003 - 1018
  • [30] Combination of Variational Autoencoders and Generative Adversarial Network into an Unsupervised Generative Model
    Almalki, Ali Jaber
    Wocjan, Pawel
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND APPLIED COGNITIVE COMPUTING, 2021, : 101 - 110