GAIN: Missing Data Imputation using Generative Adversarial Nets

被引:0
|
作者
Yoon, Jinsung [1 ]
Jordon, James [2 ]
van der Schaar, Mihaela [1 ,2 ,3 ]
机构
[1] Univ Calif Los Angeles, Los Angeles, CA 90095 USA
[2] Univ Oxford, Oxford, England
[3] Alan Turing Inst, London, England
关键词
MULTIPLE IMPUTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel method for imputing missing data by adapting the well-known Generative Adversarial Nets (GAN) framework. Accordingly, we call our method Generative Adversarial Imputation Nets (GAIN). The generator (G) observes some components of a real data vector, imputes the missing components conditioned on what is actually observed, and outputs a completed vector. The discriminator (D) then takes a completed vector and attempts to determine which components were actually observed and which were imputed. To ensure that D forces G to learn the desired distribution, we provide D with some additional information in the form of a hint vector. The hint reveals to D partial information about the missingness of the original sample, which is used by D to focus its attention on the imputation quality of particular components. This hint ensures that G does in fact learn to generate according to the true data distribution. We tested our method on various datasets and found that GAIN significantly outperforms state-of-the-art imputation methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] LFM-D2GAIN: An Improved Missing Data Imputation Method Based on Generative Adversarial Imputation Nets
    Shen, Yebai
    Zhang, Chao
    Zhang, Songyu
    Yan, Jinghua
    Bu, Fanliang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 447 - 453
  • [2] Federated conditional generative adversarial nets imputation method for air quality missing data
    Zhou, Xu
    Liu, Xiaofeng
    Lan, Gongjin
    Wu, Jian
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [3] Improved generative adversarial imputation networks for missing data
    Qin, Xiwen
    Shi, Hongyu
    Dong, Xiaogang
    Zhang, Siqi
    Yuan, Liping
    [J]. APPLIED INTELLIGENCE, 2024, 54 (21) : 11068 - 11082
  • [4] A systematic review of generative adversarial imputation network in missing data imputation
    Yuqing Zhang
    Runtong Zhang
    Butian Zhao
    [J]. Neural Computing and Applications, 2023, 35 : 19685 - 19705
  • [5] A systematic review of generative adversarial imputation network in missing data imputation
    Zhang, Yuqing
    Zhang, Runtong
    Zhao, Butian
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19685 - 19705
  • [6] FragmGAN: generative adversarial nets for fragmentary data imputation and prediction
    Fang, Fang
    Bao, Shenliao
    [J]. STATISTICAL THEORY AND RELATED FIELDS, 2024, 8 (01) : 15 - 28
  • [7] Imputation of missing data with class imbalance using conditional generative adversarial networks
    Awan, Saqib Ejaz
    Bennamoun, Mohammed
    Sohel, Ferdous
    Sanfilippo, Frank
    Dwivedi, Girish
    [J]. NEUROCOMPUTING, 2021, 453 : 164 - 171
  • [8] GAGIN: generative adversarial guider imputation network for missing data
    Wei Wang
    Yimeng Chai
    Yue Li
    [J]. Neural Computing and Applications, 2022, 34 : 7597 - 7610
  • [9] GAGIN: generative adversarial guider imputation network for missing data
    Wang, Wei
    Chai, Yimeng
    Li, Yue
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (10): : 7597 - 7610
  • [10] Ensemble Generative Adversarial Imputation Network with Selective Multi-Generator (ESM-GAIN) for Missing Data Imputation
    Li, Yuxuan
    Dogan, Ayse
    Liu, Chenang
    [J]. 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2022, : 807 - 812