A joint learning method for incomplete and imbalanced data in electronic health record based on generative adversarial networks

Cited by: 2
Authors
Weng, Xutao [1]
Song, Hong [1]
Lin, Yucong [2]
Wu, You [3]
Zhang, Xi [1]
Liu, Bowen [3]
Yang, Jian [2]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
[3] Beijing Inst Technol, Sch Med Technol, Beijing 100081, Peoples R China
Keywords
Electronic health records; Generative adversarial networks; Imbalanced learning; Missing values imputation; MISSING DATA; IMPUTATION; CLASSIFICATION;
DOI
10.1016/j.compbiomed.2023.107687
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification codes
07; 0710; 09;
Abstract
Electronic health records (EHR) present challenges of incomplete and imbalanced data in clinical prediction. Previous studies addressed these two issues separately, in two steps, which degraded the performance of prediction tasks. In this paper, we propose a unified framework that simultaneously addresses the challenges of incomplete and imbalanced data in EHR. Based on this framework, we develop a model called Missing Value Imputation and Imbalanced Learning Generative Adversarial Network (MVIIL-GAN). We use MVIIL-GAN to jointly learn the imputation of data with high missing rates and the conditional generation of EHR data. The joint learning is achieved by introducing two discriminators that distinguish real data from generated data at the sample level and the variable level. MVIIL-GAN integrates missing value imputation and data generation in one step, improving the consistency of parameter optimization and the performance of prediction tasks. We evaluate our framework on the public dataset MIMIC-IV, which has high missing rates and imbalanced classes. Experimental results show that MVIIL-GAN outperforms existing methods in prediction performance. The implementation of MVIIL-GAN can be found at https://github.com/Peroxidess/MVIIL-GAN.
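The abstract mentions two ideas that a small example may make more concrete: filling in missing entries with generated values, and scoring the result both per variable and per sample. The NumPy sketch below is only an illustration under stated assumptions. It is not the authors' implementation (see the repository linked in the abstract for that). The mask-based combination, the linear "discriminators", and every name in it (`impute_with_mask`, `variable_level_scores`, `sample_level_scores`, `w_var`, `w_sample`) are hypothetical, and the random matrix `generated` merely stands in for the output of a trained generator.

```python
import numpy as np

rng = np.random.default_rng(0)


def impute_with_mask(x, mask, generated):
    """Keep observed entries (mask == 1); fill missing ones from `generated`."""
    # nan_to_num turns NaN into 0 so the masked-out terms contribute nothing.
    return mask * np.nan_to_num(x) + (1.0 - mask) * generated


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def variable_level_scores(x_hat, w_var):
    """Hypothetical variable-level scorer: one score in (0, 1) per entry."""
    return sigmoid(x_hat @ w_var)  # shape (n_samples, n_features)


def sample_level_scores(x_hat, labels, w_sample):
    """Hypothetical sample-level scorer: one score per row, given its class label."""
    inputs = np.concatenate([x_hat, labels[:, None]], axis=1)
    return sigmoid(inputs @ w_sample)  # shape (n_samples,)


# Toy records with missing entries (NaN) and an imbalanced binary label.
x = np.array([[0.2, np.nan, 0.7],
              [np.nan, 0.5, 0.1]])
mask = (~np.isnan(x)).astype(float)      # 1 = observed, 0 = missing
labels = np.array([1.0, 0.0])

generated = rng.uniform(size=x.shape)    # stand-in for a generator's output
x_hat = impute_with_mask(x, mask, generated)

w_var = rng.normal(size=(3, 3))          # untrained, illustrative weights
w_sample = rng.normal(size=4)
var_scores = variable_level_scores(x_hat, w_var)
sample_scores = sample_level_scores(x_hat, labels, w_sample)
```

In an adversarial setup, scores like these would feed the discriminator and generator losses. Training, the real network architectures, and the conditional sampling of minority-class records are all omitted here.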
Pages: 12
Related papers
50 in total
  • [31] Boosting Deep Learning Risk Prediction with Generative Adversarial Networks for Electronic Health Records
    Che, Zhengping
    Cheng, Yu
    Zha, Shuangfei
    Sun, Zhaonan
    Liu, Yan
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017: 787-792
  • [32] Machinery fault diagnosis with imbalanced data using deep generative adversarial networks
    Zhang, Wei
    Li, Xiang
    Jia, Xiao-Dong
    Ma, Hui
    Luo, Zhong
    Li, Xu
    MEASUREMENT, 2020, 152
  • [33] Enhanced generative adversarial networks for fault diagnosis of rotating machinery with imbalanced data
    Li, Qi
    Chen, Liang
    Shen, Changqing
    Yang, Bingru
    Zhu, Zhongkui
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2019, 30 (11)
  • [34] Zero-Shot Learning with Joint Generative Adversarial Networks
    Zhang, Minwan
    Wang, Xiaohua
    Shi, Yueting
    Ren, Shiwei
    Wang, Weijiang
    ELECTRONICS, 2023, 12 (10)
  • [35] A machine learning method for incomplete and imbalanced medical data
    Salman, Issam
    Vomlel, Jiri
    PROCEEDINGS OF THE 20TH CZECH-JAPAN SEMINAR ON DATA ANALYSIS AND DECISION MAKING UNDER UNCERTAINTY, 2017: 188-195
  • [36] TMG-GAN: Generative Adversarial Networks-Based Imbalanced Learning for Network Intrusion Detection
    Ding, Hongwei
    Sun, Yu
    Huang, Nana
    Shen, Zhidong
    Cui, Xiaohui
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19: 1156-1167
  • [37] Tackling Class-imbalanced Learning Issues Based on Local Neighborhood Information and Generative Adversarial Networks
    Chen, Chien-Chih
    Lin, Yao-San
    Chen, Hung-Yu
    SENSORS AND MATERIALS, 2024, 63 (11): 4835-4847
  • [38] Data Synthesis based on Generative Adversarial Networks
    Park, Noseong
    Mohammadi, Mahmoud
    Gorde, Kshitij
    Jajodia, Sushil
    Park, Hongkyu
    Kim, Youngmin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (10): 1071-1083
  • [39] Continuous missing data imputation with incomplete dataset by generative adversarial networks-based unsupervised learning for long-term bridge health monitoring
    Jiang, Huachen
    Wan, Chunfeng
    Yang, Kang
    Ding, Youliang
    Xue, Songtao
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2022, 21 (03): 1093-1109
  • [40] A new generative adversarial network based imbalanced fault diagnosis method
    Li, Menglei
    Zou, Dacheng
    Luo, Shuyang
    Zhou, Qi
    Cao, Longchao
    Liu, Huaping
    MEASUREMENT, 2022, 194