Synthesizing credit data using autoencoders and generative adversarial networks

被引:4
|
作者
Oreski, Goran [1 ]
机构
[1] Univ Pula, Fac Informat, Pula, Croatia
关键词
Autoencoders; Generative adversarial networks; Tabular data; Credit risk data; NEURAL-NETWORKS; ENSEMBLE; CLASSIFICATION; MACHINE;
D O I
10.1016/j.knosys.2023.110646
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
quality is an essential element necessary for the development of a successful machine-learning project. One of the biggest challenges in various real-world application domains is class imbalance. This paper proposes a new framework for oversampling credit data by combining two deep learning techniques: autoencoders and generative adversarial networks. A trivial autoencoder (TAE) is used to change data representation, and modified generative adversarial networks (GAN) are used to create new instances from random noise. The experiment on three different datasets demonstrates that the same classifier achieves a better area under the receiver operating characteristic curve (AUC) on datasets augmented by the proposed framework compared to datasets oversampled by other techniques. Additionally, the results show that datasets balanced by the new framework influence the classifier to change the prediction error types, significantly reducing false negatives; more expensive misclassification case in the imbalance learning. The improvements are significant, and considering the change in error distribution, the proposed technique is an excellent complement to existing oversampling techniques.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] A Survey of Generative Adversarial Networks for Synthesizing Structured Electronic Health Records
    Ghosheh, Ghadeer O.
    Li, Jin
    Zhu, Tingting
    ACM COMPUTING SURVEYS, 2024, 56 (06)
  • [42] A pore space reconstruction method of shale based on autoencoders and generative adversarial networks
    Zhang, Ting
    Li, Deya
    Lu, Fangfang
    COMPUTATIONAL GEOSCIENCES, 2021, 25 (06) : 2149 - 2165
  • [43] AFE-GAN: Synthesizing Electrocardiograms with Atrial Fibrillation Characteristics Using Generative Adversarial Networks
    Wang, Xianglong
    Sahiner, Berkman
    Scully, Christopher G.
    Cha, Kenny H.
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [44] Zero-day malware detection using transferred generative adversarial networks based on deep autoencoders
    Kim, Jin-Young
    Bu, Seok-Jun
    Cho, Sung-Bae
    INFORMATION SCIENCES, 2018, 460 : 83 - 102
  • [45] Detection of crack bar deterioration at offshore wind turbine supports using generative adversarial networks and autoencoders
    Prieto-Galarza, Ricardo
    Tutivén, Christian
    Vidal, Yolanda
    Journal of Physics: Conference Series, 2024, 2647 (18):
  • [46] Adaptive Traffic Data Augmentation Using Generative Adversarial Networks for Optical Networks
    Li, Shuai
    Li, Jin
    Zhang, Min
    Wang, Danshi
    Song, Chuang
    Zhen, Xinghua
    2019 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2019,
  • [47] Multiple imputation method of missing credit risk assessment data based on generative adversarial networks
    Zhao, Feng
    Lu, Yan
    Li, Xinning
    Wang, Lina
    Song, Yingjie
    Fan, Deming
    Zhang, Caiming
    Chen, Xiaobo
    APPLIED SOFT COMPUTING, 2022, 126
  • [48] A pore space reconstruction method of shale based on autoencoders and generative adversarial networks
    Ting Zhang
    Deya Li
    Fangfang Lu
    Computational Geosciences, 2021, 25 : 2149 - 2165
  • [49] Deep Generative Models for Image Generation: A Practical Comparison Between Variational Autoencoders and Generative Adversarial Networks
    El-Kaddoury, Mohamed
    Mahmoudi, Abdelhak
    Himmi, Mohammed Majid
    MOBILE, SECURE, AND PROGRAMMABLE NETWORKING, 2019, 11557 : 1 - 8
  • [50] On the adversarial robustness of generative autoencoders in the latent space
    Lu, Mingfei
    Chen, Badong
    NEURAL COMPUTING & APPLICATIONS, 2024, : 8109 - 8123