Synthesizing credit data using autoencoders and generative adversarial networks

被引:4
|
作者
Oreski, Goran [1 ]
机构
[1] Univ Pula, Fac Informat, Pula, Croatia
关键词
Autoencoders; Generative adversarial networks; Tabular data; Credit risk data; NEURAL-NETWORKS; ENSEMBLE; CLASSIFICATION; MACHINE;
D O I
10.1016/j.knosys.2023.110646
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
quality is an essential element necessary for the development of a successful machine-learning project. One of the biggest challenges in various real-world application domains is class imbalance. This paper proposes a new framework for oversampling credit data by combining two deep learning techniques: autoencoders and generative adversarial networks. A trivial autoencoder (TAE) is used to change data representation, and modified generative adversarial networks (GAN) are used to create new instances from random noise. The experiment on three different datasets demonstrates that the same classifier achieves a better area under the receiver operating characteristic curve (AUC) on datasets augmented by the proposed framework compared to datasets oversampled by other techniques. Additionally, the results show that datasets balanced by the new framework influence the classifier to change the prediction error types, significantly reducing false negatives; more expensive misclassification case in the imbalance learning. The improvements are significant, and considering the change in error distribution, the proposed technique is an excellent complement to existing oversampling techniques.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] An overview of biological data generation using generative adversarial networks
    Liu, Lin
    Xia, Yujing
    Tang, Lin
    2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 141 - 144
  • [32] Realistic Data Synthesis Using Enhanced Generative Adversarial Networks
    Baowaly, Mrinal Kanti
    Liu, Chao-Lin
    Chen, Kuan-Ta
    2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, : 289 - 292
  • [33] Efficient Approaches for Data Augmentation by Using Generative Adversarial Networks
    Saha, Pretom Kumar
    Logofatu, Doina
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2022, 2022, 1600 : 386 - 399
  • [34] Geolocated Data Generation and Protection Using Generative Adversarial Networks
    Alatrista-Salas, Hugo
    Montalvo-Garcia, Peter
    Nunez-del-Prado, Miguel
    Salas, Julián
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13408 LNAI : 80 - 91
  • [35] Data Augmentation for Voiceprint Recognition Using Generative Adversarial Networks
    Lin, Yao-San
    Chen, Hung-Yu
    Huang, Mei-Ling
    Hsieh, Tsung-Yu
    ALGORITHMS, 2024, 17 (12)
  • [36] Geolocated Data Generation and Protection Using Generative Adversarial Networks
    Alatrista-Salas, Hugo
    Montalvo-Garcia, Peter
    Nunez-del-Prado, Miguel
    Salas, Julian
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2022, 2022, 13408 : 80 - 91
  • [37] Conditional Generative Adversarial Networks with Adversarial Attack and Defense for Generative Data Augmentation
    Baek, Francis
    Kim, Daeho
    Park, Somin
    Kim, Hyoungkwan
    Lee, SangHyun
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2022, 36 (03)
  • [38] Credit default swap prediction based on generative adversarial networks
    Lin, Shu-Ying
    Liu, Duen-Ren
    Huang, Hsien-Pin
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (05) : 720 - 740
  • [39] Using generative adversarial networks for improving classification effectiveness in credit card fraud detection
    Fiore, Ugo
    De Santis, Alfredo
    Perla, Francesca
    Zanetti, Paolo
    Palmieri, Francesco
    INFORMATION SCIENCES, 2019, 479 : 448 - 455
  • [40] A comprehensive survey on generative adversarial networks used for synthesizing multimedia content
    Kumar, Lalit
    Singh, Dushyant Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 40585 - 40624