Synthesizing credit data using autoencoders and generative adversarial networks

Cited by: 4

Authors:

Oreski, Goran [1]

Affiliations:

[1] Univ Pula, Fac Informat, Pula, Croatia

Source:

KNOWLEDGE-BASED SYSTEMS | 2023 / Vol. 274

Keywords:

Autoencoders; Generative adversarial networks; Tabular data; Credit risk data; NEURAL-NETWORKS; ENSEMBLE; CLASSIFICATION; MACHINE;

DOI:

10.1016/j.knosys.2023.110646

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Data quality is an essential element in the development of a successful machine-learning project. One of the biggest challenges in various real-world application domains is class imbalance. This paper proposes a new framework for oversampling credit data by combining two deep learning techniques: autoencoders and generative adversarial networks. A trivial autoencoder (TAE) is used to change the data representation, and modified generative adversarial networks (GAN) are used to create new instances from random noise. Experiments on three different datasets demonstrate that the same classifier achieves a better area under the receiver operating characteristic curve (AUC) on datasets augmented by the proposed framework than on datasets oversampled by other techniques. Additionally, the results show that datasets balanced by the new framework lead the classifier to change its prediction error types, significantly reducing false negatives, the more expensive misclassification case in imbalanced learning. The improvements are significant, and considering the change in error distribution, the proposed technique is an excellent complement to existing oversampling techniques. © 2023 Elsevier B.V. All rights reserved.
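The abstract reports its results through two quantities: AUC and the number of false negatives. The sketch below illustrates how those two quantities can be computed for a binary classifier. It is not the paper's code: the function names `roc_auc` and `count_errors`, the 0.5 decision threshold, and the toy labels and scores are all assumptions made here for illustration.

```python
from typing import Sequence, Tuple


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    A tie between a positive and a negative counts as half a win
    (the Mann-Whitney U formulation of AUC).
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def count_errors(
    labels: Sequence[int], scores: Sequence[float], threshold: float = 0.5
) -> Tuple[int, int]:
    """Return (false_negatives, false_positives) at the given threshold."""
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return fn, fp


# Hypothetical toy data: 1 = default (minority class), 0 = non-default.
labels = [0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.10, 0.20, 0.30, 0.35, 0.40, 0.60, 0.45, 0.80]

print(roc_auc(labels, scores))
print(count_errors(labels, scores))
```

AUC is independent of the decision threshold, whereas the false-negative count depends on it, so two oversampling methods can be compared on both ranking quality and the mix of error types.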


Pages: 12


