Collaborative-GAN: An Approach for Stabilizing the Training Process of Generative Adversarial Network

被引：1

作者：

Megahed, Mohammed ^{[1
]}

Mohammed, Ammar ^{[2
]}

机构：

[1] Cairo Univ, Fac Grad Studies Stat Res, Giza 12613, Egypt

[2] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Al Kharj 16278, Saudi Arabia

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Generators; Training; Generative adversarial networks; Transfer learning; Fuzzy logic; Propagation losses; Games; Generative adversarial network; transfer learning; training instability; mode collapse;

D O I：

10.1109/ACCESS.2024.3457902

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Generative Adversarial Network (GAN) outperforms its peers in the generative models' family and is widely used to generate realistic samples in various domains. The basic idea of GAN is a competition between two networks called a generator and discriminator. Throughout the training process of GAN, the two networks face various challenges that affect the quality and diversity of the generated samples of GAN. These challenges include training instability and mode collapse problem. Training instability happens due to the variance of the performance between the generator and discriminator. The mode collapse, on the other hand, happens when the generator is stuck to generate diverse samples. One of the promising techniques that might overcome these issues and increase the networks' performance is transfer learning between discriminators as same as generators. In this regard, the contribution of this paper is fourfold. First, it proposes a novel approach called Collaborative-GAN based on transfer learning to mitigate the training instability and tackle the mode collapse issues. In the proposed approach, the well-performer network transfers its learned weights to the low-performer ones based on a periodical evaluation during the training process. Second, the paper proposes a novel method to evaluate the discriminators' performance based on a fuzzy inference system. Third, the paper proposes a method to evaluate the generators' performance based on a series of detected FID scores that measure the diversity of the generated samples every certain intervals during the training process. We apply the proposed approach on two different architectures of GAN, which we called Single-GAN and Dual-GANs. In Single-GAN, the weights are transferred between the identical networks within the same GAN model. In Dual-GANs, on the other hand, the weights are transferred between identical networks across different GAN models. Thus, the paper introduces two types of transfer learning for GANs; inter and intra-transfer learning based on the paradigm of GAN architecture as a fourth contribution. We validate the proposed approach on three different benchmarks representing CelebA, Cifar-10, and Fashion-Mnist. The experimental results indicate that the proposed approach outperforms the state-of-the-art GAN models in terms of FID metric that measures the generated sample diversity. It is worth noting that the proposed approach achieved remarkable FID scores of 11.44, 24.19, and 11.21 on the Fashion-Mnist, Cifar-10, and CelebA datasets respectively.

引用

页码：138716 / 138735

页数：20

共 50 条

[1] On Stabilizing Generative Adversarial Training with Noise
Jenni, Simon
Favaro, Paolo
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12137 - 12145
[2] A Generative Adversarial Network (GAN) Fingerprint Approach Over LTE
Serreli, Luigi
Fadda, Mauro
Girau, Roberto
Ruiu, Pietro
Giusto, Daniele D.
Anedda, Matteo
IEEE ACCESS, 2024, 12 : 82083 - 82094
[3] DFS-GAN: stabilizing training of generative adversarial networks through discarding fake samples
Yang, Lianping
Sun, Hao
Zhang, Jian
Mo, Sijia
Jiang, Wuming
Zhang, Xiangde
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
[4] Stabilizing Training of Generative Adversarial Networks through Regularization
Roth, Kevin
Lucchi, Aurelien
Nowozin, Sebastian
Hofmann, Thomas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[5] Generative Adversarial Network (GAN) for Simulating Electroencephalography
Priyanshu Mahey
Nima Toussi
Grace Purnomu
Anthony Thomas Herdman
Brain Topography, 2023, 36 : 661 - 670
[6] Generative Adversarial Network (GAN) for Simulating Electroencephalography
Mahey, Priyanshu
Toussi, Nima
Purnomu, Grace
Herdman, Anthony Thomas
BRAIN TOPOGRAPHY, 2023, 36 (05) : 661 - 670
[7] Collaborative Generative Adversarial Network for Recommendation Systems
Tong, Yuzhen
Luo, Yadan
Zhang, Zheng
Sadiq, Shazia
Cui, Peng
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 161 - 168
[8] Digital brain phantoms by generative adversarial network (GAN)
Shao, Wenyi
Leung, Kevin
Rowe, Steven
Pomper, Martin
Du, Yong
JOURNAL OF NUCLEAR MEDICINE, 2021, 62
[9] Speech Enhancement Using Generative Adversarial Network (GAN)
Huq, Mahmudul
Maskeliunas, Rytis
HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 273 - 282
[10] Spatial Coevolution for Generative Adversarial Network Training
Hemberg E.
Toutouh J.
Al-Dujaili A.
Schmiedlechner T.
O'Reilly U.-M.
ACM Transactions on Evolutionary Learning and Optimization, 2021, 1 (02):

← 1 2 3 4 5 →