Collaborative-GAN: An Approach for Stabilizing the Training Process of Generative Adversarial Network

Cited by: 1
Authors
Megahed, Mohammed [1 ]
Mohammed, Ammar [2 ]
Affiliations
[1] Cairo Univ, Fac Grad Studies Stat Res, Giza 12613, Egypt
[2] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Al Kharj 16278, Saudi Arabia
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Generators; Training; Generative adversarial networks; Transfer learning; Fuzzy logic; Propagation losses; Games; Generative adversarial network; transfer learning; training instability; mode collapse;
DOI
10.1109/ACCESS.2024.3457902
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Generative Adversarial Networks (GANs) outperform their peers in the generative model family and are widely used to generate realistic samples in various domains. The basic idea of a GAN is a competition between two networks, a generator and a discriminator. Throughout training, the two networks face challenges that affect the quality and diversity of the generated samples, notably training instability and mode collapse. Training instability arises from the difference in performance between the generator and the discriminator. Mode collapse, on the other hand, occurs when the generator fails to produce diverse samples. One promising technique that might overcome these issues and improve the networks' performance is transfer learning between discriminators as well as between generators. In this regard, the contribution of this paper is fourfold. First, it proposes a novel approach called Collaborative-GAN, based on transfer learning, to mitigate training instability and tackle mode collapse. In the proposed approach, the better-performing network transfers its learned weights to the lower-performing ones based on a periodic evaluation during training. Second, the paper proposes a novel method to evaluate the discriminators' performance based on a fuzzy inference system. Third, the paper proposes a method to evaluate the generators' performance based on a series of FID scores, which measure the diversity of the generated samples, computed at regular intervals during training. We apply the proposed approach to two GAN architectures, which we call Single-GAN and Dual-GANs. In Single-GAN, the weights are transferred between identical networks within the same GAN model. In Dual-GANs, on the other hand, the weights are transferred between identical networks across different GAN models. Thus, as a fourth contribution, the paper introduces two types of transfer learning for GANs, inter- and intra-transfer learning, depending on the GAN architecture. We validate the proposed approach on three benchmarks: CelebA, CIFAR-10, and Fashion-MNIST. The experimental results indicate that the proposed approach outperforms state-of-the-art GAN models in terms of the FID metric, which measures the diversity of the generated samples. The proposed approach achieved FID scores of 11.44, 24.19, and 11.21 on the Fashion-MNIST, CIFAR-10, and CelebA datasets, respectively.
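The periodic weight transfer described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the dict-based weight representation, the function names `transfer_from_best` and `train_with_periodic_transfer`, and the single scalar score per network are all assumptions made for illustration. In the paper, generators are scored with a series of FID values and discriminators with a fuzzy inference system; here any "lower is better" score stands in for both.

```python
import copy


def transfer_from_best(weights, scores, lower_is_better=True):
    """Copy the best-scoring network's weights into every other network.

    weights: list of per-network weight dicts (hypothetical representation).
    scores:  one evaluation score per network, e.g. an FID value per generator.
    Returns the index of the network used as the transfer source.
    """
    pick = min if lower_is_better else max
    best = pick(range(len(scores)), key=lambda i: scores[i])
    for i in range(len(weights)):
        if i != best:
            # Deep copy so later updates to one network do not alias another.
            weights[i] = copy.deepcopy(weights[best])
    return best


def train_with_periodic_transfer(weights, train_step, evaluate, steps, interval):
    """Train all networks, running a weight transfer every `interval` steps.

    train_step and evaluate are caller-supplied (hypothetical) callbacks:
    train_step(w) updates one network in place, evaluate(w) returns its score.
    """
    for step in range(1, steps + 1):
        for w in weights:
            train_step(w)
        if step % interval == 0:
            scores = [evaluate(w) for w in weights]
            transfer_from_best(weights, scores)
    return weights
```

In a real GAN the weight dicts would be the state of identical generators (or identical discriminators), either within one model or across two models, and the evaluation callback would be considerably more expensive than the toy score assumed here.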
Pages: 138716-138735
Page count: 20