DuCaGAN: Unified Dual Capsule Generative Adversarial Network for Unsupervised Image-to-Image Translation

Cited by: 9
Authors
Shao, Guifang [1, 2]
Huang, Meng [1, 2]
Gao, Fengqiang [1, 2, 3]
Liu, Tundong [1, 2]
Li, Liduan [1, 2]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[2] Xiamen Key Lab Big Data Intelligent Anal & Decis, Xiamen 361005, Peoples R China
[3] Xiamen Univ, Sch Informat Sci & Technol, Tan Kah Kee Coll, Zhangzhou, Peoples R China
Keywords
Generative adversarial networks; Convolution; Generators; Gallium nitride; Industries; Computer vision; Computational modeling; Image translation; generative adversarial network; capsule network; adversarial loss; data augmentation;
DOI
10.1109/ACCESS.2020.3007266
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the advent of the generative adversarial network (GAN), image-to-image translation based on a new unified framework has attracted growing interest. As a new technique, it can synthesize images for various requirements in both computer vision and image processing. However, the cycle-consistent structure adopted in some common models, such as the cycle generative adversarial network (CycleGAN), is usually unable to learn richer image features. In this work, we developed a novel GAN-based model, named dual capsule generative adversarial network (DuCaGAN), by exploiting the viewpoint invariance and rotation equivariance that characterize capsule networks. First, two capsule networks were introduced into the traditional CycleGAN model as discriminators to form our proposed model with six agents. To improve feature-capturing performance, we modified the full objective by combining the margin loss with the original adversarial loss. Furthermore, the routing algorithm in the capsule network was optimized by changing its compression function. Finally, experimental results on conventional visual tasks with paired and unpaired datasets demonstrated the superiority and effectiveness of the proposed approach compared with both the deep convolutional generative adversarial network (DCGAN) and CycleGAN methods. More importantly, the proposed DuCaGAN was applied for the first time to augment surface defect data from a real industrial setting, and exhibited better performance than existing methods.
Pages: 154691 - 154707
Number of pages: 17
Related papers
50 in total
  • [1] Unsupervised Generative Adversarial Network for Plantar Pressure Image-to-Image Translation
    Ahmadian, Mona
    Beheshti, Mohammad T. H.
    Kalhor, Ahmad
    Shirian, Amir
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2580 - 2583
  • [2] Unsupervised image-to-image translation with multiscale attention generative adversarial network
    Wang, Fasheng
    Zhang, Qing
    Zhao, Qianyi
    Wang, Mengyin
    Sun, Fuming
    [J]. APPLIED INTELLIGENCE, 2024, 54 (08) : 6558 - 6578
  • [3] Unified Generative Adversarial Networks for Controllable Image-to-Image Translation
    Tang, Hao
    Liu, Hong
    Sebe, Nicu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8916 - 8929
  • [4] Perceptual Contrastive Generative Adversarial Network based on image warping for unsupervised image-to-image translation
    Huang, Lin-Chieh
    Tsai, Hung-Hsu
    [J]. NEURAL NETWORKS, 2023, 166 : 313 - 325
  • [5] Knowledge Distillation Generative Adversarial Network for Image-to-Image Translation
    Sub-r-pa, Chayanon
    Chen, Rung-Ching
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (08) : 896 - 902
  • [6] Image-to-Image Translation using a Relativistic Generative Adversarial Network
    Xing, Xingrun
    Zhang, Dawei
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [7] iFlowGAN: An Invertible Flow-Based Generative Adversarial Network for Unsupervised Image-to-Image Translation
    Dai, Longquan
    Tang, Jinhui
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4151 - 4162
  • [8] Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
    Tang, Hao
    Xu, Dan
    Sebe, Nicu
    Yan, Yan
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Unsupervised Image-to-Image Translation with Generative Prior
    Yang, Shuai
    Jiang, Liming
    Liu, Ziwei
    Loy, Chen Change
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
  • [10] Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation
    Zheng, Ziqiang
    Bin, Yi
    Lv, Xiaoou
    Wu, Yang
    Yang, Yang
    Shen, Heng Tao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2474 - 2487