Unsupervised image-to-image translation with multiscale attention generative adversarial network

被引:0
|
作者
Wang, Fasheng [1 ]
Zhang, Qing [1 ]
Zhao, Qianyi [1 ]
Wang, Mengyin [1 ]
Sun, Fuming [1 ]
机构
[1] Dalian Minzu Univ, Sch Informat & Commun Engn, 18 Liaohe West Rd, Dalian 116600, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Image-to-image translation; Generative adversarial network; Multiscale; Convolutional block attention module;
D O I
10.1007/s10489-024-05522-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised image-to-image translation refers to translating images from the source domain to the target domain, assuring that the translated images have the style of the target domain while retaining the content of the source domain. Although existing image-to-image translation methods can map an image from the source domain to the target domain, the translation results are prone to visual artifacts, and the texture and shape of the input image cannot match the target domain well. The reason for this phenomenon is that the generator ignores the most differential information between the source and target domains, preventing the extraction of the rich image feature information. In this paper, we propose a multiscale attention-generative adversarial network (MSA-GAN) for unsupervised image-to-image translation. In MSA-GAN, we design a multiscale attention network (MSANet) as the backbone of the generator, which consists of the Res2Net block and convolutional block attention module (CBAM). MSANet can extract global and local features and effectively alleviate the detail missing and blurry problems in image translation. It also focuses on the important image features and improves the ability of the network to extract features from the most distinguishing regions between the source and target domains, which allows it to better translate the texture details and object shape. In addition, to generate high-quality images, we introduce the perceptual loss to constrain high-level feature information. Extensive experimental results show that the proposed MSA-GAN achieves competitive performance in image-to-image translation. Our model outperforms several advanced models on several public benchmark datasets.
引用
收藏
页码:6558 / 6578
页数:21
相关论文
共 50 条
  • [1] Unsupervised Generative Adversarial Network for Plantar Pressure Image-to-Image Translation
    Ahmadian, Mona
    Beheshti, Mohammad T. H.
    Kalhor, Ahmad
    Shirian, Amir
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2580 - 2583
  • [2] Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
    Tang, Hao
    Xu, Dan
    Sebel, Nicu
    Yan, Yan
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [3] Perceptual Contrastive Generative Adversarial Network based on image warping for unsupervised image-to-image translation
    Huang, Lin-Chieh
    Tsai, Hung-Hsu
    [J]. NEURAL NETWORKS, 2023, 166 : 313 - 325
  • [4] DuCaGAN: Unified Dual Capsule Generative Adversarial Network for Unsupervised Image-to-Image Translation
    Shao, Guifang
    Huang, Meng
    Gao, Fengqiang
    Liu, Tundong
    Li, Liduan
    [J]. IEEE ACCESS, 2020, 8 : 154691 - 154707
  • [5] Knowledge Distillation Generative Adversarial Network for Image-to-Image Translation
    Sub-r-pa, Chayanon
    Chen, Rung-Ching
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (08) : 896 - 902
  • [6] Image-to-Image Translation using a Relativistic Generative Adversarial Network
    Xing, Xingrun
    Zhang, Dawei
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [7] iFlowGAN: An Invertible Flow-Based Generative Adversarial Network for Unsupervised Image-to-Image Translation
    Dai, Longquan
    Tang, Jinhui
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4151 - 4162
  • [8] CSAGAN: Channel and Spatial Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
    Yang, Rui
    Peng, Chao
    Wang, Chenchao
    Wang, Mengdan
    Chen, Yao
    Zheng, Peng
    Xiong, Neal N.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3258 - 3265
  • [9] Unsupervised Image-to-Image Translation with Generative Prior
    Yang, Shuai
    Jiang, Liming
    Liu, Ziwei
    Loy, Chen Change
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
  • [10] Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation
    Zheng, Ziqiang
    Bin, Yi
    Lv, Xiaoou
    Wu, Yang
    Yang, Yang
    Shen, Heng Tao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2474 - 2487