Unsupervised image-to-image translation with multiscale attention generative adversarial network

被引：0

作者：

Wang, Fasheng ^{[1
]}

Zhang, Qing ^{[1
]}

Zhao, Qianyi ^{[1
]}

Wang, Mengyin ^{[1
]}

Sun, Fuming ^{[1
]}

机构：

[1] Dalian Minzu Univ, Sch Informat & Commun Engn, 18 Liaohe West Rd, Dalian 116600, Liaoning, Peoples R China

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Image-to-image translation; Generative adversarial network; Multiscale; Convolutional block attention module;

D O I：

10.1007/s10489-024-05522-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Unsupervised image-to-image translation refers to translating images from the source domain to the target domain, assuring that the translated images have the style of the target domain while retaining the content of the source domain. Although existing image-to-image translation methods can map an image from the source domain to the target domain, the translation results are prone to visual artifacts, and the texture and shape of the input image cannot match the target domain well. The reason for this phenomenon is that the generator ignores the most differential information between the source and target domains, preventing the extraction of the rich image feature information. In this paper, we propose a multiscale attention-generative adversarial network (MSA-GAN) for unsupervised image-to-image translation. In MSA-GAN, we design a multiscale attention network (MSANet) as the backbone of the generator, which consists of the Res2Net block and convolutional block attention module (CBAM). MSANet can extract global and local features and effectively alleviate the detail missing and blurry problems in image translation. It also focuses on the important image features and improves the ability of the network to extract features from the most distinguishing regions between the source and target domains, which allows it to better translate the texture details and object shape. In addition, to generate high-quality images, we introduce the perceptual loss to constrain high-level feature information. Extensive experimental results show that the proposed MSA-GAN achieves competitive performance in image-to-image translation. Our model outperforms several advanced models on several public benchmark datasets.

引用

页码：6558 / 6578

页数：21

共 50 条

[1] Unsupervised Generative Adversarial Network for Plantar Pressure Image-to-Image Translation
Ahmadian, Mona
Beheshti, Mohammad T. H.
Kalhor, Ahmad
Shirian, Amir
[J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2580 - 2583
[2] Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
Tang, Hao
Xu, Dan
Sebel, Nicu
Yan, Yan
[J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[3] Perceptual Contrastive Generative Adversarial Network based on image warping for unsupervised image-to-image translation
Huang, Lin-Chieh
Tsai, Hung-Hsu
[J]. NEURAL NETWORKS, 2023, 166 : 313 - 325
[4] DuCaGAN: Unified Dual Capsule Generative Adversarial Network for Unsupervised Image-to-Image Translation
Shao, Guifang
Huang, Meng
Gao, Fengqiang
Liu, Tundong
Li, Liduan
[J]. IEEE ACCESS, 2020, 8 : 154691 - 154707
[5] Knowledge Distillation Generative Adversarial Network for Image-to-Image Translation
Sub-r-pa, Chayanon
Chen, Rung-Ching
[J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (08) : 896 - 902
[6] Image-to-Image Translation using a Relativistic Generative Adversarial Network
Xing, Xingrun
Zhang, Dawei
[J]. ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
[7] iFlowGAN: An Invertible Flow-Based Generative Adversarial Network for Unsupervised Image-to-Image Translation
Dai, Longquan
Tang, Jinhui
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4151 - 4162
[8] CSAGAN: Channel and Spatial Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
Yang, Rui
Peng, Chao
Wang, Chenchao
Wang, Mengdan
Chen, Yao
Zheng, Peng
Xiong, Neal N.
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3258 - 3265
[9] Unsupervised Image-to-Image Translation with Generative Prior
Yang, Shuai
Jiang, Liming
Liu, Ziwei
Loy, Chen Change
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
[10] Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation
Zheng, Ziqiang
Bin, Yi
Lv, Xiaoou
Wu, Yang
Yang, Yang
Shen, Heng Tao
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2474 - 2487

← 1 2 3 4 5 →