Unsupervised image-to-image translation by semantics consistency and self-attention

Authors
ZHANG Zhibin [1]
XUE Wanli [1]
FU Guokai [1]
Affiliations
[1] The Key Laboratory of Computer Vision and System of Ministry of Education, Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology, Tianjin University of Technology
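Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP391.41
Discipline classification code
080203
Abstract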
Unsupervised image-to-image translation is a challenging task in computer vision. The goal is to learn a mapping between two domains without corresponding image pairs. Many previous works focused only on image-level translation and ignored the processing of image features, which led to a loss of semantics, such as changes to the background of the generated image or only partial transformation. In this work, we propose an image-to-image translation method based on generative adversarial networks (GANs). We use an autoencoder structure in the generator to extract image features, and add a semantic consistency loss on the extracted features to maintain the semantic consistency of the generated image. A self-attention mechanism at the end of the generator captures long-distance dependencies in the image. At the same time, expanding the convolutional receptive field enhances the quality of the generated image. Quantitative experiments show that our method significantly outperforms previous works. The improvement is especially pronounced on images with an obvious foreground.
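The two components named in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the abstract does not give the exact form of either component, so the L1 form of the semantic consistency loss, the SAGAN-style attention layout, and all names (`semantic_consistency_loss`, `self_attention`, `w_q`, `w_k`, `w_v`, `gamma`) are assumptions made for illustration only.

```python
import numpy as np


def semantic_consistency_loss(feat_src, feat_gen):
    """Mean absolute difference between two feature maps.

    Assumption: an L1 distance between the encoder features of the input
    image and those of the translated image. The abstract only says that a
    loss is applied to the extracted features.
    """
    return float(np.mean(np.abs(feat_src - feat_gen)))


def self_attention(x, w_q, w_k, w_v, gamma=0.0):
    """Self-attention over all spatial positions of a (C, H, W) feature map.

    Assumption: SAGAN-style layout, with 1x1 convolutions written as plain
    matrix products and a residual connection scaled by `gamma`.
    """
    c, h, w = x.shape
    flat = x.reshape(c, h * w)                  # (C, N), N = H * W positions
    q = w_q @ flat                              # (C', N) queries
    k = w_k @ flat                              # (C', N) keys
    v = w_v @ flat                              # (C, N) values
    scores = q.T @ k                            # (N, N): query i against key j
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn = attn / attn.sum(axis=1, keepdims=True)         # each row sums to 1
    out = v @ attn.T                            # (C, N): weighted sum of values
    return (gamma * out + flat).reshape(c, h, w), attn


# Toy inputs: random features stand in for a generator's feature map.
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 4, 4))
w_q = rng.standard_normal((2, 8)) * 0.1
w_k = rng.standard_normal((2, 8)) * 0.1
w_v = rng.standard_normal((8, 8)) * 0.1

y, attn = self_attention(x, w_q, w_k, w_v, gamma=0.5)
loss = semantic_consistency_loss(x, y)
print(y.shape, attn.shape, loss)
```

Because every position attends to every other position, each output location can draw on the whole feature map, which is the long-distance dependency the abstract refers to. With `gamma=0.0` the layer reduces to the identity, so in this sketch the attention branch is purely additive.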
Pages: 175-180
Page count: 6
Related papers
50 in total
  • [1] Unsupervised image-to-image translation by semantics consistency and self-attention
    Zhibin Zhang
    Wanli Xue
    Guokai Fu
    [J]. Optoelectronics Letters, 2022, 18 : 175 - 180
  • [2] Unsupervised image-to-image translation by semantics consistency and self-attention
    Zhang Zhibin
    Xue Wanli
    Fu Guokai
    [J]. OPTOELECTRONICS LETTERS, 2022, 18 (03) : 175 - 180
  • [3] Unsupervised Image-to-Image Translation with Self-Attention Networks
    Kang, Taewon
    Lee, Kwang Hee
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 102 - 108
  • [4] Unsupervised Image-to-Image Translation with Style Consistency
    Lai, Binxin
    Wang, Yuan-Gen
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
  • [5] Self-attention StarGAN for Multi-domain Image-to-Image Translation
    He, Ziliang
    Yang, Zhenguo
    Mao, Xudong
    Lv, Jianming
    Li, Qing
    Liu, Wenyin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 537 - 549
  • [6] Unsupervised Attention-guided Image-to-Image Translation
    Mejjati, Youssef A.
    Richardt, Christian
    Tompkin, James
    Cosker, Darren
    Kim, Kwang In
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [7] ARDA-UNIT recurrent dense self-attention block with adaptive feature fusion for unpaired (unsupervised) image-to-image translation
    Ghombavani, Farzane Maghsoudi
    Fadaeieslam, Mohammad Javad
    Yaghmaee, Farzin
    [J]. IET IMAGE PROCESSING, 2023, 17 (13) : 3746 - 3758
  • [8] Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint
    Guo, Jiaxian
    Li, Jiachen
    Fu, Huan
    Gong, Mingming
    Zhang, Kun
    Tao, Dacheng
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18228 - 18238
  • [9] Unsupervised image-to-image translation with multiscale attention generative adversarial network
    Wang, Fasheng
    Zhang, Qing
    Zhao, Qianyi
    Wang, Mengyin
    Sun, Fuming
    [J]. APPLIED INTELLIGENCE, 2024, 54 (08) : 6558 - 6578
  • [10] Multimodal Unsupervised Image-to-Image Translation
    Huang, Xun
    Liu, Ming-Yu
    Belongie, Serge
    Kautz, Jan
    [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196