ATTENTIVE GENERATIVE ADVERSARIAL NETWORK TO BRIDGE MULTI-DOMAIN GAP FOR IMAGE SYNTHESIS

Cited by: 5
Authors
Wang, Min [1]
Lang, Congyan [1]
Liang, Liqian [1]
Lyu, Gengyu [1]
Feng, Songhe [1]
Wang, Tao [1]
Affiliation
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
Keywords
text-to-image synthesis; attentive generative adversarial network; contextual loss; image contours; TEXT;
DOI
10.1109/icme46284.2020.9102761
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Despite significant progress in text-to-image synthesis, automatically generating realistic images remains challenging because text descriptions do not specify the location or specific shape of objects. To address this problem, we propose a novel attentive generative adversarial network with contextual loss (AGAN-CL). More specifically, the generative network consists of two sub-networks: a contextual network that generates image contours, and a cycle transformation autoencoder that converts contours into realistic images. Our core idea is to inject image contours into the generative network; this is the most critical part of our design, since the contours guide the whole generative network to focus on object regions. In addition, we apply a contextual loss and a cycle-consistent loss to bridge the multi-domain gap. Comprehensive results on several challenging datasets demonstrate the advantage of the proposed method over leading approaches in both visual fidelity and alignment with the input descriptions.
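The abstract names a cycle-consistent loss but does not give its form. As a rough illustration only, the sketch below assumes the common L1 formulation: map a sample to the other domain and back, then measure how far the round trip lands from the original. The function names and the toy contour-to-image mappings are hypothetical stand-ins, not the networks or the loss used in AGAN-CL.

```python
from typing import Callable, List

Vector = List[float]
Mapping = Callable[[Vector], Vector]


def l1_distance(a: Vector, b: Vector) -> float:
    """Mean absolute difference between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_consistency_loss(x: Vector, forward: Mapping, backward: Mapping) -> float:
    """L1 gap between x and its round trip backward(forward(x)).

    Assumption: an L1 penalty; the abstract does not state which norm is used.
    """
    return l1_distance(backward(forward(x)), x)


# Hypothetical toy mappings standing in for learned networks.
def to_image(contour: Vector) -> Vector:
    return [2.0 * v + 1.0 for v in contour]


def to_contour_exact(image: Vector) -> Vector:
    # Exact inverse of to_image, so the round trip is lossless.
    return [(v - 1.0) / 2.0 for v in image]


def to_contour_lossy(image: Vector) -> Vector:
    # Imperfect inverse: forgets the offset, so the round trip drifts.
    return [v / 2.0 for v in image]


if __name__ == "__main__":
    contour = [0.0, 1.0, 2.0]
    print(cycle_consistency_loss(contour, to_image, to_contour_exact))
    print(cycle_consistency_loss(contour, to_image, to_contour_lossy))
```

In this toy setting a mapping pair that inverts cleanly gives zero loss, while an imperfect inverse is penalized in proportion to how far the reconstruction drifts, which is the pressure such a term puts on a pair of domain-translation networks during training.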
Pages: 6