ATTENTIVE GENERATIVE ADVERSARIAL NETWORK TO BRIDGE MULTI-DOMAIN GAP FOR IMAGE SYNTHESIS

被引：5

作者：

Wang, Min ^{[1
]}

Lang, Congyan ^{[1
]}

Liang, Liqian ^{[1
]}

Lyu, Gengyu ^{[1
]}

Feng, Songhe ^{[1
]}

Wang, Tao ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2020年

关键词：

text-to-image synthesis; attentive generative adversarial network; contextual loss; image contours; TEXT;

D O I：

10.1109/icme46284.2020.9102761

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Despite the significant progress on text-to-image synthesis, automatically generating realistic images remains a challenging task since the location and specific shape of object are not given in the text descriptions. To address these problems, we propose a novel attentive generative adversarial network with contextual loss (AGAN-CL) algorithm. More specifically, the generative network consists of two sub-networks: a contextual network for generating image contours, and a cycle transformation autoencoder for converting contours to realistic images. Our core idea is the injection of image contours into the generative network, which is the most critical part of our network, since it will guide the whole generative network to focus on object regions. In addition, we also apply contextual loss and cycle-consistent loss to bridge multi-domain gap. Comprehensive results on several challenging datasets demonstrate the advantage of the proposed method over the leading approaches, regarding both visual fidelity and alignment with input descriptions.

引用

页数：6

共 50 条

[1] A Domain Gap Aware Generative Adversarial Network for Multi-Domain Image Translation
Xu, Wenju
Wang, Guanghui
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 72 - 84
[2] WeaGAN:Generative Adversarial Network for Weather Translation of Image among Multi-domain
Lin, Yating
Li, Yidong
Cui, Haidong
Feng, Zheng
2019 6TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC AND SOCIO-CULTURAL COMPUTING (BESC 2019), 2019,
[3] MULTI-DOMAIN ATTENTIVE DETECTION NETWORK
Cho, Sungmin
Choi, Bowon
Kim, Do-Hwi
Kwon, Junseok
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2194 - 2198
[4] TMGAN: two-stage multi-domain generative adversarial network for landscape image translation
Lin, Liyuan
Zhang, Shun
Ji, Shulin
Zhao, Shuxian
Wen, Aolin
Yan, Jingpeng
Zhou, Yuan
Zhou, Weibin
VISUAL COMPUTER, 2024, 40 (09): : 6389 - 6405
[5] SoloGAN: Multi-domain Multimodal Unpaired Image-to-Image Translation via a Single Generative Adversarial Network
Huang S.
He C.
Cheng R.
IEEE Transactions on Artificial Intelligence, 2022, 3 (05): : 722 - 737
[6] Class-Balanced Text to Image Synthesis With Attentive Generative Adversarial Network
Wang, Min
Lang, Congyan
Liang, Liqian
Lyu, Gengyu
Feng, Songhe
Wang, Tao
IEEE MULTIMEDIA, 2021, 28 (03) : 21 - 31
[7] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Choi, Yunjey
Choi, Minje
Kim, Munyoung
Ha, Jung-Woo
Kim, Sunghun
Choo, Jaegul
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8789 - 8797
[8] Dual Generator Generative Adversarial Networks for Multi-domain Image-to-Image Translation
Tang, Hao
Xu, Dan
Wang, Wei
Yan, Yan
Sebe, Nicu
COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 3 - 21
[9] Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation
Yang, Xuewen
Xie, Dongliang
Wang, Xin
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 374 - 382
[10] MDT: UNSUPERVISED MULTI-DOMAIN IMAGE-TO-IMAGE TRANSLATOR BASED ON GENERATIVE ADVERSARIAL NETWORKS
Lin, Ye
Fu, Keren
Ling, Shenggui
Cheng, Peng
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 598 - 602

← 1 2 3 4 5 →