SIMGAN: PHOTO-REALISTIC SEMANTIC IMAGE MANIPULATION USING GENERATIVE ADVERSARIAL NETWORKS

被引:0
|
作者
Yu, Simiao [1 ]
Dong, Hao [1 ]
Liang, Felix [2 ]
Mo, Yuanhan [1 ]
Wu, Chao [3 ]
Guo, Yike [1 ]
机构
[1] Imperial Coll London, London, England
[2] Univ Washington, Seattle, WA 98195 USA
[3] Zhejiang Univ, Hangzhou, Peoples R China
关键词
adversarial learning; generative model; image generation; semantic image manipulation;
D O I
10.1109/icip.2019.8804285
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Semantic image manipulation (SIM) aims to generate realistic images from an input source image and a target text description, such that the generated images not only match the content of the description, but also maintain text-irrelevant features of the source image. It requires to learn a good mapping between visual features and linguistic features. Previous works on SIM can only generate images of limited resolution that typically lack of fine and clear details. In this work, we aim to generate high-resolution photo-realistic images for SIM. Specifically, we propose SIMGAN, a generative adversarial networks (GAN) based architecture that is capable of generating images of size 256 x 256 for SIM. We demonstrate the effectiveness of SIMGAN and its superiority over existing methods via qualitative and quantitative evaluation on Caltech-200 and Oxford-102 datasets.
引用
收藏
页码:734 / 738
页数:5
相关论文
共 50 条
  • [1] SEMANTICGAN: GENERATIVE ADVERSARIAL NETWORKS FOR SEMANTIC IMAGE TO PHOTO-REALISTIC IMAGE TRANSLATION
    Liu, Junling
    Zou, Yuexian
    Yang, Dongming
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2528 - 2532
  • [2] Photo-Realistic Monocular Gaze Redirection Using Generative Adversarial Networks
    He, Zhe
    Spurr, Adrian
    Zhang, Xucong
    Hilliges, Otmar
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6931 - 6940
  • [3] StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks
    Zhang, Han
    Xu, Tao
    Li, Hongsheng
    Zhang, Shaoting
    Wang, Xiaogang
    Huang, Xiaolei
    Metaxas, Dimitris
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5908 - 5916
  • [4] Photo-realistic dehazing via contextual generative adversarial networks
    Zhang, Shengdong
    He, Fazhi
    Ren, Wenqi
    [J]. MACHINE VISION AND APPLICATIONS, 2020, 31 (05)
  • [5] Photo-realistic dehazing via contextual generative adversarial networks
    Shengdong Zhang
    Fazhi He
    Wenqi Ren
    [J]. Machine Vision and Applications, 2020, 31
  • [6] Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
    Ledig, Christian
    Theis, Lucas
    Huszar, Ferenc
    Caballero, Jose
    Cunningham, Andrew
    Acosta, Alejandro
    Aitken, Andrew
    Tejani, Alykhan
    Totz, Johannes
    Wang, Zehan
    Shi, Wenzhe
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 105 - 114
  • [7] Towards Photo-Realistic Visible Watermark Removal with Conditional Generative Adversarial Networks
    Li, Xiang
    Lu, Chan
    Cheng, Danni
    Li, Wei-Hong
    Cao, Mei
    Liu, Bo
    Ma, Jiechao
    Zheng, Wei-Shi
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 345 - 356
  • [8] Photo-Realistic Image Dehazing and Verifying Networks via Complementary Adversarial Learning
    Shin, Joongchol
    Paik, Joonki
    [J]. SENSORS, 2021, 21 (18)
  • [9] Text to photo-realistic image synthesis via chained deep recurrent generative adversarial network
    Wang, Min
    Lang, Congyan
    Feng, Songhe
    Wang, Tao
    Jin, Yi
    Li, Yidong
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [10] Photo-realistic face age progression/regression using a single generative adversarial network
    Zeng, Jiangfeng
    Ma, Xiao
    Zhou, Ke
    [J]. NEUROCOMPUTING, 2019, 366 : 295 - 304