High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

被引:2413
|
作者
Wang, Ting-Chun [1 ]
Liu, Ming-Yu [1 ]
Zhu, Jun-Yan [2 ]
Tao, Andrew [1 ]
Kautz, Jan [1 ]
Catanzaro, Bryan [1 ]
机构
[1] NVIDIA Corp, Santa Clara, CA 95051 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
关键词
D O I
10.1109/CVPR.2018.00917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new method for synthesizing high resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the results are often limited to low resolution and still far from realistic. In this work, we generate 2048 x 1024 visually appealing results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results given the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.
引用
收藏
页码:8798 / 8807
页数:10
相关论文
共 50 条
  • [1] HIGH-RESOLUTION DRIVING SCENE SYNTHESIS USING STACKED CONDITIONAL GANS AND SPECTRAL NORMALIZATION
    Lin, Shaobo
    Chen, Long
    Zou, Qin
    Tian, Wei
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1330 - 1335
  • [2] High-resolution dermoscopy image synthesis with conditional generative adversarial networks
    Ding, Saisai
    Zheng, Jian
    Liu, Zhaobang
    Zheng, Yanyan
    Chen, Yanmei
    Xu, Xiaomin
    Lu, Jia
    Xie, Jing
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 64
  • [3] Semantic Layout Manipulation With High-Resolution Sparse Attention
    Zheng, Haitian
    Lin, Zhe
    Lu, Jingwan
    Cohen, Scott
    Zhang, Jianming
    Xu, Ning
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3768 - 3782
  • [4] Conditional Image Synthesis with Auxiliary Classifier GANs
    Odena, Augustus
    Olah, Christopher
    Shlens, Jonathon
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [5] Fine-grained semantic ethnic costume high-resolution image colorization with conditional GAN
    Wu, Di
    Gan, Jianhou
    Zhou, Juxiang
    Wang, Jun
    Gao, Wei
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (05) : 2952 - 2968
  • [6] Latent space manipulation for high-resolution medical image synthesis via the StyleGAN
    Fetty, Lukas
    Bylund, Mikael
    Kuess, Peter
    Heilemann, Gerd
    Nyholm, Tufve
    Georg, Dietmar
    Lofstedt, Tommy
    [J]. ZEITSCHRIFT FUR MEDIZINISCHE PHYSIK, 2020, 30 (04): : 305 - 314
  • [7] Dual Attention GANs for Semantic Image Synthesis
    Tang, Hao
    Bai, Song
    Sebe, Nicu
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1994 - 2002
  • [8] Improved Transformer for High-Resolution GANs
    Zhao, Long
    Zhang, Zizhao
    Chen, Ting
    Metaxas, Dimitris N.
    Zhang, Han
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] High-resolution concrete damage image synthesis using conditional generative adversarial network
    Li, Shengyuan
    Zhao, Xuefeng
    [J]. AUTOMATION IN CONSTRUCTION, 2023, 147
  • [10] Cross-View Image Synthesis using Conditional GANs
    Regmi, Krishna
    Borji, Ali
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3501 - 3510