High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

被引:2413
|
作者
Wang, Ting-Chun [1 ]
Liu, Ming-Yu [1 ]
Zhu, Jun-Yan [2 ]
Tao, Andrew [1 ]
Kautz, Jan [1 ]
Catanzaro, Bryan [1 ]
机构
[1] NVIDIA Corp, Santa Clara, CA 95051 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
关键词
D O I
10.1109/CVPR.2018.00917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new method for synthesizing high resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the results are often limited to low resolution and still far from realistic. In this work, we generate 2048 x 1024 visually appealing results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results given the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.
引用
收藏
页码:8798 / 8807
页数:10
相关论文
共 50 条
  • [41] Railroad semantic segmentation on high-resolution images
    Belyaev, Sergey
    Popov, Igor
    Shubnikov, Vladislav
    Popov, Pavel
    Boltenkova, Ekaterina
    Savchuk, Daniil
    [J]. 2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [42] High-resolution DEM building with SAR interferometry and high-resolution optical image
    Hadj-Sahraoui, Omar
    Fizazi, Hadria
    Berrichi, Faouzi
    Chamakhi, Djemoui
    Kebir, Lahcen Wahib
    [J]. IET IMAGE PROCESSING, 2019, 13 (05) : 713 - 721
  • [43] ARTGAN: ARTWORK SYNTHESIS WITH CONDITIONAL CATEGORICAL GANs
    Tan, Wei Ren
    Chan, Chee Seng
    Aguirre, Hernan E.
    Tanaka, Kiyoshi
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3760 - 3764
  • [44] High-Resolution Diabetic Retinopathy Image Synthesis Manipulated by Grading and Lesions
    Zhou, Yi
    He, Xiaodong
    Cui, Shanshan
    Zhu, Fan
    Liu, Li
    Shao, Ling
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT I, 2019, 11764 : 505 - 513
  • [45] Style-Guided Inference of Transformer for High-resolution Image Synthesis
    Yim, Jonghwa
    Kim, Minjae
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1745 - 1755
  • [46] Learning conditional photometric stereo with high-resolution features
    Ju, Yakun
    Peng, Yuxin
    Jian, Muwei
    Gao, Feng
    Dong, Junyu
    [J]. COMPUTATIONAL VISUAL MEDIA, 2022, 8 (01) : 105 - 118
  • [47] Binary recombinase systems for high-resolution conditional mutagenesis
    Hermann, Mario
    Stillhard, Patrick
    Wildner, Hendrik
    Seruggia, Davide
    Kapp, Viktor
    Sanchez-Iranzo, Hector
    Mercader, Nadia
    Montoliu, Lluis
    Zeilhofer, Hanns Ulrich
    Pelczar, Pawel
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (06) : 3894 - 3907
  • [48] Learning conditional photometric stereo with high-resolution features
    Yakun Ju
    Yuxin Peng
    Muwei Jian
    Feng Gao
    Junyu Dong
    [J]. Computational Visual Media, 2022, (01) : 105 - 118
  • [49] Learning conditional photometric stereo with high-resolution features
    Yakun Ju
    Yuxin Peng
    Muwei Jian
    Feng Gao
    Junyu Dong
    [J]. Computational Visual Media, 2022, 8 : 105 - 118
  • [50] Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis
    Liu, Xihui
    Yin, Guojun
    Shao, Jing
    Wang, Xiaogang
    Li, Hongsheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32