On the Diversity of Conditional Image Synthesis With Semantic Layouts

被引:6
|
作者
Yang, Zichen [1 ,2 ]
Liu, Haifeng [1 ]
Cai, Deng [1 ,3 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Key Lab CAD & CG, Hangzhou 310058, Zhejiang, Peoples R China
[2] Zhejiang Univ, Alibaba Zhejiang Univ Joint Inst Frontier Technol, Hangzhou 310058, Zhejiang, Peoples R China
[3] Fabu Inc, Hangzhou 310012, Zhejiang, Peoples R China
关键词
Image translation; conditional image synthesis; GAN; diversity loss; unpaired training;
D O I
10.1109/TIP.2019.2891935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many image processing tasks can be formulated as translating images between two image domains such as colorization, super-resolution, and conditional image synthesis. In most of these tasks, an input image may correspond to multiple outputs. However, current existing approaches only show minor stochasticity of the outputs. In this paper, we present a novel approach to synthesize diverse realistic images corresponding to a semantic layout. We introduce a diversity loss objective that maximizes the distance between synthesized image pairs and relates the input noise to the semantic segments in the synthesized images. Thus, our approach can not only produce multiple diverse images but also allow users to manipulate the output images by adjusting the noise manually. The experimental results show that images synthesized by our approach are more diverse than that of the current existing works and equipping our diversity loss does not degrade the reality of the base networks. Moreover, our approach can be applied to unpaired datasets.
引用
收藏
页码:2898 / 2907
页数:10
相关论文
共 50 条
  • [1] Diverse Image Synthesis from Semantic Layouts via Conditional IMLE
    Li, Ke
    Zhang, Tianhao
    Malik, Jitendra
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4219 - 4228
  • [2] Diffusion-Based Semantic Image Synthesis from Sparse Layouts
    Huang, Yuantian
    Iizuka, Satoshi
    Fukui, Kazuhiro
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT II, 2024, 14496 : 441 - 454
  • [3] Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis
    Liu, Xihui
    Yin, Guojun
    Shao, Jing
    Wang, Xiaogang
    Li, Hongsheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [4] Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis
    Park, Minho
    Yun, Jooyeol
    Choi, Seunghwan
    Choo, Jaegul
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7557 - 7566
  • [5] Dual conditional GAN based on external attention for semantic image synthesis
    Liu, Gang
    Zhou, Qijun
    Xie, Xiaoxiao
    Yu, Qingchen
    [J]. CONNECTION SCIENCE, 2023, 35 (01)
  • [6] High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
    Wang, Ting-Chun
    Liu, Ming-Yu
    Zhu, Jun-Yan
    Tao, Andrew
    Kautz, Jan
    Catanzaro, Bryan
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8798 - 8807
  • [7] Semantic Image Synthesis via Conditional Cycle-Generative Adversarial Networks
    Liu, Xiyan
    Meng, Gaofeng
    Xiang, Shiming
    Pan, Chunhong
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 988 - 993
  • [8] Semantic Image Analogy with a Conditional Single-Image GAN
    Li, Jiacheng
    Xiong, Zhiwei
    Liu, Dong
    Chen, Xuejin
    Zha, Zheng-Jun
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 637 - 645
  • [9] Semantic-Conditional Diffusion Networks for Image Captioning
    Luo, Jianjie
    Li, Yehao
    Pan, Yingwei
    Yao, Ting
    Feng, Jianlin
    Chao, Hongyang
    Mei, Tao
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23359 - 23368
  • [10] Enhancing Image Representation in Conditional Image Synthesis
    Shim, Jonghwa
    Kim, Eunbeen
    Kim, Hyeonwoo
    Hwang, Eenjun
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 203 - 210