Style-Guided and Disentangled Representation for Robust Image-to-Image Translation

被引:0
|
作者
Choi, Jaewoong [1 ]
Kim, Daeha [1 ]
Song, Byung Cheol [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon 22212, South Korea
关键词
GENERATIVE ADVERSARIAL NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, various image-to-image translation (I2I) methods have improved mode diversity and visual quality in terms of neural networks or regularization terms. However, conventional I2I methods relies on a static decision boundary and the encoded representations in those methods are entangled with each other, so they often face with 'mode collapse' phenomenon. To mitigate mode collapse, 1) we design a so-called style-guided discriminator that guides an input image to the target image style based on the strategy of flexible decision boundary. 2) Also, we make the encoded representations include independent domain attributes. Based on two ideas, this paper proposes Style-Guided and Disentangled Representation for Robust Image-to-Image Translation (SRIT). SRIT showed outstanding FID by 8%, 22.8%, and 10.1% for CelebA-HQ, AFHQ, and Yosemite datasets, respectively. The translated images of SRIT reflect the styles of target domain successfully. This indicates that SRIT shows better mode diversity than previous works.
引用
收藏
页码:463 / 471
页数:9
相关论文
共 50 条
  • [1] Style-Guided Image-to-Image Translation for Multiple Domains
    Li, Tingting
    Zhao, Huan
    Wang, Song
    Huang, Jing
    [J]. MMPT '21: PROCEEDINGS OF THE 2021 WORKSHOP ON MULTI-MODAL PRE-TRAINING FOR MULTIMEDIA UNDERSTANDING, 2021, : 28 - 36
  • [2] Enhancing Style-Guided Image-to-Image Translation via Self-Supervised Metric Learning
    Mao, Qi
    Ma, Siwei
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8511 - 8526
  • [3] Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation
    Liu, Yahui
    Sangineto, Enver
    Chen, Yajing
    Bao, Linchao
    Zhang, Haoxian
    Sebe, Nicu
    Lepri, Bruno
    Wang, Wei
    De Nadai, Marco
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10780 - 10789
  • [4] Diverse Image-to-Image Translation via Disentangled Representations
    Lee, Hsin-Ying
    Tseng, Hung-Yu
    Huang, Jia-Bin
    Singh, Maneesh
    Yang, Ming-Hsuan
    [J]. COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 36 - 52
  • [5] Image-to-Image Translation With Disentangled Latent Vectors for Face Editing
    Dalva, Yusuf
    Pehlivan, Hamza
    Hatipoglu, Oyku Irmak
    Moran, Cansu
    Dundar, Aysegul
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14777 - 14788
  • [6] Unsupervised Image-to-Image Translation with Style Consistency
    Lai, Binxin
    Wang, Yuan-Gen
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
  • [7] Guided Image Weathering using Image-to-Image Translation
    Chen, Yu
    Shen, I-Chao
    Chen, Bing-Yu
    [J]. PROCEEDINGS OF SIGGRAPH ASIA 2021 TECHNICAL COMMUNICATIONS, 2021,
  • [8] Semantic Example Guided Image-to-Image Translation
    Huang, Jialu
    Liao, Jing
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1654 - 1665
  • [9] DRIT++: Diverse Image-to-Image Translation via Disentangled Representations
    Hsin-Ying Lee
    Hung-Yu Tseng
    Qi Mao
    Jia-Bin Huang
    Yu-Ding Lu
    Maneesh Singh
    Ming-Hsuan Yang
    [J]. International Journal of Computer Vision, 2020, 128 : 2402 - 2417
  • [10] Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
    Richardson, Elad
    Alaluf, Yuval
    Patashnik, Or
    Nitzan, Yotam
    Azar, Yaniv
    Shapiro, Stav
    Cohen-Or, Daniel
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2287 - 2296