A Style-aware Discriminator for Controllable Image Translation

被引:17
|
作者
Kim, Kunhee [1 ]
Park, Sanghun [1 ]
Jeon, Eunyeong [1 ]
Kim, Taehun [1 ]
Kim, Daijin [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea
关键词
D O I
10.1109/CVPR52688.2022.01770
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current image-to-image translations do not control the output domain beyond the classes used during training, nor do they interpolate between different domains well, leading to implausible results. This limitation largely arises because labels do not consider the semantic distance. To mitigate such problems, we propose a style-aware discriminator that acts as a critic as well as a style encoder to provide conditions. The style-aware discriminator learns a controllable style space using prototype-based self-supervised learning and simultaneously guides the generator. Experiments on multiple datasets verify that the proposed model outperforms current state-of-the-art image-to-image translation methods. In contrast with current methods, the proposed approach supports various applications, including style interpolation, content transplantation, and local image translation.
引用
收藏
页码:18218 / 18227
页数:10
相关论文
共 50 条
  • [41] Unsupervised Image-to-Image Translation with Style Consistency
    Lai, Binxin
    Wang, Yuan-Gen
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
  • [42] Panoptic-aware Image-to-Image Translation
    Zhang, Liyun
    Ratsamee, Photchara
    Wang, Bowen
    Luo, Zhaojie
    Uranishi, Yuki
    Higashida, Manabu
    Takemura, Haruo
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 259 - 268
  • [43] Scellseg: A style-aware deep learning tool for adaptive cell instance segmentation by contrastive fine-tuning
    Xun, Dejin
    Chen, Deheng
    Zhou, Yitian
    Lauschke, Volker M.
    Wang, Rui
    Wang, Yi
    ISCIENCE, 2022, 25 (12)
  • [44] Aesthetic-Aware Image Style Transfer
    Hu, Zhiyuan
    Jia, Jia
    Liu, Bei
    Bu, Yaohua
    Fu, Jianlong
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3320 - 3329
  • [45] Image Purification through Controllable Neural Style Transfer
    Zhao, Tongtong
    Yan, Yuxiao
    Shehu, Ibrahim Shehi
    Wei, HaoHui
    Fu, Xianping
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 466 - 471
  • [46] Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
    Song, Linsen
    Wu, Wayne
    Fu, Chaoyou
    Loy, Chen Change
    He, Ran
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1247 - 1261
  • [47] ECGAN: Image Translation with Multi-scale Relativistic Average Discriminator
    Xia, Weihao
    Yang, Yujiu
    Bao, Xian-yu
    ARTIFICIAL INTELLIGENCE AND MOBILE SERVICES - AIMS 2019, 2019, 11516 : 28 - 38
  • [48] Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
    Richardson, Elad
    Alaluf, Yuval
    Patashnik, Or
    Nitzan, Yotam
    Azar, Yaniv
    Shapiro, Stav
    Cohen-Or, Daniel
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2287 - 2296
  • [49] OmniStyleGAN for Style-Guided Image-to-Image Translation
    Zhao, Qianyi
    Wang, Mengyin
    Zhang, Qing
    Wang, Fasheng
    Sun, Fuming
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 351 - 365
  • [50] SHUNIT: Style Harmonization for Unpaired Image-to-Image Translation
    Song, Seokbeom
    Lee, Suhyeon
    Seong, Hongje
    Min, Kyoungwon
    Kim, Euntai
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2292 - 2302