A Style-aware Discriminator for Controllable Image Translation

Cited by: 17
Authors
Kim, Kunhee [1 ]
Park, Sanghun [1 ]
Jeon, Eunyeong [1 ]
Kim, Taehun [1 ]
Kim, Daijin [1 ]
Affiliations
[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea
DOI
10.1109/CVPR52688.2022.01770
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Current image-to-image translations do not control the output domain beyond the classes used during training, nor do they interpolate between different domains well, leading to implausible results. This limitation largely arises because labels do not consider the semantic distance. To mitigate such problems, we propose a style-aware discriminator that acts as a critic as well as a style encoder to provide conditions. The style-aware discriminator learns a controllable style space using prototype-based self-supervised learning and simultaneously guides the generator. Experiments on multiple datasets verify that the proposed model outperforms current state-of-the-art image-to-image translation methods. In contrast with current methods, the proposed approach supports various applications, including style interpolation, content transplantation, and local image translation.
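The abstract mentions a style space organized around prototypes and style interpolation as an application. The following is a minimal illustrative sketch of those two ideas only, not the authors' implementation: it assumes style codes are plain vectors compared by cosine similarity, and the function names (`l2_normalize`, `interpolate_styles`, `nearest_prototype`) are hypothetical.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (assumes the vector is non-zero)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def interpolate_styles(s_a, s_b, t):
    """Blend two style codes linearly, then re-normalize.

    t = 0 gives the direction of s_a, t = 1 the direction of s_b.
    Assumption: style codes live on the unit sphere; the record does not
    state this. Opposite vectors at t = 0.5 would blend to zero and fail.
    """
    mixed = [(1.0 - t) * a + t * b for a, b in zip(s_a, s_b)]
    return l2_normalize(mixed)


def nearest_prototype(style, prototypes):
    """Return the index of the prototype with the highest cosine similarity."""
    style = l2_normalize(style)
    sims = [
        sum(x * y for x, y in zip(style, l2_normalize(p)))
        for p in prototypes
    ]
    return max(range(len(sims)), key=sims.__getitem__)


if __name__ == "__main__":
    # Two toy 2-D style codes and two toy prototypes (made-up values).
    s_a, s_b = [1.0, 0.0], [0.0, 1.0]
    halfway = interpolate_styles(s_a, s_b, 0.5)
    print(halfway)
    print(nearest_prototype([0.9, 0.1], [s_a, s_b]))
```

In a translation model of the kind the abstract describes, an interpolated code like `halfway` would be passed to the generator as the style condition; that step is omitted here because the record gives no details of the generator interface.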
Pages: 18218-18227
Page count: 10
Related papers
50 in total
  • [31] AUTOMATED STYLE-AWARE SELECTION OF ANNOTATED PRE-TRAINING DATABASES IN BIOMEDICAL IMAGING
    Molina-Moreno, Miguel
    Schilling, Marcel P.
    Reischl, Markus
    Mikut, Ralf
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [32] Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time
    Lee, Yong Jae
    Efros, Alexei A.
    Hebert, Martial
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1857 - 1864
  • [33] Discriminator guided visible-to-infrared image translation
    Ma, Decao
    Su, Juan
    Xian, Yong
    Li, Shaopeng
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)
  • [34] Invisible to People but not to Machines: Evaluation of Style-aware Headline Generation in Absence of Reliable Human Judgment
    de Mattei, Lorenzo
    Cafagna, Michele
    Dell'Orletta, Felice
    Nissim, Malvina
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6709 - 6717
  • [35] Instance-Level Image Translation With a Local Discriminator
    Xu, Mingle
    Lee, Jaehwan
    Fuentes, Alvaro
    Park, Dong Sun
    Yang, Jucheng
    Yoon, Sook
    IEEE ACCESS, 2021, 9 : 111802 - 111813
  • [36] Guided Image-to-Image Translation by Discriminator-Generator Communication
    Cao, Yuanjiang
    Yao, Lina
    Pan, Le
    Sheng, Quan Z.
    Chang, Xiaojun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1528 - 1538
  • [37] Learning Style Subspaces for Controllable Unpaired Domain Translation
    Bhatt, Gaurav
    Balasubramanian, Vineeth N.
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4209 - 4218
  • [38] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
    Shi, Zifan
    Xu, Yinghao
    Shen, Yujun
    Zhao, Deli
    Chen, Qifeng
    Yeung, Dit-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [39] SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation
    Shao, Xuning
    Zhang, Weidong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6526 - 6535
  • [40] Driving style-aware energy management for battery/supercapacitor electric vehicles using deep reinforcement learning
    Wu, Yue
    Huang, Zhiwu
    Zhang, Rui
    Huang, Pei
    Gao, Yang
    Li, Heng
    Liu, Yongjie
    Peng, Jun
    JOURNAL OF ENERGY STORAGE, 2023, 73