A Style-aware Discriminator for Controllable Image Translation

被引：17

作者：

Kim, Kunhee ^{[1
]}

Park, Sanghun ^{[1
]}

Jeon, Eunyeong ^{[1
]}

Kim, Taehun ^{[1
]}

Kim, Daijin ^{[1
]}

机构：

[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.01770

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current image-to-image translations do not control the output domain beyond the classes used during training, nor do they interpolate between different domains well, leading to implausible results. This limitation largely arises because labels do not consider the semantic distance. To mitigate such problems, we propose a style-aware discriminator that acts as a critic as well as a style encoder to provide conditions. The style-aware discriminator learns a controllable style space using prototype-based self-supervised learning and simultaneously guides the generator. Experiments on multiple datasets verify that the proposed model outperforms current state-of-the-art image-to-image translation methods. In contrast with current methods, the proposed approach supports various applications, including style interpolation, content transplantation, and local image translation.

引用

页码：18218 / 18227

页数：10

共 50 条

[41] Unsupervised Image-to-Image Translation with Style Consistency
Lai, Binxin
Wang, Yuan-Gen
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
[42] Panoptic-aware Image-to-Image Translation
Zhang, Liyun
Ratsamee, Photchara
Wang, Bowen
Luo, Zhaojie
Uranishi, Yuki
Higashida, Manabu
Takemura, Haruo
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 259 - 268
[43] Scellseg: A style-aware deep learning tool for adaptive cell instance segmentation by contrastive fine-tuning
Xun, Dejin
Chen, Deheng
Zhou, Yitian
Lauschke, Volker M.
Wang, Rui
Wang, Yi
ISCIENCE, 2022, 25 (12)
[44] Aesthetic-Aware Image Style Transfer
Hu, Zhiyuan
Jia, Jia
Liu, Bei
Bu, Yaohua
Fu, Jianlong
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3320 - 3329
[45] Image Purification through Controllable Neural Style Transfer
Zhao, Tongtong
Yan, Yuxiao
Shehu, Ibrahim Shehi
Wei, HaoHui
Fu, Xianping
2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 466 - 471
[46] Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
Song, Linsen
Wu, Wayne
Fu, Chaoyou
Loy, Chen Change
He, Ran
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1247 - 1261
[47] ECGAN: Image Translation with Multi-scale Relativistic Average Discriminator
Xia, Weihao
Yang, Yujiu
Bao, Xian-yu
ARTIFICIAL INTELLIGENCE AND MOBILE SERVICES - AIMS 2019, 2019, 11516 : 28 - 38
[48] Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
Richardson, Elad
Alaluf, Yuval
Patashnik, Or
Nitzan, Yotam
Azar, Yaniv
Shapiro, Stav
Cohen-Or, Daniel
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2287 - 2296
[49] OmniStyleGAN for Style-Guided Image-to-Image Translation
Zhao, Qianyi
Wang, Mengyin
Zhang, Qing
Wang, Fasheng
Sun, Fuming
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 351 - 365
[50] SHUNIT: Style Harmonization for Unpaired Image-to-Image Translation
Song, Seokbeom
Lee, Suhyeon
Seong, Hongje
Min, Kyoungwon
Kim, Euntai
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2292 - 2302

← 1 2 3 4 5 →