A Style-aware Discriminator for Controllable Image Translation

被引：17

作者：

Kim, Kunhee ^{[1
]}

Park, Sanghun ^{[1
]}

Jeon, Eunyeong ^{[1
]}

Kim, Taehun ^{[1
]}

Kim, Daijin ^{[1
]}

机构：

[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.01770

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current image-to-image translations do not control the output domain beyond the classes used during training, nor do they interpolate between different domains well, leading to implausible results. This limitation largely arises because labels do not consider the semantic distance. To mitigate such problems, we propose a style-aware discriminator that acts as a critic as well as a style encoder to provide conditions. The style-aware discriminator learns a controllable style space using prototype-based self-supervised learning and simultaneously guides the generator. Experiments on multiple datasets verify that the proposed model outperforms current state-of-the-art image-to-image translation methods. In contrast with current methods, the proposed approach supports various applications, including style interpolation, content transplantation, and local image translation.

引用

页码：18218 / 18227

页数：10

共 50 条

[31] AUTOMATED STYLE-AWARE SELECTION OF ANNOTATED PRE-TRAINING DATABASES IN BIOMEDICAL IMAGING
Molina-Moreno, Miguel
Schilling, Marcel P.
Reischl, Markus
Mikut, Ralf
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
[32] Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time
Lee, Yong Jae
Efros, Alexei A.
Hebert, Martial
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1857 - 1864
[33] Discriminator guided visible-to-infrared image translation
Ma, Decao
Su, Juan
Xian, Yong
Li, Shaopeng
COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)
[34] Invisible to People but not to Machines: Evaluation of Style-aware Headline Generation in Absence of Reliable Human Judgment
de Mattei, Lorenzo
Cafagna, Michele
Dell'Orletta, Felice
Nissim, Malvina
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6709 - 6717
[35] Instance-Level Image Translation With a Local Discriminator
Xu, Mingle
Lee, Jaehwan
Fuentes, Alvaro
Park, Dong Sun
Yang, Jucheng
Yoon, Sook
IEEE ACCESS, 2021, 9 : 111802 - 111813
[36] Guided Image-to-Image Translation by Discriminator-Generator Communication
Cao, Yuanjiang
Yao, Lina
Pan, Le
Sheng, Quan Z.
Chang, Xiaojun
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1528 - 1538
[37] Learning Style Subspaces for Controllable Unpaired Domain Translation
Bhatt, Gaurav
Balasubramanian, Vineeth N.
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4209 - 4218
[38] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
Shi, Zifan
Xu, Yinghao
Shen, Yujun
Zhao, Deli
Chen, Qifeng
Yeung, Dit-Yan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[39] SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation
Shao, Xuning
Zhang, Weidong
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6526 - 6535
[40] Driving style-aware energy management for battery/supercapacitor electric vehicles using deep reinforcement learning
Wu, Yue
Huang, Zhiwu
Zhang, Rui
Huang, Pei
Gao, Yang
Li, Heng
Liu, Yongjie
Peng, Jun
JOURNAL OF ENERGY STORAGE, 2023, 73

← 1 2 3 4 5 →