Image-to-image translation using an offset-basedmulti-scale codes GAN encoder

被引：8

作者：

Guo, Zihao ^{[1
]}

Shao, Mingwen ^{[1
]}

Li, Shunhang ^{[1
]}

机构：

[1] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Generative adversarial networks; GAN inversion; Image-to-image translation; Super-resolution; Conditional face synthesis;

D O I：

10.1007/s00371-023-02810-4

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Despite the remarkable achievements of generative adversarial networks (GANs) in high-quality image synthesis, applying pre-trained GAN models to image-to-image translation is still challenging. Previous approaches typically map the conditional image into the latent spaces of GANs by per-image optimization or learning a GAN encoder. However, neither of these two methods can ideally perform image-to-image translation tasks. In this work, we propose a novel learning-based framework which can complete common image-to-image translation tasks with high quality in real-time based on pre-trained GANs. Specifically, to mitigate the semantic misalignment between conditional and synthesized images, we propose an offset-based image synthesis method that allows our encoder to use multiple rather than one forward propagation to predict the latent codes. During the multiple forward passes, the final latent codes are adjusted continuously according to the semantic difference between the conditional image and the current synthesized image. To further reduce the loss of details during encoding, we extract multiple latent codes at multiple scales from input instead of a single code to synthesize the image. Moreover, we propose an optional multiple feature maps fusion module that combines our encoder with different generators to implement our multiple latent codes strategies. Finally, we analyze the performance and demonstrate the effectiveness of our framework by comparing it with state-of-the-art works on super-resolution and conditional face synthesis tasks.

引用

页码：699 / 715

页数：17

共 50 条

[41] A novel framework for image-to-image translation and image compression
Yang, Fei
Wang, Yaxing
Herranz, Luis
Cheng, Yongmei
Mozerov, Mikhail G.
NEUROCOMPUTING, 2022, 508 : 58 - 70
[42] Correction to: Generative image completion with image-to-image translation
Shuzhen Xu
Qing Zhu
Jin Wang
Neural Computing and Applications, 2020, 32 : 17809 - 17809
[43] Unsupervised Image-to-Image Translation with Generative Prior
Yang, Shuai
Jiang, Liming
Liu, Ziwei
Loy, Chen Change
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
[44] Leveraging Local Domains for Image-to-Image Translation
Dell'Eva, Anthony
Pizzati, Fabio
Bertozzi, Massimo
de Charette, Raoul
PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 179 - 189
[45] Downscaling for Climate Data in Indonesia Using Image-to-Image Translation Approach
Muttaqien, Furqon Hensan
Rahadianti, Laksmita
Latifah, Arnida L.
13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2021), 2021, : 73 - +
[46] Unsupervised Image-to-Image Translation with Style Consistency
Lai, Binxin
Wang, Yuan-Gen
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
[47] Breaking the Dilemma of Medical Image-to-image Translation
Kong, Lingke
Lian, Chenyu
Huang, Detian
Li, Zhenjiang
Hu, Yanle
Zhou, Qichao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[48] Image-to-Image Translation with Conditional Adversarial Networks
Isola, Phillip
Zhu, Jun-Yan
Zhou, Tinghui
Efros, Alexei A.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5967 - 5976
[49] Random Reconstructed Unpaired Image-to-Image Translation
Zhang, Xiaoqin
Fan, Chenxiang
Xiao, Zhiheng
Zhao, Li
Chen, Huiling
Chang, Xiaojun
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 3144 - 3154
[50] Edge Sensitive Unsupervised Image-to-Image Translation
Akkaya, Ibrahim Batuhan
Halici, Ugur
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,

← 1 2 3 4 5 →