Image-to-image translation using an offset-basedmulti-scale codes GAN encoder

被引：8

作者：

Guo, Zihao ^{[1
]}

Shao, Mingwen ^{[1
]}

Li, Shunhang ^{[1
]}

机构：

[1] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Generative adversarial networks; GAN inversion; Image-to-image translation; Super-resolution; Conditional face synthesis;

D O I：

10.1007/s00371-023-02810-4

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Despite the remarkable achievements of generative adversarial networks (GANs) in high-quality image synthesis, applying pre-trained GAN models to image-to-image translation is still challenging. Previous approaches typically map the conditional image into the latent spaces of GANs by per-image optimization or learning a GAN encoder. However, neither of these two methods can ideally perform image-to-image translation tasks. In this work, we propose a novel learning-based framework which can complete common image-to-image translation tasks with high quality in real-time based on pre-trained GANs. Specifically, to mitigate the semantic misalignment between conditional and synthesized images, we propose an offset-based image synthesis method that allows our encoder to use multiple rather than one forward propagation to predict the latent codes. During the multiple forward passes, the final latent codes are adjusted continuously according to the semantic difference between the conditional image and the current synthesized image. To further reduce the loss of details during encoding, we extract multiple latent codes at multiple scales from input instead of a single code to synthesize the image. Moreover, we propose an optional multiple feature maps fusion module that combines our encoder with different generators to implement our multiple latent codes strategies. Finally, we analyze the performance and demonstrate the effectiveness of our framework by comparing it with state-of-the-art works on super-resolution and conditional face synthesis tasks.

引用

页码：699 / 715

页数：17

共 50 条

[31] Unsupervised Image-to-Image Translation: A Review
Hoyez, Henri
Schockaert, Cedric
Rambach, Jason
Mirbach, Bruno
Stricker, Didier
SENSORS, 2022, 22 (21)
[32] Unsupervised Image-to-Image Translation Networks
Liu, Ming-Yu
Breuel, Thomas
Kautz, Jan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[33] Re-EnGAN: Unsupervised image-to-image translation based on reused feature encoder in CycleGAN
Lu, Yu
Liu, Ju
Lv, Lin
Gao, Xuesong
Chen, Weiqiang
Zhang, Yuyi
IET IMAGE PROCESSING, 2022, 16 (08) : 2219 - 2227
[34] Image-to-Image Translation using a Relativistic Generative Adversarial Network
Xing, Xingrun
Zhang, Dawei
ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
[35] Synthesizing and Manipulating Natural Videos Using Image-to-Image Translation
Yeh, Ryan
Loui, Alexander
2021 IEEE WESTERN NEW YORK IMAGE AND SIGNAL PROCESSING WORKSHOP (WNYISPW), 2021,
[36] Hand Hygiene Quality Assessment Using Image-to-Image Translation
Wang, Chaofan
Yang, Kangning
Jiang, Weiwei
Wei, Jing
Sarsenbayeva, Zhanna
Goncalves, Jorge
Kostakos, Vassilis
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 64 - 73
[37] Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation
Sela, Matan
Richardson, Elad
Kimmel, Ron
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1585 - 1594
[38] Eliminating Adversarial Perturbations Using Image-to-Image Translation Method
Zhang, Haibo
Yao, Zhihua
Sakurai, Kouichi
APPLIED CRYPTOGRAPHY AND NETWORK SECURITY WORKSHOPS, ACNS 2023 SATELLITE WORKSHOPS, ADSC 2023, AIBLOCK 2023, AIHWS 2023, AIOTS 2023, CIMSS 2023, CLOUD S&P 2023, SCI 2023, SECMT 2023, SIMLA 2023, 2023, 13907 : 601 - 620
[39] DehazeGAN: Underwater Haze Image Restoration using Unpaired Image-to-image Translation
Cho, Younggun
Malav, Ramavtar
Pandey, Gaurav
Kim, Ayoung
IFAC PAPERSONLINE, 2019, 52 (21): : 82 - 85
[40] GiGAN: Gate in GAN, could gate mechanism filter the features in image-to-image translation?
Nie, Xuan
Jia, Jianchao
Ding, Haoxuan
Wong, Edward K.
NEUROCOMPUTING, 2021, 462 (462) : 376 - 388

← 1 2 3 4 5 →