Image-to-image translation using an offset-basedmulti-scale codes GAN encoder

被引:8
|
作者
Guo, Zihao [1 ]
Shao, Mingwen [1 ]
Li, Shunhang [1 ]
机构
[1] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 02期
基金
中国国家自然科学基金;
关键词
Generative adversarial networks; GAN inversion; Image-to-image translation; Super-resolution; Conditional face synthesis;
D O I
10.1007/s00371-023-02810-4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Despite the remarkable achievements of generative adversarial networks (GANs) in high-quality image synthesis, applying pre-trained GAN models to image-to-image translation is still challenging. Previous approaches typically map the conditional image into the latent spaces of GANs by per-image optimization or learning a GAN encoder. However, neither of these two methods can ideally perform image-to-image translation tasks. In this work, we propose a novel learning-based framework which can complete common image-to-image translation tasks with high quality in real-time based on pre-trained GANs. Specifically, to mitigate the semantic misalignment between conditional and synthesized images, we propose an offset-based image synthesis method that allows our encoder to use multiple rather than one forward propagation to predict the latent codes. During the multiple forward passes, the final latent codes are adjusted continuously according to the semantic difference between the conditional image and the current synthesized image. To further reduce the loss of details during encoding, we extract multiple latent codes at multiple scales from input instead of a single code to synthesize the image. Moreover, we propose an optional multiple feature maps fusion module that combines our encoder with different generators to implement our multiple latent codes strategies. Finally, we analyze the performance and demonstrate the effectiveness of our framework by comparing it with state-of-the-art works on super-resolution and conditional face synthesis tasks.
引用
收藏
页码:699 / 715
页数:17
相关论文
共 50 条
  • [31] Unsupervised Image-to-Image Translation: A Review
    Hoyez, Henri
    Schockaert, Cedric
    Rambach, Jason
    Mirbach, Bruno
    Stricker, Didier
    SENSORS, 2022, 22 (21)
  • [32] Unsupervised Image-to-Image Translation Networks
    Liu, Ming-Yu
    Breuel, Thomas
    Kautz, Jan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [33] Re-EnGAN: Unsupervised image-to-image translation based on reused feature encoder in CycleGAN
    Lu, Yu
    Liu, Ju
    Lv, Lin
    Gao, Xuesong
    Chen, Weiqiang
    Zhang, Yuyi
    IET IMAGE PROCESSING, 2022, 16 (08) : 2219 - 2227
  • [34] Image-to-Image Translation using a Relativistic Generative Adversarial Network
    Xing, Xingrun
    Zhang, Dawei
    ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [35] Synthesizing and Manipulating Natural Videos Using Image-to-Image Translation
    Yeh, Ryan
    Loui, Alexander
    2021 IEEE WESTERN NEW YORK IMAGE AND SIGNAL PROCESSING WORKSHOP (WNYISPW), 2021,
  • [36] Hand Hygiene Quality Assessment Using Image-to-Image Translation
    Wang, Chaofan
    Yang, Kangning
    Jiang, Weiwei
    Wei, Jing
    Sarsenbayeva, Zhanna
    Goncalves, Jorge
    Kostakos, Vassilis
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 64 - 73
  • [37] Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation
    Sela, Matan
    Richardson, Elad
    Kimmel, Ron
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1585 - 1594
  • [38] Eliminating Adversarial Perturbations Using Image-to-Image Translation Method
    Zhang, Haibo
    Yao, Zhihua
    Sakurai, Kouichi
    APPLIED CRYPTOGRAPHY AND NETWORK SECURITY WORKSHOPS, ACNS 2023 SATELLITE WORKSHOPS, ADSC 2023, AIBLOCK 2023, AIHWS 2023, AIOTS 2023, CIMSS 2023, CLOUD S&P 2023, SCI 2023, SECMT 2023, SIMLA 2023, 2023, 13907 : 601 - 620
  • [39] DehazeGAN: Underwater Haze Image Restoration using Unpaired Image-to-image Translation
    Cho, Younggun
    Malav, Ramavtar
    Pandey, Gaurav
    Kim, Ayoung
    IFAC PAPERSONLINE, 2019, 52 (21): : 82 - 85
  • [40] GiGAN: Gate in GAN, could gate mechanism filter the features in image-to-image translation?
    Nie, Xuan
    Jia, Jianchao
    Ding, Haoxuan
    Wong, Edward K.
    NEUROCOMPUTING, 2021, 462 (462) : 376 - 388