Image-to-image translation using an offset-basedmulti-scale codes GAN encoder

被引:8
|
作者
Guo, Zihao [1 ]
Shao, Mingwen [1 ]
Li, Shunhang [1 ]
机构
[1] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 02期
基金
中国国家自然科学基金;
关键词
Generative adversarial networks; GAN inversion; Image-to-image translation; Super-resolution; Conditional face synthesis;
D O I
10.1007/s00371-023-02810-4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Despite the remarkable achievements of generative adversarial networks (GANs) in high-quality image synthesis, applying pre-trained GAN models to image-to-image translation is still challenging. Previous approaches typically map the conditional image into the latent spaces of GANs by per-image optimization or learning a GAN encoder. However, neither of these two methods can ideally perform image-to-image translation tasks. In this work, we propose a novel learning-based framework which can complete common image-to-image translation tasks with high quality in real-time based on pre-trained GANs. Specifically, to mitigate the semantic misalignment between conditional and synthesized images, we propose an offset-based image synthesis method that allows our encoder to use multiple rather than one forward propagation to predict the latent codes. During the multiple forward passes, the final latent codes are adjusted continuously according to the semantic difference between the conditional image and the current synthesized image. To further reduce the loss of details during encoding, we extract multiple latent codes at multiple scales from input instead of a single code to synthesize the image. Moreover, we propose an optional multiple feature maps fusion module that combines our encoder with different generators to implement our multiple latent codes strategies. Finally, we analyze the performance and demonstrate the effectiveness of our framework by comparing it with state-of-the-art works on super-resolution and conditional face synthesis tasks.
引用
收藏
页码:699 / 715
页数:17
相关论文
共 50 条
  • [41] A novel framework for image-to-image translation and image compression
    Yang, Fei
    Wang, Yaxing
    Herranz, Luis
    Cheng, Yongmei
    Mozerov, Mikhail G.
    NEUROCOMPUTING, 2022, 508 : 58 - 70
  • [42] Correction to: Generative image completion with image-to-image translation
    Shuzhen Xu
    Qing Zhu
    Jin Wang
    Neural Computing and Applications, 2020, 32 : 17809 - 17809
  • [43] Unsupervised Image-to-Image Translation with Generative Prior
    Yang, Shuai
    Jiang, Liming
    Liu, Ziwei
    Loy, Chen Change
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
  • [44] Leveraging Local Domains for Image-to-Image Translation
    Dell'Eva, Anthony
    Pizzati, Fabio
    Bertozzi, Massimo
    de Charette, Raoul
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 179 - 189
  • [45] Downscaling for Climate Data in Indonesia Using Image-to-Image Translation Approach
    Muttaqien, Furqon Hensan
    Rahadianti, Laksmita
    Latifah, Arnida L.
    13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2021), 2021, : 73 - +
  • [46] Unsupervised Image-to-Image Translation with Style Consistency
    Lai, Binxin
    Wang, Yuan-Gen
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
  • [47] Breaking the Dilemma of Medical Image-to-image Translation
    Kong, Lingke
    Lian, Chenyu
    Huang, Detian
    Li, Zhenjiang
    Hu, Yanle
    Zhou, Qichao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [48] Image-to-Image Translation with Conditional Adversarial Networks
    Isola, Phillip
    Zhu, Jun-Yan
    Zhou, Tinghui
    Efros, Alexei A.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5967 - 5976
  • [49] Random Reconstructed Unpaired Image-to-Image Translation
    Zhang, Xiaoqin
    Fan, Chenxiang
    Xiao, Zhiheng
    Zhao, Li
    Chen, Huiling
    Chang, Xiaojun
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 3144 - 3154
  • [50] Edge Sensitive Unsupervised Image-to-Image Translation
    Akkaya, Ibrahim Batuhan
    Halici, Ugur
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,