EM-LAST: Effective Multidimensional Latent Space Transport for an Unpaired Image-to-Image Translation With an Energy-Based Model

Cited by: 1
Authors
Han, Giwoong [1]
Min, Jinhong [1]
Han, Sung Won [1]
Affiliations
[1] Korea Univ, Sch Ind & Management Engn, Seoul 02841, South Korea
Keywords
Task analysis; Aerospace electronics; Visualization; Licenses; Generative adversarial networks; Deep learning; Decoding; Energy-based model; image-to-image translation; Langevin dynamics; multidimensional latent space; vector-quantized variational autoencoder
DOI
10.1109/ACCESS.2022.3189352
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
For unpaired image-to-image translation to work effectively, the latent space of each image domain must be well designed. The codes of each style must be translated toward the target while preserving the parts corresponding to the source content. In general, most Variational Autoencoder (VAE)-based models use a one-dimensional latent space. However, to apply high-dimensional methodologies such as vector quantization, controlling a multidimensional latent space is necessary. In this study, among the VAE-based models that use relatively complex multidimensional latent spaces, we apply an Energy-Based Model and Vector-Quantized VAE v2, with the latter as the main model. We show that among the latent spaces that represent each image domain, the importance of each feature at the top and bottom latent spaces must be interpreted differently for appropriate translation. Therefore, we argue that simply understanding the features of latent space composition well can yield effective image translation results. We also present various analyses and visual outcomes of multidimensional latent space transport.
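
Illustrative sketch (not from the paper): the abstract combines two ideas, transporting latent codes with an energy-based model (the keywords mention Langevin dynamics) and snapping latent vectors to a codebook by vector quantization. The toy example below shows both on a small 2x2 grid of 2-dimensional latent vectors. Everything in it is an assumption made for illustration: the quadratic energy stands in for a learned energy function, and the names `langevin_transport`, `make_quadratic_grad`, `quantize`, and the three-entry `codebook` are hypothetical, not the authors' implementation.

```python
import math
import random


def quantize(vec, codebook):
    """Return the index of the codebook entry nearest to vec (squared L2)."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((v - c) ** 2 for v, c in zip(vec, codebook[k])),
    )


def langevin_transport(z, grad_energy, steps=200, step_size=0.05,
                       noise_scale=1.0, rng=None):
    """Apply Langevin updates z <- z - (eps/2) * grad E(z) + scale * sqrt(eps) * noise."""
    rng = rng or random.Random(0)
    z = list(z)
    for _ in range(steps):
        g = grad_energy(z)
        z = [
            zi - 0.5 * step_size * gi
            + noise_scale * math.sqrt(step_size) * rng.gauss(0.0, 1.0)
            for zi, gi in zip(z, g)
        ]
    return z


def make_quadratic_grad(target):
    """Gradient of the toy energy E(z) = 0.5 * ||z - target||^2 (an assumption)."""
    return lambda z: [zi - ti for zi, ti in zip(z, target)]


# Hypothetical codebook: entry 0 plays the "source style", entry 1 the "target style".
codebook = [[0.0, 0.0], [4.0, 4.0], [-4.0, 4.0]]

# A 2x2 spatial grid of 2-d latent vectors, i.e. a small multidimensional latent space.
source_grid = [[[0.1, -0.2], [0.3, 0.0]],
               [[-0.1, 0.2], [0.0, 0.1]]]

grad = make_quadratic_grad(codebook[1])
rng = random.Random(0)

# Transport every spatial position independently, then quantize to code indices.
transported = [
    [langevin_transport(v, grad, noise_scale=0.1, rng=rng) for v in row]
    for row in source_grid
]
codes = [[quantize(v, codebook) for v in row] for row in transported]
print(codes)
```

Setting `noise_scale=0.0` reduces the update to plain gradient descent on the toy energy, which is convenient for checking the transport direction deterministically. A real model would replace `make_quadratic_grad` with the gradient of a learned energy network.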
Pages: 72839-72849
Number of pages: 11