EM-LAST: Effective Multidimensional Latent Space Transport for an Unpaired Image-to-Image Translation With an Energy-Based Model

被引:1
|
作者
Han, Giwoong [1 ]
Min, Jinhong [1 ]
Han, Sung Won [1 ]
机构
[1] Korea Univ, Sch Ind & Management Engn, Seoul 02841, South Korea
关键词
Task analysis; Aerospace electronics; Visualization; Licenses; Generative adversarial networks; Deep learning; Decoding; Energy-based model; image-to-image translation; Langevin dynamics; multidimensional latent space; vector-quantized variational autoencoder;
D O I
10.1109/ACCESS.2022.3189352
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For an unpaired image-to-image translation to work effectively, the latent space of each image domain must be well-designed. The codes of each style must be translated toward the target while preserving the parts corresponding to the source content. In general, most Variational Autoencoder (VAE)-based models use a one-dimensional latent space. However, to apply high dimensional methodologies such as vector quantization, controlling a multidimensional latent space is necessary. In this study, among the VAE-based models that use relatively complex multidimensional latent spaces, we apply an Energy-Based Model and Vector-Quantized VAE v2, with the latter as the main model. We show that among the latent spaces that represent each image domain, the importance of each feature at the top and bottom latent spaces must be interpreted differently for appropriate translation. Therefore, we argue that simply understanding the features of latent space composition well can show effective image translation results. We also present various analyses and visual outcomes of multidimensional latent space transport.
引用
收藏
页码:72839 / 72849
页数:11
相关论文
共 40 条
  • [31] Latent Space Energy-Based Model of Symbol-Vector Coupling for Text Generation and Classification
    Pang, Bo
    Wu, Ying Nian
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [32] A local Gaussian distribution fitting energy-based active contour model for image segmentation
    Xu, Haiyong
    Jiang, Gangyi
    Yu, Mei
    Luo, Ting
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 70 : 317 - 333
  • [33] Robust image Translation and Completion Based on Dual Auto-Encoder With Bidirectional Latent Space Regression
    Lee, Sukhan
    Ul Islam, Naeem
    IEEE ACCESS, 2019, 7 : 58695 - 58703
  • [34] A More Effective Method For Image Representation: Topic Model Based on Latent Dirichlet Allocation
    Li, Zongmin
    Tian, Weiwei
    Li, Yante
    Kuang, Zhenzhong
    Liu, Yujie
    2015 14TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS (CAD/GRAPHICS), 2015, : 143 - 148
  • [35] Adaptive Multi-stage Density Ratio Estimation for Learning Latent Space Energy-based Model
    Xiao, Zhisheng
    Han, Tian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [36] EME: Energy-Based Multiexpert Model for Long-Tailed Remote Sensing Image Classification
    Bai, Yu
    Shao, Shuai
    Zhao, Shiyuan
    Liu, Weifeng
    Tao, Dapeng
    Liu, Baodi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 12
  • [37] Energy-Based Prior Latent Space Diffusion Model for Reconstruction of Lumbar Vertebrae from Thick Slice MRI
    Wang, Yanke
    Lee, Yolanne Y. R.
    Dolfini, Aurelio
    Reischl, Markus
    Konukoglu, Ender
    Flouris, Kyriakos
    DEEP GENERATIVE MODELS, DGM4MICCAI 2024, 2025, 15224 : 22 - 32
  • [38] Non-local pairwise energy-based model for the high-dynamic-range image compression problem
    Mignotte, Max
    JOURNAL OF ELECTRONIC IMAGING, 2012, 21 (01)
  • [39] Hybrid fitting energy-based fast level set model for image segmentation solving by algebraic multigrid and sparse field method
    Wang, Dengwei
    IET IMAGE PROCESSING, 2018, 12 (04) : 539 - 545
  • [40] RHLS: A Robust Hybrid Level Set Model Using Global-Local Signed Energy-Based Pressure Force for Medical Image Segmentation
    Almasganj, M.
    Fatemizadeh, E.
    IEEE ACCESS, 2025, 13 : 2004 - 2017