Latent Space Translation via Semantic Alignment

被引:0
|
作者
Maiorca, Valentino [1 ]
Moschella, Luca [1 ]
Norelli, Antonio [1 ]
Fumero, Marco [1 ]
Locatello, Francesco [2 ]
Rodola, Emanuele [1 ]
机构
[1] Sapienza Univ Rome, Rome, Italy
[2] Inst Sci & Technol Austria ISTA, Klosterneuburg, Austria
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While different neural models often exhibit latent spaces that are alike when exposed to semantically related data, this intrinsic similarity is not always immediately discernible. Towards a better understanding of this phenomenon, our work shows how representations learned from these neural modules can be translated between different pre-trained networks via simpler transformations than previously thought. An advantage of this approach is the ability to estimate these transformations using standard, well-understood algebraic procedures that have closed-form solutions. Our method directly estimates a transformation between two given latent spaces, thereby enabling effective stitching of encoders and decoders without additional training. We extensively validate the adaptability of this translation procedure in different experimental settings: across various trainings, domains, architectures (e.g., ResNet, CNN, ViT), and in multiple downstream tasks (classification, reconstruction). Notably, we show how it is possible to zero-shot stitch text encoders and vision decoders, or vice-versa, yielding surprisingly good classification performance in this multimodal setting.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Latent Space Alignment for Semantic Channel Equalization
    Huttebraucker, Tomas
    Sana, Mohamed
    Strinati, Emilio Calvanese
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 552 - 553
  • [2] Domain Translation via Latent Space Mapping
    Mayet, Tsiry
    Bernard, Simon
    Chatelain, Clement
    Herault, Romain
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [3] Learning Semantic Attributes via a Common Latent Space
    Al-Halah, Ziad
    Gehrig, Tobias
    Stiefelhagen, Rainer
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 48 - 55
  • [4] Cross-modal Latent Space Alignment for Image to Avatar Translation
    de Guevara, Manuel Ladron
    Echevarria, Jose
    Li, Yijun
    Hold-Geoffroy, Yannick
    Smith, Cameron
    Ito, Daichi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 520 - 529
  • [5] Vision-and-Language Navigation via Latent Semantic Alignment Learning
    Wu, Siying
    Fu, Xueyang
    Wu, Feng
    Zha, Zheng-Jun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8406 - 8418
  • [6] ROBUST LATENT REPRESENTATIONS VIA CROSS-MODAL TRANSLATION AND ALIGNMENT
    Rajan, Vandana
    Brutti, Alessio
    Cavallaro, Andrea
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4315 - 4319
  • [7] Domain Adaptive Semantic Segmentation via Image Translation and Representation Alignment
    Kang, Jingxuan
    Zang, Bin
    Cao, Weipeng
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 509 - 516
  • [8] Shape Part Transfer via Semantic Latent Space Factorization
    Groscot, Raphael
    Cohen, Laurent
    Guibas, Leonidas
    GEOMETRIC SCIENCE OF INFORMATION, 2019, 11712 : 511 - 519
  • [9] Hybrid Samples Image Dehazing via Latent Space Translation
    Zheng, Yutong
    Sun, Haoying
    Song, Wei
    Computer Engineering and Applications, 2023, 59 (09) : 225 - 236
  • [10] Old Photo Restoration via Deep Latent Space Translation
    Wan, Ziyu
    Zhang, Bo
    Chen, Dong
    Zhang, Pan
    Wen, Fang
    Liao, Jing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2071 - 2087