Learning Disentangled Discrete Representations

被引:1
|
作者
Friede, David [1 ]
Reimers, Christian [2 ]
Stuckenschmidt, Heiner [1 ]
Niepert, Mathias [3 ,4 ]
机构
[1] Univ Mannheim, Mannheim, Germany
[2] Max Planck Inst Biogeochem, Jena, Germany
[3] Univ Stuttgart, Stuttgart, Germany
[4] NEC Labs Europe, Heidelberg, Germany
关键词
Categorical VAE; Disentanglement;
D O I
10.1007/978-3-031-43421-1_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent successes in image generation, model-based reinforcement learning, and text-to-image generation have demonstrated the empirical advantages of discrete latent representations, although the reasons behind their benefits remain unclear. We explore the relationship between discrete latent spaces and disentangled representations by replacing the standard Gaussian variational autoencoder (VAE) with a tailored categorical variational autoencoder. We show that the underlying grid structure of categorical distributions mitigates the problem of rotational invariance associated with multivariate Gaussian distributions, acting as an efficient inductive prior for disentangled representations. We provide both analytical and empirical findings that demonstrate the advantages of discrete VAEs for learning disentangled representations. Furthermore, we introduce the first unsupervised model selection strategy that favors disentangled representations.
引用
收藏
页码:593 / 609
页数:17
相关论文
共 50 条
  • [41] Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations
    Benaroya, Laurent
    Obin, Nicolas
    Roebel, Axel
    ENTROPY, 2023, 25 (02)
  • [42] An Adversarial Neuro-Tensorial Approach for Learning Disentangled Representations
    Wang, Mengjiao
    Shu, Zhixin
    Cheng, Shiyang
    Panagakis, Yannis
    Samaras, Dimitris
    Zafeiriou, Stefanos
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (6-7) : 743 - 762
  • [43] An Adversarial Neuro-Tensorial Approach for Learning Disentangled Representations
    Mengjiao Wang
    Zhixin Shu
    Shiyang Cheng
    Yannis Panagakis
    Dimitris Samaras
    Stefanos Zafeiriou
    International Journal of Computer Vision, 2019, 127 : 743 - 762
  • [44] Disentangled behavioral representations
    Dezfouli, Amir
    Ashtiani, Hassan
    Ghattas, Omar
    Nock, Richard
    Dayan, Peter
    Ong, Cheng Soon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [45] DisenCDR: Learning Disentangled Representations for Cross-Domain Recommendation
    Cao, Jiangxia
    Lin, Xixun
    Cong, Xin
    Ya, Jing
    Liu, Tingwen
    Wang, Bin
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 267 - 277
  • [46] Learning Disentangled Textual Representations via Statistical Measures of Similarity
    Colombo, Pierre
    Staerman, Guillaume
    Noiry, Nathan
    Piantanida, Pablo
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2614 - 2630
  • [47] Learning Disentangled Representations for Identity Preserving Surveillance Face Camouflage
    Li, Jingzhi
    Han, Lutong
    Zhang, Hua
    Han, Xiaoguang
    Ge, Jingguo
    Cao, Xiaochun
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9748 - 9755
  • [48] On Causally Disentangled Representations
    Reddy, Abbavaram Gowtham
    Benin, Godfrey L.
    Balasubramanian, Vineeth N.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8089 - 8097
  • [49] Structured Disentangled Representations
    Esmaeili, Babak
    Wu, Hao
    Jain, Sarthak
    Bozkurt, Alican
    Siddharth, N.
    Paige, Brooks
    Brooks, Dana H.
    Dy, Jennifer
    van de Meent, Jan-Willem
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [50] Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data
    Hsu, Wei-Ning
    Zhang, Yu
    Glass, James
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30