Learning disentangled representations via product manifold projection

Cited by: 0
Authors
Fumero, Marco [1]
Cosmo, Luca [1, 2]
Melzi, Simone [1]
Rodola, Emanuele [1]
Affiliations
[1] Sapienza Univ Rome, Rome, Italy
[2] Univ Svizzera Italiana, Lugano, Switzerland
Funding
European Research Council
Keywords
INDEPENDENT COMPONENT ANALYSIS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel approach to disentangle the generative factors of variation underlying a given set of observations. Our method builds upon the idea that the (unknown) low-dimensional manifold underlying the data space can be explicitly modeled as a product of submanifolds. This definition of disentanglement gives rise to a novel weakly-supervised algorithm for recovering the unknown explanatory factors behind the data. At training time, our algorithm only requires pairs of non-i.i.d. data samples whose elements share at least one, possibly multidimensional, generative factor of variation. We require no knowledge of the nature of these transformations, and do not make any limiting assumptions on the properties of each subspace. Our approach is easy to implement, and can be successfully applied to different kinds of data (from images to 3D surfaces) undergoing arbitrary transformations. In addition to standard synthetic benchmarks, we showcase our method in challenging real-world applications, where we compare favorably with the state of the art.
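As a rough illustration of the product-of-submanifolds idea in the abstract, the sketch below splits a latent vector into consecutive blocks (one per assumed submanifold) and, for a pair of samples, picks the block in which the two encodings are closest as the candidate shared factor. This is not the authors' algorithm: the block sizes `dims`, the Euclidean distance, the argmin rule, and the helper names `split_latent`, `subspace_distances`, and `shared_factor_index` are all assumptions made for illustration only.

```python
import numpy as np


def split_latent(z, dims):
    """Split a 1-D latent vector into consecutive blocks of sizes `dims`.

    Each block stands in for one (assumed) submanifold of the product space.
    """
    boundaries = np.cumsum(dims)[:-1]
    return np.split(z, boundaries)


def subspace_distances(z1, z2, dims):
    """Euclidean distance between two latent codes, computed per block."""
    blocks1 = split_latent(z1, dims)
    blocks2 = split_latent(z2, dims)
    return np.array([np.linalg.norm(a - b) for a, b in zip(blocks1, blocks2)])


def shared_factor_index(z1, z2, dims):
    """Index of the block where the pair is closest (hypothetical selection rule)."""
    return int(np.argmin(subspace_distances(z1, z2, dims)))


# Toy pair of latent codes: 6 dimensions split into blocks of size 2, 3 and 1.
# The middle block is identical in both codes, mimicking a shared factor.
dims = [2, 3, 1]
z1 = np.array([0.5, -1.0, 0.2, 0.4, 0.6, 2.0])
z2 = np.array([1.5, 0.0, 0.2, 0.4, 0.6, -1.0])

distances = subspace_distances(z1, z2, dims)
shared = shared_factor_index(z1, z2, dims)
print(distances, shared)
```

In this toy pair the middle block is identical in both codes, so it is selected as the shared factor. In a learned model the codes would come from an encoder, and the selection rule would be part of the training objective rather than a fixed post-hoc argmin.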
Pages: 11
Related papers
50 in total
  • [1] Learning Disentangled Representations via Independent Subspaces
    Awiszus, Maren
    Ackermann, Hanno
    Rosenhahn, Bodo
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 560 - 568
  • [2] Deterring the Gray Market: Product Diversion Detection via Learning Disentangled Representations of Multivariate Time Series
    Lin, Hao
    Liu, Guannan
    Wu, Junjie
    Zhao, J. Leon
    INFORMS JOURNAL ON COMPUTING, 2024, 36 (02) : 571 - 586
  • [3] Fast Quaternion Product Units for Learning Disentangled Representations in SO(3)
    Qin, Shaofei
    Zhang, Xuan
    Xu, Hongteng
    Xu, Yi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4504 - 4520
  • [4] Learning Disentangled Textual Representations via Statistical Measures of Similarity
    Colombo, Pierre
    Staerman, Guillaume
    Noiry, Nathan
    Piantanida, Pablo
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2614 - 2630
  • [5] Learning Disentangled Representations for Recommendation
    Ma, Jianxin
    Zhou, Chang
    Cui, Peng
    Yang, Hongxia
    Zhu, Wenwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Learning Disentangled Discrete Representations
    Friede, David
    Reimers, Christian
    Stuckenschmidt, Heiner
    Niepert, Mathias
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 593 - 609
  • [7] Learning Causally Disentangled Representations via the Principle of Independent Causal Mechanisms
    Komanduri, Aneesh
    Wu, Yongkai
    Chen, Feng
    Wu, Xintao
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 4308 - 4316
  • [8] LEARNING DISENTANGLED FEATURE REPRESENTATIONS FOR SPEECH ENHANCEMENT VIA ADVERSARIAL TRAINING
    Hou, Nana
    Xu, Chenglin
    Chng, Eng Siong
    Li, Haizhou
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 666 - 670
  • [9] Learning Disentangled Representations for Counterfactual Regression via Mutual Information Minimization
    Cheng, Mingyuan
    Liao, Xinru
    Liu, Quan
    Ma, Bin
    Xu, Jian
    Zheng, Bo
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1802 - 1806
  • [10] Disentangled Representations via Synergy Minimization
    Ver Steeg, Greg
    Brekelmans, Rob
    Harutyunyan, Hrayr
    Galstyan, Aram
    2017 55TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2017, : 180 - 187