Identifiability of deep generative models without auxiliary information

Cited by: 0
Authors
Kivva, Bohdan [1]
Rajendran, Goutham [1]
Ravikumar, Pradeep [2]
Aragam, Bryon [1]
Affiliations
[1] Univ Chicago, Chicago, IL 60637 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
INDEPENDENT COMPONENT ANALYSIS; INFERENCE; MIXTURES
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We prove identifiability of a broad class of deep latent variable models that (a) have universal approximation capabilities and (b) are the decoders of variational autoencoders that are commonly used in practice. Unlike existing work, our analysis does not require weak supervision, auxiliary information, or conditioning in the latent space. Specifically, we show that for a broad class of generative (i.e. unsupervised) models with universal approximation capabilities, the side information u is not necessary: We prove identifiability of the entire generative model where we do not observe u and only observe the data x. The models we consider match autoencoder architectures used in practice that leverage mixture priors in the latent space and ReLU/leaky-ReLU activations in the encoder, such as VaDE and MFC-VAE. Our main result is an identifiability hierarchy that significantly generalizes previous work and exposes how different assumptions lead to different "strengths" of identifiability, and includes certain "vanilla" VAEs with isotropic Gaussian priors as a special case. For example, our weakest result establishes (unsupervised) identifiability up to an affine transformation, and thus partially resolves an open problem regarding model identifiability raised in prior work. These theoretical results are augmented with experiments on both simulated and real data.
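The following is a minimal illustrative sketch, not the authors' code, of why identifiability "up to an affine transformation" is the natural weakest notion for this model class. It assumes a Gaussian mixture prior over the latent z and a one-hidden-layer leaky-ReLU decoder; all names, sizes, and parameter values (`sample_gmm`, `decoder`, `A`, `c`, and so on) are hypothetical. Applying an invertible affine map to the latents and absorbing its inverse into the first decoder layer yields a different parametrization that produces exactly the same observations x, and the transformed latents still follow a Gaussian mixture because affine images of Gaussians are Gaussian.

```python
import numpy as np


def leaky_relu(h, slope=0.2):
    # Piecewise-linear activation of the kind named in the abstract.
    return np.where(h > 0, h, slope * h)


def sample_gmm(rng, n, weights, means, chols):
    """Draw n latent points from a Gaussian mixture prior (hypothetical helper)."""
    ks = rng.choice(len(weights), size=n, p=weights)   # component labels
    eps = rng.standard_normal((n, means.shape[1]))     # standard normal noise
    # z = mu_k + L_k @ eps for each sample, with L_k a Cholesky-style factor
    return means[ks] + np.einsum("nij,nj->ni", chols[ks], eps)


def decoder(z, W1, b1, W2, b2):
    # One hidden layer with leaky-ReLU, followed by a linear output layer.
    return leaky_relu(z @ W1.T + b1) @ W2.T + b2


rng = np.random.default_rng(0)
d, h, m, n = 2, 8, 5, 1000  # latent dim, hidden width, data dim, sample count

# Assumed mixture prior with three components.
weights = np.array([0.5, 0.3, 0.2])
means = rng.normal(size=(3, d))
chols = np.stack([np.eye(d) * s for s in (0.5, 1.0, 0.3)])

# Assumed decoder parameters.
W1, b1 = rng.normal(size=(h, d)), rng.normal(size=h)
W2, b2 = rng.normal(size=(m, h)), rng.normal(size=m)

z = sample_gmm(rng, n, weights, means, chols)
x = decoder(z, W1, b1, W2, b2)

# Reparametrize the latent space: z_alt = A z + c with A invertible.
A = np.array([[2.0, 1.0], [0.0, 0.5]])
c = np.array([1.0, -3.0])
z_alt = z @ A.T + c

# Absorb the inverse affine map into the first decoder layer.
W1_alt = W1 @ np.linalg.inv(A)
b1_alt = b1 - W1_alt @ c
x_alt = decoder(z_alt, W1_alt, b1_alt, W2, b2)

# Both parametrizations generate the same data up to floating-point error.
max_gap = np.max(np.abs(x - x_alt))
```

Because the two parametrizations are observationally equivalent, no method that sees only x can distinguish them; the abstract's weakest result states that, for the model class studied, this affine ambiguity is the only one that remains.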
Pages: 15