Identifiability of deep generative models without auxiliary information

被引：0

作者：

Kivva, Bohdan ^{[1
]}

Rajendran, Goutham ^{[1
]}

Ravikumar, Pradeep ^{[2
]}

Aragam, Bryon ^{[1
]}

机构：

[1] Univ Chicago, Chicago, IL 60637 USA

[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022 | 2022年

关键词：

INDEPENDENT COMPONENT ANALYSIS; INFERENCE; MIXTURES;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We prove identifiability of a broad class of deep latent variable models that (a) have universal approximation capabilities and (b) are the decoders of variational autoencoders that are commonly used in practice. Unlike existing work, our analysis does not require weak supervision, auxiliary information, or conditioning in the latent space. Specifically, we show that for a broad class of generative (i.e. unsupervised) models with universal approximation capabilities, the side information u is not necessary: We prove identifiability of the entire generative model where we do not observe u and only observe the data x. The models we consider match autoencoder architectures used in practice that leverage mixture priors in the latent space and ReLU/leaky-ReLU activations in the encoder, such as VaDE and MFC-VAE. Our main result is an identifiability hierarchy that significantly generalizes previous work and exposes how different assumptions lead to different "strengths" of identifiability, and includes certain "vanilla" VAEs with isotropic Gaussian priors as a special case. For example, our weakest result establishes (unsupervised) identifiability up to an affine transformation, and thus partially resolves an open problem regarding model identifiability raised in prior work. These theoretical results are augmented with experiments on both simulated and real data.

引用

页数：15

共 50 条

[21] A survey of multimodal deep generative models
Suzuki, Masahiro
Matsuo, Yutaka
Advanced Robotics, 2022, 36 (5-6): : 261 - 278
[22] On Memorization in Probabilistic Deep Generative Models
van den Burg, Gerrit J. J.
Williams, Christopher K. I.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[23] The Riemannian Geometry of Deep Generative Models
Shao, Hang
Kumar, Abhishek
Fletcher, P. Thomas
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 428 - 436
[24] A Priori Independence for Deep Generative Models
Rastgaufard, Rastin
Alsamman, AbdulRahman
2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 445 - 451
[25] Interpretable Deep Generative Recommendation Models
Liu, Huafeng
Jing, Liping
Wen, Jingxuan
Xu, Pengyu
Wang, Jiaqi
Yu, Jian
Ng, Michael K.
JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[26] Deep Generative Models for Spatial Networks
Guo, Xiaojie
Du, Yuanqi
Zhao, Liang
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 505 - 515
[27] Face Inpainting with Deep Generative Models
Qiang, Zhenping
He, Libo
Zhang, Qinghui
Li, Junqiu
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 1232 - 1244
[28] On Deep Generative Models with Applications to Recognition
Ranzato, Marc'Aurelio
Susskind, Joshua
Mnih, Volodymyr
Hinton, Geoffrey
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
[29] Face Inpainting with Deep Generative Models
Zhenping Qiang
Libo He
Qinghui Zhang
Junqiu Li
International Journal of Computational Intelligence Systems, 2019, 12 : 1232 - 1244
[30] Interpretable deep generative recommendation models
Liu, Huafeng
Jing, Liping
Wen, Jingxuan
Xu, Pengyu
Wang, Jiaqi
Yu, Jian
Ng, Michael K.
Journal of Machine Learning Research, 2021, 22

← 1 2 3 4 5 →