Leveraging the Exact Likelihood of Deep Latent Variable Models

被引：0

作者：

Mattei, Pierre-Alexandre ^{[1
]}

Frellsen, Jes ^{[1
]}

机构：

[1] IT Univ Copenhagen, Dept Comp Sci, Copenhagen, Denmark

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

关键词：

MAXIMUM-LIKELIHOOD; CONSISTENCY; INFERENCE; APPROXIMATE; NETWORKS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep latent variable models (DLVMs) combine the approximation abilities of deep neural networks and the statistical foundations of generative models. Variational methods are commonly used for inference; however, the exact likelihood of these models has been largely overlooked. The purpose of this work is to study the general properties of this quantity and to show how they can be leveraged in practice. We focus on important inferential problems that rely on the likelihood: estimation and missing data imputation. First, we investigate maximum likelihood estimation for DLVMs: in particular, we show that most unconstrained models used for continuous data have an unbounded likelihood function. This problematic behaviour is demonstrated to be a source of mode collapse. We also show how to ensure the existence of maximum likelihood estimates, and draw useful connections with nonparametric mixture models. Finally, we describe an algorithm for missing data imputation using the exact conditional likelihood of a DLVM. On several data sets, our algorithm consistently and significantly outperforms the usual imputation scheme used for DLVMs.

引用

页数：12

共 50 条

[1] Exact Inference for Integer Latent-Variable Models
Winner, Kevin
Sujono, Debora
Sheldon, Dan
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[2] Particle methods for maximum likelihood estimation in latent variable models
Adam M. Johansen
Arnaud Doucet
Manuel Davy
[J]. Statistics and Computing, 2008, 18 : 47 - 57
[3] Particle methods for maximum likelihood estimation in latent variable models
Johansen, Adam M.
Doucet, Arnaud
Davy, Manuel
[J]. STATISTICS AND COMPUTING, 2008, 18 (01) : 47 - 57
[4] Deep latent variable models for generating knockoffs
Liu, Ying
Zheng, Cheng
[J]. STAT, 2019, 8 (01):
[5] Differentiable samplers for deep latent variable models
Doucet, Arnaud
Moulines, Eric
Thin, Achille
[J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2023, 381 (2247):
[6] A Composite Likelihood Inference in Latent Variable Models for Ordinal Longitudinal Responses
Vassilis G. S. Vasdekis
Silvia Cagnone
Irini Moustaki
[J]. Psychometrika, 2012, 77 : 425 - 441
[7] Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
Come, Etienne
Jouvin, Nicolas
Latouche, Pierre
Bouveyron, Charles
[J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (04) : 957 - 986
[8] Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
Etienne Côme
Nicolas Jouvin
Pierre Latouche
Charles Bouveyron
[J]. Advances in Data Analysis and Classification, 2021, 15 : 957 - 986
[9] A Composite Likelihood Inference in Latent Variable Models for Ordinal Longitudinal Responses
Vasdekis, Vassilis G. S.
Cagnone, Silvia
Moustaki, Irini
[J]. PSYCHOMETRIKA, 2012, 77 (03) : 425 - 441
[10] Asymptotic properties of adaptive maximum likelihood estimators in latent variable models
Bianconcini, Silvia
[J]. BERNOULLI, 2014, 20 (03) : 1507 - 1531

← 1 2 3 4 5 →