Leveraging the Exact Likelihood of Deep Latent Variable Models

被引:0
|
作者
Mattei, Pierre-Alexandre [1 ]
Frellsen, Jes [1 ]
机构
[1] IT Univ Copenhagen, Dept Comp Sci, Copenhagen, Denmark
关键词
MAXIMUM-LIKELIHOOD; CONSISTENCY; INFERENCE; APPROXIMATE; NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep latent variable models (DLVMs) combine the approximation abilities of deep neural networks and the statistical foundations of generative models. Variational methods are commonly used for inference; however, the exact likelihood of these models has been largely overlooked. The purpose of this work is to study the general properties of this quantity and to show how they can be leveraged in practice. We focus on important inferential problems that rely on the likelihood: estimation and missing data imputation. First, we investigate maximum likelihood estimation for DLVMs: in particular, we show that most unconstrained models used for continuous data have an unbounded likelihood function. This problematic behaviour is demonstrated to be a source of mode collapse. We also show how to ensure the existence of maximum likelihood estimates, and draw useful connections with nonparametric mixture models. Finally, we describe an algorithm for missing data imputation using the exact conditional likelihood of a DLVM. On several data sets, our algorithm consistently and significantly outperforms the usual imputation scheme used for DLVMs.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Exact Inference for Integer Latent-Variable Models
    Winner, Kevin
    Sujono, Debora
    Sheldon, Dan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [2] Particle methods for maximum likelihood estimation in latent variable models
    Adam M. Johansen
    Arnaud Doucet
    Manuel Davy
    [J]. Statistics and Computing, 2008, 18 : 47 - 57
  • [3] Particle methods for maximum likelihood estimation in latent variable models
    Johansen, Adam M.
    Doucet, Arnaud
    Davy, Manuel
    [J]. STATISTICS AND COMPUTING, 2008, 18 (01) : 47 - 57
  • [4] Deep latent variable models for generating knockoffs
    Liu, Ying
    Zheng, Cheng
    [J]. STAT, 2019, 8 (01):
  • [5] Differentiable samplers for deep latent variable models
    Doucet, Arnaud
    Moulines, Eric
    Thin, Achille
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2023, 381 (2247):
  • [6] A Composite Likelihood Inference in Latent Variable Models for Ordinal Longitudinal Responses
    Vassilis G. S. Vasdekis
    Silvia Cagnone
    Irini Moustaki
    [J]. Psychometrika, 2012, 77 : 425 - 441
  • [7] Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
    Come, Etienne
    Jouvin, Nicolas
    Latouche, Pierre
    Bouveyron, Charles
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (04) : 957 - 986
  • [8] Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
    Etienne Côme
    Nicolas Jouvin
    Pierre Latouche
    Charles Bouveyron
    [J]. Advances in Data Analysis and Classification, 2021, 15 : 957 - 986
  • [9] A Composite Likelihood Inference in Latent Variable Models for Ordinal Longitudinal Responses
    Vasdekis, Vassilis G. S.
    Cagnone, Silvia
    Moustaki, Irini
    [J]. PSYCHOMETRIKA, 2012, 77 (03) : 425 - 441
  • [10] Asymptotic properties of adaptive maximum likelihood estimators in latent variable models
    Bianconcini, Silvia
    [J]. BERNOULLI, 2014, 20 (03) : 1507 - 1531