Decomposed Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical Latent Variable Models

被引:7
|
作者
Wu, Tianyi [1 ]
Sugawara, Shinya [1 ]
Yamanishi, Kenji [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
关键词
Model selection; Hierarchical latent variable models; MDL;
D O I
10.1145/3097983.3098110
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new model selection criterion based on the minimum description length principle in a name of the decomposed normalized maximum likelihood criterion. Our criterion can be applied to a large class of hierarchical latent variable models, such as the Naive Bayes models, stochastic block models and latent Dirichlet allocations, for which many conventional information criteria cannot be straightforwardly applied due to irregularity of latent variable models. Our method also has an advantage that it can be exactly evaluated without asymptotic approximation with small time complexity. Our experiments using synthetic and real data demonstrated validity of our method in terms of computational efficiency and model selection accuracy, while our criterion especially dominated the other criteria when sample size is small and when data are noisy.
引用
收藏
页码:1165 / 1174
页数:10
相关论文
共 50 条
  • [1] The decomposed normalized maximum likelihood code-length criterion for selecting hierarchical latent variable models
    Kenji Yamanishi
    Tianyi Wu
    Shinya Sugawara
    Makoto Okada
    [J]. Data Mining and Knowledge Discovery, 2019, 33 : 1017 - 1058
  • [2] The decomposed normalized maximum likelihood code-length criterion for selecting hierarchical latent variable models
    Yamanishi, Kenji
    Wu, Tianyi
    Sugawara, Shinya
    Okada, Makoto
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (04) : 1017 - 1058
  • [3] Selecting food web models using normalized maximum likelihood
    Staniczenko, Phillip P. A.
    Smith, Matthew J.
    Allesina, Stefano
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2014, 5 (06): : 551 - 562
  • [4] Selecting amongst multinomial models: An apologia for normalized maximum likelihood
    Kellen, David
    Klauer, Karl Christoph
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2020, 97
  • [5] Particle methods for maximum likelihood estimation in latent variable models
    Adam M. Johansen
    Arnaud Doucet
    Manuel Davy
    [J]. Statistics and Computing, 2008, 18 : 47 - 57
  • [6] Particle methods for maximum likelihood estimation in latent variable models
    Johansen, Adam M.
    Doucet, Arnaud
    Davy, Manuel
    [J]. STATISTICS AND COMPUTING, 2008, 18 (01) : 47 - 57
  • [7] Normalized maximum likelihood models for genomics
    Tabus, Ioan
    Rissanen, Jorma
    Astola, Jaakko
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 1433 - 1438
  • [8] Asymptotic properties of adaptive maximum likelihood estimators in latent variable models
    Bianconcini, Silvia
    [J]. BERNOULLI, 2014, 20 (03) : 1507 - 1531
  • [9] Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
    Come, Etienne
    Jouvin, Nicolas
    Latouche, Pierre
    Bouveyron, Charles
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (04) : 957 - 986
  • [10] On the convergence of the Monte Carlo maximum likelihood method for latent variable models
    Cappé, O
    Douc, R
    Moulines, E
    Robert, C
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2002, 29 (04) : 615 - 635