Comparison of Criteria for Choosing the Number of Classes in Bayesian Finite Mixture Models

被引:73
|
作者
Nasserinejad, Kazem [1 ,2 ]
van Rosmalen, Joost [1 ]
de Kort, Wim [3 ,4 ]
Lesaffre, Emmanuel [5 ]
机构
[1] Erasmus MC, Dept Biostat, Rotterdam, Netherlands
[2] Erasmus MC, Dept Hematol, Clin Trial Ctr, Inst Canc, Rotterdam, Netherlands
[3] Sanquin Res, Dept Donor Studies, Amsterdam, Netherlands
[4] Acad Med Ctr, Dept Publ Hlth, Amsterdam, Netherlands
[5] Katholieke Univ Leuven, Biostat L, Leuven, Belgium
来源
PLOS ONE | 2017年 / 12卷 / 01期
关键词
UNKNOWN NUMBER; BLOOD-DONORS; WHOLE-BLOOD; HEMOGLOBIN LEVELS; IRON-DEFICIENCY; DISTRIBUTIONS;
D O I
10.1371/journal.pone.0168838
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying the number of classes in Bayesian finite mixture models is a challenging problem. Several criteria have been proposed, such as adaptations of the deviance information criterion, marginal likelihoods, Bayes factors, and reversible jump MCMC techniques. It was recently shown that in overfitted mixture models, the overfitted latent classes will asymptotically become empty under specific conditions for the prior of the class proportions. This result may be used to construct a criterion for finding the true number of latent classes, based on the removal of latent classes that have negligible proportions. Unlike some alternative criteria, this criterion can easily be implemented in complex statistical models such as latent class mixed-effects models and multivariate mixture models using standard Bayesian software. We performed an extensive simulation study to develop practical guidelines to determine the appropriate number of latent classes based on the posterior distribution of the class proportions, and to compare this criterion with alternative criteria. The performance of the proposed criterion is illustrated using a data set of repeatedly measured hemoglobin values of blood donors.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] A comparison of segment retention criteria for finite mixture logit models
    Andrews, RL
    Currim, IS
    [J]. JOURNAL OF MARKETING RESEARCH, 2003, 40 (02) : 235 - 243
  • [2] Choosing the Number of Topics in LDA Models - A Monte Carlo Comparison of Selection Criteria
    Bystrov, Victor
    Naboka-Krell, Viktoriia
    Staszewska-Bystrova, Anna
    Winker, Peter
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [3] Bayesian mixture models (in)consistency for the number of clusters
    Alamichel, Louise
    Bystrova, Daria
    Arbel, Julyan
    King, Guillaume Kon Kam
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2024,
  • [4] Approximate Bayesian computation for finite mixture models
    Simola, Umberto
    Cisewski-Kehe, Jessi
    Wolpert, Robert L.
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (06) : 1155 - 1174
  • [5] Identifying the correct number of classes in growth mixture models
    Tofighi, Davood
    Enders, Craig K.
    [J]. ADVANCES IN LATENT VARIABLE MIXTURE MODELS, 2008, : 317 - +
  • [6] Sensitivity analysis and choosing between alternative polytomous IRT models using Bayesian model comparison criteria
    da Silva, Marcelo A.
    Bazan, Jorge L.
    Huggins-Manley, Anne Corinne
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2019, 48 (02) : 601 - 620
  • [7] Overfitting Bayesian Mixture Models with an Unknown Number of Components
    van Havre, Zoe
    White, Nicole
    Rousseau, Judith
    Mengersen, Kerrie
    [J]. PLOS ONE, 2015, 10 (07):
  • [8] Fitting Unstructured Finite Mixture Models in Longitudinal Design: A Recommendation for Model Selection and Estimation of the Number of Classes
    Todo, Naoya
    Usami, Satoshi
    [J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2016, 23 (05) : 695 - 712
  • [9] A new Bayesian approach for determining the number of components in a finite mixture
    Aitkin M.
    Vu D.
    Francis B.
    [J]. METRON, 2015, 73 (2) : 155 - 176
  • [10] THE NUMBER OF COMPUTABLE INDEXATIONS OF FINITE CLASSES OF CONSTRUCTIVE MODELS
    DOBRITSA, VP
    [J]. MATHEMATICAL NOTES, 1986, 40 (1-2) : 551 - 553