A LASSO-penalized BIC for mixture model selection

被引:20
|
作者
Bhattacharya, Sakyajit [1 ]
McNicholas, Paul D. [1 ]
机构
[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
BIC; LASSO; Mixture models; Model-based clustering; Model selection; VARIABLE SELECTION; INFORMATION CRITERION; ORACLE PROPERTIES; EM ALGORITHM; LIKELIHOOD; SHRINKAGE; CHOICE;
D O I
10.1007/s11634-013-0155-1
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The efficacy of family-based approaches to mixture model-based clustering and classification depends on the selection of parsimonious models. Current wisdom suggests the Bayesian information criterion (BIC) for mixture model selection. However, the BIC has well-known limitations, including a tendency to overestimate the number of components as well as a proclivity for underestimating, often drastically, the number of components in higher dimensions. While the former problem might be soluble by merging components, the latter is impossible to mitigate in clustering and classification applications. In this paper, a LASSO-penalized BIC (LPBIC) is introduced to overcome this problem. This approach is illustrated based on applications of extensions of mixtures of factor analyzers, where the LPBIC is used to select both the number of components and the number of latent factors. The LPBIC is shown to match or outperform the BIC in several situations.
引用
收藏
页码:45 / 61
页数:17
相关论文
共 50 条
  • [21] On the Proper Forms of BIC for Model Order Selection
    Stoica, Petre
    Babu, Prabhu
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (09) : 4956 - 4961
  • [22] Cluster number selection using finite mixture model and penalized Fisher class separability measure
    Wang, Xudong
    Syrrnos, Vassilis L.
    [J]. 2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 4160 - +
  • [23] On improvability of model averaging by penalized model selection
    Cao, Kun
    Li, Xinmin
    Zhou, Yali
    Zou, Chenchen
    [J]. STAT, 2023, 12 (01):
  • [24] Penalized factor mixture analysis for variable selection in clustered data
    Galimberti, Giuliano
    Montanari, Angela
    Viroli, Cinzia
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (12) : 4301 - 4310
  • [25] Penalized regressions: The bridge versus the lasso
    Fu, WJJ
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (03) : 397 - 416
  • [26] Variable Selection of Lasso and Large Model
    Xia, Huiyi
    [J]. IEEE ACCESS, 2023, 11 : 96514 - 96521
  • [27] Improving Lasso for model selection and prediction
    Pokarowski, Piotr
    Rejchel, Wojciech
    Soltys, Agnieszka
    Frej, Michal
    Mielniczuk, Jan
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2022, 49 (02) : 831 - 863
  • [28] BIC Extensions for Order-constrained Model Selection
    Mulder, J.
    Raftery, A. E.
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2022, 51 (02) : 471 - 498
  • [29] Multimodel inference - understanding AIC and BIC in model selection
    Burnham, KP
    Anderson, DR
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2004, 33 (02) : 261 - 304
  • [30] A Batch Rival Penalized Expectation-Maximization Algorithm for Gaussian Mixture Clustering with Automatic Model Selection
    Wen, Jiechang
    Zhang, Dan
    Cheung, Yiu-ming
    Liu, Hailin
    You, Xinge
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2012, 2012