A LASSO-penalized BIC for mixture model selection

Cited by: 20
Authors
Bhattacharya, Sakyajit [1]
McNicholas, Paul D. [1]
Affiliation
[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
BIC; LASSO; Mixture models; Model-based clustering; Model selection; Variable selection; Information criterion; Oracle properties; EM algorithm; Likelihood; Shrinkage; Choice
DOI
10.1007/s11634-013-0155-1
Chinese Library Classification
O21 [Probability and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
The efficacy of family-based approaches to mixture model-based clustering and classification depends on the selection of parsimonious models. Current wisdom suggests the Bayesian information criterion (BIC) for mixture model selection. However, the BIC has well-known limitations, including a tendency to overestimate the number of components as well as a proclivity for underestimating, often drastically, the number of components in higher dimensions. While the former problem might be soluble by merging components, the latter is impossible to mitigate in clustering and classification applications. In this paper, a LASSO-penalized BIC (LPBIC) is introduced to overcome this problem. This approach is illustrated based on applications of extensions of mixtures of factor analyzers, where the LPBIC is used to select both the number of components and the number of latent factors. The LPBIC is shown to match or outperform the BIC in several situations.
Pages: 45-61
Number of pages: 17
Related papers
50 in total
  • [31] BIC Extensions for Order-constrained Model Selection
    Mulder, J.
    Raftery, A. E.
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2022, 51 (02) : 471 - 498
  • [32] The ABC of model selection: AIC, BIC and the new CIC
    Rodríguez, CC
    [J]. Bayesian Inference and Maximum Entropy Methods in Science and Engineering, 2005, 803 : 80 - 87
  • [33] Overview of LASSO-related penalized regression methods for quantitative trait mapping and genomic selection
    Li, Zitong
    Sillanpaa, Mikko J.
    [J]. THEORETICAL AND APPLIED GENETICS, 2012, 125 (03) : 419 - 435
  • [34] Overview of LASSO-related penalized regression methods for quantitative trait mapping and genomic selection
    Zitong Li
    Mikko J. Sillanpää
    [J]. Theoretical and Applied Genetics, 2012, 125 : 419 - 435
  • [35] Outlier detection and robust variable selection via the penalized weighted LAD-LASSO method
    Jiang, Yunlu
    Wang, Yan
    Zhang, Jiantao
    Xie, Baojian
    Liao, Jibiao
    Liao, Wenhui
    [J]. JOURNAL OF APPLIED STATISTICS, 2021, 48 (02) : 234 - 246
  • [36] Penalized logistic regression with the adaptive LASSO for gene selection in high-dimensional cancer classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (23) : 9326 - 9332
  • [37] Model Selection Via Penalized Logistic Regression
    Ayers, Kristin L.
    Cordell, Heather J.
    [J]. GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 770 - 770
  • [38] A fuzzy penalized regression model with variable selection
    Kashani, M.
    Arashi, M.
    Rabiei, M. R.
    D'Urso, P.
    De Giovanni, L.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 175
  • [39] Doubly Penalized LASSO for Reconstruction of Biological Networks
    Asadi, Behrang
    Maurya, Mano Ram
    Tartakovsky, Daniel M.
    Subramaniam, Shankar
    [J]. PROCEEDINGS OF THE IEEE, 2017, 105 (02) : 319 - 329
  • [40] A note on the Lasso and related procedures in model selection
    Leng, Chenlei
    Lin, Yi
    Wahba, Grace
    [J]. STATISTICA SINICA, 2006, 16 (04) : 1273 - 1284