Choosing models in model-based clustering and discriminant analysis

Cited by: 74
Authors
Biernacki, C
Govaert, G
Affiliations
[1] INRIA Rhone Alps, ZIRST, F-38330 St Martin, France
[2] Univ Technol Compiegne, CNRS, UMR 6599, F-60205 Compiegne, France
Keywords
Gaussian mixture models; eigenvalue decomposition; cross-validation; information, Bayesian and classification criteria
DOI
10.1080/00949659908811966
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Using an eigenvalue decomposition of variance matrices, Celeux and Govaert (1993) obtained numerous and powerful models for Gaussian model-based clustering and discriminant analysis. Through Monte Carlo simulations, we compare the performance of many classical criteria for selecting among these models: information criteria such as AIC, the Bayesian criterion BIC, classification criteria such as NEC, and cross-validation. In the clustering context, information criteria and BIC outperform the classification criteria. In the discriminant analysis context, cross-validation performs well, but information criteria and BIC also give satisfactory results with far less computing time.
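As a rough illustration of how penalized-likelihood criteria such as AIC and BIC rank candidate mixture models, the sketch below counts free parameters for a few covariance structures and applies the standard formulas. The function names and model labels are hypothetical and not taken from the paper; only four of the possible covariance structures are shown, and the "smaller is better" sign convention is an assumption (some authors use the opposite sign).

```python
import math


def n_free_params(cov_model: str, K: int, d: int) -> int:
    """Count free parameters of a K-component, d-dimensional Gaussian mixture.

    The model labels below are illustrative names, not the paper's notation.
    """
    proportions = K - 1        # mixing proportions sum to 1
    means = K * d              # one mean vector per component
    full = d * (d + 1) // 2    # free entries of one symmetric d x d matrix
    covariance = {
        "spherical_equal": 1,   # Sigma_k = lambda * I
        "spherical_free": K,    # Sigma_k = lambda_k * I
        "full_equal": full,     # Sigma_k = Sigma for all k
        "full_free": K * full,  # each Sigma_k unconstrained
    }[cov_model]
    return proportions + means + covariance


def aic(loglik: float, nu: int) -> float:
    """AIC = -2 log L + 2 nu (smaller is better under this convention)."""
    return -2.0 * loglik + 2.0 * nu


def bic(loglik: float, nu: int, n: int) -> float:
    """BIC = -2 log L + nu log n, with n the sample size."""
    return -2.0 * loglik + nu * math.log(n)
```

For instance, with K = 2 components in d = 2 dimensions, the unconstrained model has 1 + 4 + 6 = 11 free parameters. In practice each candidate model would be fitted by maximum likelihood, and the model with the smallest criterion value retained.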
Pages: 49 - 71
Page count: 23
Related papers
50 in total
  • [31] Model-based linear clustering
    Yan, Guohua
    Welch, William J.
    Zamar, Ruben H.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (04) : 716 - 737
  • [32] Model-Based Edge Clustering
    Sewell, Daniel K.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (02) : 390 - 405
  • [33] Model-Based Clustering with HDBSCAN
    Strobl, Michael
    Sander, Joerg
    Campello, Ricardo J. G. B.
    Zaiane, Osmar
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 364 - 379
  • [34] A model-based distance for clustering
    Rattray, M
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL IV, 2000 : 13 - 16
  • [35] Parametric model-based clustering
    Nikulin, V
    Smola, AJ
    DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2005, 2005, 5812 : 190 - 201
  • [36] Model-based subspace clustering
    Hoff, Peter D.
    BAYESIAN ANALYSIS, 2006, 1 (02) : 321 - 344
  • [37] A Bayesian approach to model-based clustering for binary panel probit models
    Assmann, Christian
    Boysen-Hogrefe, Jens
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (01) : 261 - 279
  • [38] Model-based clustering via linear cluster-weighted models
    Ingrassia, Salvatore
    Minotti, Simona C.
    Punzo, Antonio
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 159 - 182
  • [39] Discriminant features for model-based image databases
    Dong, A
    Bhanu, B
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004 : 997 - 1000
  • [40] Weighted model-based clustering for remote sensing image analysis
    Joseph W. Richards
    Johanna Hardin
    Eric B. Grosfils
    Computational Geosciences, 2010, 14 : 125 - 136