Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributions

Cited by: 99
Authors
Andrews, Jeffrey L. [1 ]
McNicholas, Paul D. [1 ]
Affiliations
[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
Funding
Canada Foundation for Innovation;
Keywords
Classification; Clustering; Discriminant analysis; Eigen-decomposition; Mixture models; Model-based clustering; Multivariate t-distribution; MAXIMUM-LIKELIHOOD; VARIABLE SELECTION; FACTOR ANALYZERS; ALGORITHM;
DOI
10.1007/s11222-011-9272-x
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
The last decade has seen an explosion of work on the use of mixture models for clustering. The use of the Gaussian mixture model has been common practice, with constraints sometimes imposed upon the component covariance matrices to give families of mixture models. Similar approaches have also been applied, albeit with less fecundity, to classification and discriminant analysis. In this paper, we begin with an introduction to model-based clustering and a succinct account of the state-of-the-art. We then put forth a novel family of mixture models wherein each component is modeled using a multivariate t-distribution with an eigen-decomposed covariance structure. This family, which is largely a t-analogue of the well-known MCLUST family, is known as the tEIGEN family. The efficacy of this family for clustering, classification, and discriminant analysis is illustrated with both real and simulated data. The performance of this family is compared to its Gaussian counterpart on three real data sets.
Pages: 1021 - 1029
Page count: 9
Related papers
50 in total
  • [31] Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures
    Morris, Katherine
    McNicholas, Paul D.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 97 : 133 - 150
  • [32] Probabilistic model-based discriminant analysis and clustering methods in chemometrics
    Bouveyron, Charles
    JOURNAL OF CHEMOMETRICS, 2013, 27 (12) : 433 - 446
  • [33] On robust probabilistic principal component analysis using multivariate t-distributions
    Guo, Yiping
    Bondell, Howard
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (23) : 8261 - 8279
  • [34] Model-based clustering and classification with non-normal mixture distributions
    Sharon X. Lee
    Geoffrey J. McLachlan
    Statistical Methods & Applications, 2013, 22 : 427 - 454
  • [35] Multiple hypothesis testing and clustering with mixtures of non-central t-distributions applied in microarray data analysis
    Marin, J. M.
    Rodriguez-Bernal, M. T.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) : 1898 - 1907
  • [36] Mixture model-based functional discriminant analysis for curve classification
    Chamroukhi, Faicel
    Glotin, Herve
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [37] Model-Based Clustering, Classification, and Discriminant Analysis Using the Generalized Hyperbolic Distribution: MixGHD R package
    Tortora, Cristina
    Browne, Ryan P.
    ElSherbiny, Aisha
    Franczak, Brian C.
    McNicholas, Paul D.
    JOURNAL OF STATISTICAL SOFTWARE, 2021, 98 (03) : 1 - 24
  • [38] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Gilles Celeux
    Cathy Maugis-Rabusseau
    Mohammed Sedki
    Advances in Data Analysis and Classification, 2019, 13 : 259 - 278
  • [39] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Celeux, Gilles
    Maugis-Rabusseau, Cathy
    Sedki, Mohammed
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (01) : 259 - 278
  • [40] Model-based clustering of censored data via mixtures of factor analyzers
    Wang, Wan-Lun
    Castro, Luis M.
    Lachos, Victor H.
    Lin, Tsung-I
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 140 : 104 - 121