Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributions

被引：99

作者：

Andrews, Jeffrey L. ^{[1
]}

McNicholas, Paul D. ^{[1
]}

机构：

[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada

来源：

STATISTICS AND COMPUTING | 2012年 / 22卷 / 05期

基金：

加拿大创新基金会;

关键词：

Classification; Clustering; Discriminant analysis; Eigen-decomposition; Mixture models; Model-based clustering; Multivariate t-distribution; MAXIMUM-LIKELIHOOD; VARIABLE SELECTION; FACTOR ANALYZERS; ALGORITHM;

D O I：

10.1007/s11222-011-9272-x

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The last decade has seen an explosion of work on the use of mixture models for clustering. The use of the Gaussian mixture model has been common practice, with constraints sometimes imposed upon the component covariance matrices to give families of mixture models. Similar approaches have also been applied, albeit with less fecundity, to classification and discriminant analysis. In this paper, we begin with an introduction to model-based clustering and a succinct account of the state-of-the-art. We then put forth a novel family of mixture models wherein each component is modeled using a multivariate t-distribution with an eigen-decomposed covariance structure. This family, which is largely a t-analogue of the well-known MCLUST family, is known as the tEIGEN family. The efficacy of this family for clustering, classification, and discriminant analysis is illustrated with both real and simulated data. The performance of this family is compared to its Gaussian counterpart on three real data sets.

引用

页码：1021 / 1029

页数：9

共 50 条

[1] Model-based classification via mixtures of multivariate t-distributions
Andrews, Jeffrey L.
McNicholas, Paul D.
Subedi, Sanjeena
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (01) : 520 - 529
[2] Dimension reduction for model-based clustering via mixtures of multivariate t-distributions
Morris, Katherine
McNicholas, Paul D.
Scrucca, Luca
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2013, 7 (03) : 321 - 338
[3] Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributionsThe tEIGEN family
Jeffrey L. Andrews
Paul D. McNicholas
Statistics and Computing, 2012, 22 : 1021 - 1029
[4] SEQUENTIAL DIRICHLET PROCESS MIXTURES OF MULTIVARIATE SKEW t-DISTRIBUTIONS FOR MODEL-BASED CLUSTERING OF FLOW CYTOMETRY DATA
Hejblum, Boris P.
Alkhassim, Chariff
Gottardo, Raphael
Caron, Frakois
Thiebaut, Rodolphe
ANNALS OF APPLIED STATISTICS, 2019, 13 (01): : 638 - 660
[5] Mixtures of modified t-factor analyzers for model-based clustering, classification, and discriminant analysis
Andrews, Jeffrey L.
McNicholas, Paul D.
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2011, 141 (04) : 1479 - 1486
[6] Robust clustering by deterministic agglomeration EM of mixtures of multivariate t-distributions
Shoham, S
PATTERN RECOGNITION, 2002, 35 (05) : 1127 - 1142
[7] On Model-Based Clustering, Classification, and Discriminant Analysis
McNicholas, Paul D.
JIRSS-JOURNAL OF THE IRANIAN STATISTICAL SOCIETY, 2011, 10 (02): : 181 - 199
[8] Model-based clustering of functional data via mixtures of t distributions
Anton, Cristina
Smith, Iain
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (03) : 563 - 595
[9] Model-Based Clustering and Classification Using Mixtures of Multivariate Skewed Power Exponential Distributions
Dang, Utkarsh J.
Gallaugher, Michael P. B.
Browne, Ryan P.
McNicholas, Paul D.
JOURNAL OF CLASSIFICATION, 2023, 40 (01) : 145 - 167
[10] Model-Based Clustering and Classification Using Mixtures of Multivariate Skewed Power Exponential Distributions
Utkarsh J. Dang
Michael P.B. Gallaugher
Ryan P. Browne
Paul D. McNicholas
Journal of Classification, 2023, 40 : 145 - 167

← 1 2 3 4 5 →