Model-based clustering with non-elliptically contoured distributions

被引:75
|
作者
Karlis, Dimitris [1 ]
Santourian, Anais [1 ]
机构
[1] Athens Univ Econ & Business, Dept Stat, Athens 10434, Greece
关键词
Normal inverse Gaussian distribution; EM-algorithm; Scale normal mixtures; Heavy tailed distributions; SKEW-NORMAL-DISTRIBUTION; T-DISTRIBUTION; MIXTURE; ALGORITHM;
D O I
10.1007/s11222-008-9072-0
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The majority of the existing literature on model-based clustering deals with symmetric components. In some cases, especially when dealing with skewed subpopulations, the estimate of the number of groups can be misleading; if symmetric components are assumed we need more than one component to describe an asymmetric group. Existing mixture models, based on multivariate normal distributions and multivariate t distributions, try to fit symmetric distributions, i.e. they fit symmetric clusters. In the present paper, we propose the use of finite mixtures of the normal inverse Gaussian distribution (and its multivariate extensions). Such finite mixture models start from a density that allows for skewness and fat tails, generalize the existing models, are tractable and have desirable properties. We examine both the univariate case, to gain insight, and the multivariate case, which is more useful in real applications. EM type algorithms are described for fitting the models. Real data examples are used to demonstrate the potential of the new model in comparison with existing ones.
引用
收藏
页码:73 / 83
页数:11
相关论文
共 50 条