Model-based clustering using a new multivariate skew distribution

被引:2
|
作者
Tomarchio, Salvatore D. [1 ]
Bagnato, Luca [2 ]
Punzo, Antonio [1 ]
机构
[1] Univ Catania, Dept Econ & Business, Catania, Italy
[2] Univ Cattolica Sacro Cuore, Dept Econ & Social Sci, Piacenza, Italy
关键词
Mixture models; Skewed data; Model-based clustering; Cryptocurrencies; MAXIMUM-LIKELIHOOD; BAYESIAN-INFERENCE; MIXTURE;
D O I
10.1007/s11634-023-00552-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Quite often real data exhibit non-normal features, such as asymmetry and heavy tails, and present a latent group structure. In this paper, we first propose the multivariate skew shifted exponential normal distribution that can account for these non-normal characteristics. Then, we use this distribution in a finite mixture modeling framework. An EM algorithm is illustrated for maximum-likelihood parameter estimation. We provide a simulation study that compares the fitting performance of our model with those of several alternative models. The comparison is also conducted on a real dataset concerning the log returns of four cryptocurrencies.
引用
收藏
页码:61 / 83
页数:23
相关论文
共 50 条
  • [1] Model-based clustering using a new multivariate skew distribution
    Salvatore D. Tomarchio
    Luca Bagnato
    Antonio Punzo
    Advances in Data Analysis and Classification, 2024, 18 : 61 - 83
  • [2] Model-based clustering of multivariate skew data with circular components and missing values
    Lagona, Francesco
    Picone, Marco
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (05) : 927 - 945
  • [3] The orthogonal skew model: computationally efficient multivariate skew-normal and skew-t distributions with applications to model-based clustering
    Browne, Ryan P.
    Andrews, Jeffrey L.
    TEST, 2024, 33 (03) : 752 - 785
  • [4] An overview of skew distributions in model-based clustering
    Lee, Sharon X.
    McLachlan, Geoffrey J.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 188
  • [5] Model-based clustering for multivariate functional data
    Jacques, Julien
    Preda, Cristian
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 92 - 106
  • [6] The multivariate leptokurtic-normal distribution and its application in model-based clustering
    Bagnato, Luca
    Punzo, Antonio
    Zoia, Maria G.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2017, 45 (01): : 95 - 119
  • [7] Parsimonious skew mixture models for model-based clustering and classification
    Vrbik, Irene
    McNicholas, Paul D.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 196 - 210
  • [8] MMPClust: A skew prevention algorithm for model-based document clustering
    Li, XG
    Yu, G
    Wang, DL
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2005, 3453 : 536 - 547
  • [9] teigen: An R Package for Model-Based Clustering and Classification via the Multivariate t Distribution
    Andrews, Jeffrey L.
    Wickins, Jaymeson R.
    Boers, Nicholas M.
    McNicholas, Paul D.
    JOURNAL OF STATISTICAL SOFTWARE, 2018, 83 (07): : 1 - 32
  • [10] A Model-Based Multivariate Time Series Clustering Algorithm
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 8643 : 805 - 817