A Penalized Matrix Normal Mixture Model for Clustering Matrix Data

被引:0
|
作者
Heo, Jinwon [1 ]
Baek, Jangsun [1 ]
机构
[1] Chonnam Natl Univ, Dept Math & Stat, 77 Yongbong Ro, Gwangju 61186, South Korea
基金
新加坡国家研究基金会;
关键词
clustering; image analysis; matrix normal distribution; expectation maximization algorithm; penalized likelihood; SELECTION;
D O I
10.3390/e23101249
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Along with advances in technology, matrix data, such as medical/industrial images, have emerged in many practical fields. These data usually have high dimensions and are not easy to cluster due to their intrinsic correlated structure among rows and columns. Most approaches convert matrix data to multi dimensional vectors and apply conventional clustering methods to them, and thus, suffer from an extreme high-dimensionality problem as well as a lack of interpretability of the correlated structure among row/column variables. Recently, a regularized model was proposed for clustering matrix-valued data by imposing a sparsity structure for the mean signal of each cluster. We extend their approach by regularizing further on the covariance to cope better with the curse of dimensionality for large size images. A penalized matrix normal mixture model with lasso-type penalty terms in both mean and covariance matrices is proposed, and then an expectation maximization algorithm is developed to estimate the parameters. The proposed method has the competence of both parsimonious modeling and reflecting the proper conditional correlation structure. The estimators are consistent, and their limiting distributions are derived. We applied the proposed method to simulated data as well as real datasets and measured its clustering performance with the clustering accuracy (ACC) and the adjusted rand index (ARI). The experiment results show that the proposed method performed better with higher ACC and ARI than those of conventional methods.</p>
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Tumor Clustering based on Penalized Matrix Decomposition
    Zheng, Chun-Hou
    Wang, Juan
    Ng, To-Yee
    Shiu, Chi Keung
    [J]. 2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,
  • [2] Three-way data clustering based on the mean-mixture of matrix-variate normal distributions
    Naderi, Mehrdad
    Tamandi, Mostafa
    Mirfarah, Elham
    Wang, Wan-Lun
    Lin, Tsung-, I
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 199
  • [3] On model-based clustering of skewed matrix data
    Melnykov, Volodymyr
    Zhu, Xuwen
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 167 : 181 - 194
  • [4] Model Based Penalized Clustering for Multivariate Data
    Ghosh, Samiran
    Dey, Dipak K.
    [J]. ADVANCES IN MULTIVARIATE STATISTICAL METHODS, 2009, 4 : 53 - +
  • [5] DIRECT CLUSTERING OF A DATA MATRIX
    HARTIGAN, JA
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1972, 67 (337) : 123 - &
  • [6] A theoretical framework for Landsat data modeling based on the matrix variate mean-mixture of normal model
    Naderi, Mehrdad
    Bekker, Andriette
    Arashi, Mohammad
    Jamalizadeh, Ahad
    [J]. PLOS ONE, 2020, 15 (04):
  • [7] Mixture of Networks for Clustering Categorical Data: A Penalized Composite Likelihood Approach
    Baek, Jangsun
    Park, Jeong-Soo
    [J]. AMERICAN STATISTICIAN, 2023, 77 (03): : 259 - 273
  • [8] Clustering longitudinal ordinal data via finite mixture of matrix-variate distributions
    Amato, Francesco
    Jacques, Julien
    Prim-Allaz, Isabelle
    [J]. STATISTICS AND COMPUTING, 2024, 34 (02)
  • [9] Clustering longitudinal ordinal data via finite mixture of matrix-variate distributions
    Francesco Amato
    Julien Jacques
    Isabelle Prim-Allaz
    [J]. Statistics and Computing, 2024, 34
  • [10] THE TRANSFORMATION MATRIX OF COLOR MIXTURE DATA
    COHEN, J
    WALKER, RJ
    [J]. AMERICAN JOURNAL OF PSYCHOLOGY, 1946, 59 (02): : 299 - 305