Robust clustering via mixtures of t factor analyzers with incomplete data

被引:6
|
作者
Wang, Wan-Lun [1 ]
Lin, Tsung-, I [2 ,3 ]
机构
[1] Feng Chia Univ, Grad Inst Stat & Actuarial Sci, Dept Stat, Taichung 40724, Taiwan
[2] Natl Chung Hsing Univ, Inst Stat, Taichung 402, Taiwan
[3] China Med Univ, Dept Publ Hlth, Taichung 404, Taiwan
关键词
Data reduction; Factor analyzer; Information matrix; Mixture models; Multivariate t distribution; Missing data; MAXIMUM-LIKELIHOOD-ESTIMATION; MULTIVARIATE NORMAL-DISTRIBUTION; ECM ALGORITHM; ML ESTIMATION; MODELS; INFERENCE;
D O I
10.1007/s11634-021-00453-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Mixtures of t factor analyzers (MtFA) are powerful and widely used tools for robust clustering of high-dimensional data in the presence of outliers. However, the occurrence of missing values may cause analytical intractability and computational complexity when fitting the MtFA model. We explicitly derive the score vector and Hessian matrix of the MtFA model with incomplete data to approximate the information matrix. In this regard, some asymptotic properties can be established under certain regularity conditions. Three expectation-maximization-based algorithms are developed for maximum likelihood estimation of the MtFA model with possibly missing values at random. Practical issues related to the recovery of missing values and clustering of partially observed samples are also investigated. The relevant utility of our methodology is exemplified through the analysis of simulated and real data sets.
引用
收藏
页码:659 / 690
页数:32
相关论文
共 50 条
  • [21] Mixtures of modified t-factor analyzers for model-based clustering, classification, and discriminant analysis
    Andrews, Jeffrey L.
    McNicholas, Paul D.
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2011, 141 (04) : 1479 - 1486
  • [22] Simultaneous Bayesian Clustering and Model Selection with Mixture of Robust Factor Analyzers
    Feng, Shan
    Xie, Wenxian
    Nie, Yufeng
    [J]. MATHEMATICS, 2024, 12 (07)
  • [23] Clustering and classification via cluster-weighted factor analyzers
    Sanjeena Subedi
    Antonio Punzo
    Salvatore Ingrassia
    Paul D. McNicholas
    [J]. Advances in Data Analysis and Classification, 2013, 7 : 5 - 40
  • [24] Clustering and classification via cluster-weighted factor analyzers
    Subedi, Sanjeena
    Punzo, Antonio
    Ingrassia, Salvatore
    McNicholas, Paul D.
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2013, 7 (01) : 5 - 40
  • [25] Robust fitting of mixtures of factor analyzers using the trimmed likelihood estimator
    Yang, Li
    Xiang, Sijia
    Yao, Weixin
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (02) : 1280 - 1291
  • [26] Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data
    Tu, Wangshu
    Subedi, Sanjeena
    [J]. JOURNAL OF CLASSIFICATION, 2023, 40 (03) : 638 - 667
  • [27] Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data
    Wangshu Tu
    Sanjeena Subedi
    [J]. Journal of Classification, 2023, 40 : 638 - 667
  • [28] Modelling high-dimensional data by mixtures of factor analyzers
    McLachlan, GJ
    Peel, D
    Bean, RW
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 41 (3-4) : 379 - 388
  • [29] Mixtures of restricted skew-t factor analyzers with common factor loadings
    Wan-Lun Wang
    Luis M. Castro
    Yen-Ting Chang
    Tsung-I Lin
    [J]. Advances in Data Analysis and Classification, 2019, 13 : 445 - 480
  • [30] Mixtures of restricted skew-t factor analyzers with common factor loadings
    Wang, Wan-Lun
    Castro, Luis M.
    Chang, Yen-Ting
    Lin, Tsung-I
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (02) : 445 - 480