A Trimmed Clustering-Based l1-Principal Component Analysis Model for Image Classification and Clustering Problems with Outliers

Cited by: 1
Authors
Lam, Benson S. Y. [1]
Choy, S. K. [1]
Affiliations
[1] Hang Seng Univ Hong Kong, Dept Math & Stat, Hong Kong, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2019 / Vol. 9 / Issue 08
Keywords
principal component analysis; dimensionality reduction; image processing; pattern recognition; clustering; ROBUST PCA; REPRESENTATION; MAXIMIZATION;
DOI
10.3390/app9081562
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Different versions of principal component analysis (PCA) have been widely used to extract important information for image recognition and image clustering problems. However, owing to the presence of outliers, this remains challenging. This paper proposes a new PCA methodology based on a novel discovery that the widely used l1-PCA is equivalent to a two-group k-means clustering model. The projection vector of the l1-PCA is the vector difference between the two cluster centers estimated by the clustering model. In theory, this vector difference provides inter-cluster information, which is beneficial for distinguishing data objects from different classes. However, the performance of l1-PCA is not comparable with that of state-of-the-art methods. This is because l1-PCA can be sensitive to outliers, as the equivalent clustering model is not robust to outliers. To overcome this limitation, we introduce a trimming function into the clustering model and propose a trimmed clustering-based l1-PCA (TC-PCA). With this trimming set formulation, the TC-PCA is not sensitive to outliers. In addition, we mathematically prove the convergence of the proposed algorithm. Experimental results on image classification and clustering indicate that our proposed method outperforms the current state-of-the-art methods.
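The link between l1-PCA and a two-group clustering model described in the abstract can be illustrated with a small sketch. This is not the paper's TC-PCA algorithm. It assumes centered data, models the two groups with symmetric centers +m and -m (so their difference is 2m, parallel to the projection vector), and uses a generic trimmed-k-means-style rule that drops the samples farthest from their assigned center. The function name `l1_pca_direction` and the parameter `trim_frac` are hypothetical and chosen only for illustration.

```python
import numpy as np


def l1_pca_direction(X, trim_frac=0.0, n_iter=100, seed=0):
    """Sketch: one l1-PCA direction via a symmetric two-center clustering view.

    X is (n_samples, n_features) and is assumed to be centered.
    trim_frac is a hypothetical parameter: the fraction of samples ignored
    when the centers are estimated (0.0 gives the plain sign-flipping
    fixed-point iteration for max_w sum_i |w^T x_i| with ||w||_2 = 1).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    n_keep = n - int(np.floor(trim_frac * n))
    w = rng.standard_normal(d)
    w /= np.linalg.norm(w)
    keep = np.ones(n, dtype=bool)
    for _ in range(n_iter):
        # Assign each sample to one of two groups by the sign of its projection.
        s = np.where(X @ w >= 0, 1.0, -1.0)
        # Group centers are +m and -m; their difference 2m is parallel to w.
        m = (s[keep, None] * X[keep]).mean(axis=0)
        if n_keep < n:
            # Assumed trimming rule: keep the samples closest to their center.
            dist = np.linalg.norm(X - s[:, None] * m, axis=1)
            keep = np.zeros(n, dtype=bool)
            keep[np.argsort(dist)[:n_keep]] = True
            m = (s[keep, None] * X[keep]).mean(axis=0)
        norm = np.linalg.norm(m)
        if norm == 0.0:
            break
        w_new = m / norm
        converged = np.allclose(w_new, w)
        w = w_new
        if converged:
            break
    return w, keep


# Synthetic demo: inliers spread along the first axis, plus 10 gross
# outliers placed symmetrically along the second axis.
rng = np.random.default_rng(42)
inliers = np.column_stack([rng.normal(0.0, 5.0, 200), rng.normal(0.0, 0.1, 200)])
outliers = np.array([[0.0, 100.0]] * 5 + [[0.0, -100.0]] * 5)
X = np.vstack([inliers, outliers])
X = X - X.mean(axis=0)

w_plain, _ = l1_pca_direction(X, trim_frac=0.0)
w_trim, keep = l1_pca_direction(X, trim_frac=0.1)
print("untrimmed direction:", w_plain)
print("trimmed direction:  ", w_trim)
```

In this synthetic setup the untrimmed direction is pulled toward the outlier axis, while the trimmed run should stay close to the inlier axis. How the trimming set is actually defined and optimized in TC-PCA is specified in the paper itself and may differ from the rule assumed here.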
Pages: 25