PCA-guided search for K-means

被引:41
|
作者
Xu, Qin [1 ]
Ding, Chris [2 ]
Liu, Jinpei [3 ]
Luo, Bin [1 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[2] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[3] Anhui Univ, Sch Business, Hefei 730601, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
K-means; Principal component analysis; Cluster centroid initialization; Clustering; ALGORITHM;
D O I
10.1016/j.patrec.2014.11.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to the non-convexity of the model formulations, expectation-maximization (EM) type algorithms converge to different local optima with different initializations. Recent discoveries have identified that the global solution of K-means cluster centroids lies in the principal component analysis (PCA) subspace. Based on this insight, we propose PCA-guided effective search for K-means. Because the PCA subspace is much smaller than the original space, searching in the PCA subspace is both more effective and efficient. Extensive experiments on four real world data sets and systematic comparison with previous algorithms demonstrate that our proposed method outperforms the rest as it makes the K-means more effective. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:50 / 55
页数:6
相关论文
共 50 条
  • [41] Local Search Yields a PTAS for k-Means in Doubling Metrics
    Friggstad, Zachary
    Rezapour, Mohsen
    Salavatipour, Mohammad R.
    2016 IEEE 57TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2016, : 365 - 374
  • [42] Local search approximation algorithms for the k-means problem with penalties
    Zhang, Dongmei
    Hao, Chunlin
    Wu, Chenchen
    Xu, Dachuan
    Zhang, Zhenning
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2019, 37 (02) : 439 - 453
  • [43] Joint K-Means Quantization for Approximate Nearest Neighbor Search
    Ozan, Ezgi Can
    Kiranyaz, Serkan
    Gabbouj, Moncef
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3645 - 3649
  • [44] LOCAL SEARCH YIELDS A PTAS FOR k-MEANS IN DOUBLING METRICS
    Friggstad, Zachary
    Rezapour, Mohsen
    Salavatipour, Mohammad R.
    SIAM JOURNAL ON COMPUTING, 2019, 48 (02) : 452 - 480
  • [45] Empirical Evaluation of K-Means, Bisecting K-Means, Fuzzy C-Means and Genetic K-Means Clustering Algorithms
    Banerjee, Shreya
    Choudhary, Ankit
    Pal, Somnath
    2015 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2015, : 172 - 176
  • [46] Identification of piecewise affine systems based on fuzzy PCA-guided robust clustering technique
    Esmaeel Khanmirza
    Milad Nazarahari
    Alireza Mousavi
    EURASIP Journal on Advances in Signal Processing, 2016
  • [47] Identification of piecewise affine systems based on fuzzy PCA-guided robust clustering technique
    Khanmirza, Esmaeel
    Nazarahari, Milad
    Mousavi, Alireza
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2016,
  • [48] Deep k-Means: Jointly clustering with k-Means and learning representations
    Fard, Maziar Moradi
    Thonet, Thibaut
    Gaussier, Eric
    PATTERN RECOGNITION LETTERS, 2020, 138 : 185 - 192
  • [49] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
  • [50] K and starting means for k-means algorithm
    Fahim, Ahmed
    JOURNAL OF COMPUTATIONAL SCIENCE, 2021, 55