PCA-guided search for K-means

被引:41
|
作者
Xu, Qin [1 ]
Ding, Chris [2 ]
Liu, Jinpei [3 ]
Luo, Bin [1 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[2] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[3] Anhui Univ, Sch Business, Hefei 730601, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
K-means; Principal component analysis; Cluster centroid initialization; Clustering; ALGORITHM;
D O I
10.1016/j.patrec.2014.11.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to the non-convexity of the model formulations, expectation-maximization (EM) type algorithms converge to different local optima with different initializations. Recent discoveries have identified that the global solution of K-means cluster centroids lies in the principal component analysis (PCA) subspace. Based on this insight, we propose PCA-guided effective search for K-means. Because the PCA subspace is much smaller than the original space, searching in the PCA subspace is both more effective and efficient. Extensive experiments on four real world data sets and systematic comparison with previous algorithms demonstrate that our proposed method outperforms the rest as it makes the K-means more effective. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:50 / 55
页数:6
相关论文
共 50 条
  • [1] PCA-guided k-Means Clustering With Incomplete Data
    Honda, Katsuhiro
    Nonoguchi, Ryoichi
    Notsu, Akira
    Ichihashi, Hidetomo
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1710 - 1714
  • [2] Fuzzy PCA-Guided Robust k-Means Clustering
    Honda, Katsuhiro
    Notsu, Akira
    Ichihashi, Hidetomo
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2010, 18 (01) : 67 - 79
  • [3] Cluster Validation in k-Means Clustering Based on PCA-guided k-Means and Procrustean Transformation of PC Scores
    Matsui, Tomohiro
    Honda, Katsuhiro
    Oh, Chi-Hyon
    Notsu, Akira
    Ichihashi, Hidetomo
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 1546 - +
  • [4] Variable Weighting in PCA-Guided k-Means and its Connection with Information Summarization
    Honda, Katsuhiro
    Notsu, Akira
    Ichihashi, Hidetomo
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (01) : 83 - 89
  • [5] PCA-Guided k-Means with Variable Weighting and Its Application to Document Clustering
    Honda, Katsuhiro
    Notsu, Akira
    Ichihashi, Hidetomo
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5861 : 282 - 292
  • [6] PCA-Guided Routing Algorithm for Wireless Sensor Networks
    Chen, Gong
    Tan, Liansheng
    Gong, Yanlin
    Zhang, Wei
    JOURNAL OF COMPUTER NETWORKS AND COMMUNICATIONS, 2012, 2012
  • [7] Selective K-means Tree Search
    Tuan Anh Nguyen
    Matsui, Yusuke
    Yamasaki, Toshihiko
    Aizawa, Kiyoharu
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 875 - 878
  • [8] Search Space Reduction for Determination of Earthquake Source Parameters Using PCA and k-Means Clustering
    Lee, Seongjae
    Kim, Taehyoun
    JOURNAL OF SENSORS, 2020, 2020
  • [9] Network Traffic Prediction using PCA and K-means
    Holanda Filho, Raimir
    Bessa Maia, Jose Everardo
    PROCEEDINGS OF THE 2010 IEEE-IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2010, : 938 - 941
  • [10] A heuristic K-means clustering algorithm by kernel PCA
    Xu, MT
    Fränti, P
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 3503 - 3506