Improved initial cluster center selection in K-means clustering

被引:15
|
作者
Zhu, Minchen [1 ]
Wang, Weizhi [2 ]
Huang, Jingshan [3 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350002, Peoples R China
[2] Fuzhou Univ, Coll Civil Engn, Fuzhou 350002, Peoples R China
[3] Univ S Alabama, Sch Comp, Mobile, AL 36688 USA
关键词
Initial cluster centre; Inner-class distance; Inter-class distance; K-means clustering; ALGORITHM;
D O I
10.1108/EC-11-2012-0288
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Purpose - It is well known that the selection of initial cluster centers can significantly affect K-means clustering results. The purpose of this paper is to propose an improved, efficient methodology to handle such a challenge. Design/methodology/approach - According to the fact that the inner-class distance among samples within the same cluster is supposed to be smaller than the inter-class distance among clusters, the algorithm will dynamically adjust initial cluster centers that are randomly selected. Consequently, such adjusted initial cluster centers will be highly representative in the sense that they are distributed among as many samples as possible. As a result, local optima that are common in K-means clustering can then be effectively reduced. In addition, the algorithm is able to obtain all initial cluster centers simultaneously (instead of one center at a time) during the dynamic adjustment. Findings - Experimental results demonstrate that the proposed algorithm greatly improves the accuracy of traditional K-means clustering results and, in a more efficient manner. Originality/value - The authors presented in this paper an efficient algorithm, which is able to dynamically adjust initial cluster centers that are randomly selected. The adjusted centers are highly representative, i. e. they are distributed among as many samples as possible. As a result, local optima that are common in K-means clustering can be effectively reduced so that the authors can achieve an improved clustering accuracy. In addition, the algorithm is a cost-efficient one and the enhanced clustering accuracy can be obtained in a more efficient manner compared with traditional K-means algorithm.
引用
收藏
页码:1661 / 1667
页数:7
相关论文
共 50 条
  • [21] A fast k-means clustering algorithm using cluster center displacement
    Lai, Jim Z. C.
    Huang, Tsung-Jen
    Liaw, Yi-Ching
    [J]. PATTERN RECOGNITION, 2009, 42 (11) : 2551 - 2556
  • [22] A K-means Clustering with Optimized Initial Center Based on Hadoop Platform
    Lin, Kunhui
    Li, Xiang
    Zhang, Zhongnan
    Chen, Jiahong
    [J]. 2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014), 2014, : 263 - 266
  • [23] Initial Centroid Selection Method for an Enhanced K-means Clustering Algorithm
    Aamer, Youssef
    Benkaouz, Yahya
    Ouzzif, Mohammed
    Bouragba, Khalid
    [J]. UBIQUITOUS NETWORKING, UNET 2019, 2020, 12293 : 182 - 190
  • [24] Improved K-means Clustering Algorithm Based on the Optimized Initial Centriods
    Wang, Shunye
    [J]. 2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 450 - 453
  • [25] An Improved K-means Clustering Algorithm Based on Meliorated Initial Centre
    Li, Xiang
    Wei, Zhenwei
    Li, Lingling
    [J]. PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRIAL ENGINEERING (AIIE 2016), 2016, 133 : 73 - 76
  • [26] An Improved K-means Clustering Algorithm
    Wang Yintong
    Li Wanlong
    Gao Rujia
    [J]. 2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [27] Improved K-means clustering algorithm
    Zhang, Zhe
    Zhang, Junxi
    Xue, Huifeng
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
  • [28] Improved Algorithm for the k-means Clustering
    Zhang, Sheng
    Wang, Shouqiang
    [J]. PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4717 - 4720
  • [29] An Improved Method for K-Means Clustering
    Cui, Xiaowei
    Wang, Fuxiang
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 756 - 759
  • [30] An improved K-means clustering algorithm
    Huang, Xiuchang
    Su, Wei
    [J]. Journal of Networks, 2014, 9 (01) : 161 - 167