Improved initial cluster center selection in K-means clustering

Cited by: 15
Authors
Zhu, Minchen [1]
Wang, Weizhi [2]
Huang, Jingshan [3]
Affiliations
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350002, Peoples R China
[2] Fuzhou Univ, Coll Civil Engn, Fuzhou 350002, Peoples R China
[3] Univ S Alabama, Sch Comp, Mobile, AL 36688 USA
Keywords
Initial cluster centre; Inner-class distance; Inter-class distance; K-means clustering; Algorithm
DOI
10.1108/EC-11-2012-0288
Chinese Library Classification
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Purpose - It is well known that the selection of initial cluster centers can significantly affect K-means clustering results. The purpose of this paper is to propose an improved, efficient method for selecting these centers.

Design/methodology/approach - Because the inner-class distance among samples within the same cluster should be smaller than the inter-class distance among clusters, the algorithm dynamically adjusts initial cluster centers that are randomly selected. The adjusted centers are highly representative in the sense that they are distributed among as many samples as possible, so the local optima that are common in K-means clustering can be effectively reduced. In addition, the algorithm obtains all initial cluster centers simultaneously (instead of one center at a time) during the dynamic adjustment.

Findings - Experimental results demonstrate that the proposed algorithm greatly improves the accuracy of traditional K-means clustering and does so more efficiently.

Originality/value - The paper presents an efficient algorithm that dynamically adjusts randomly selected initial cluster centers. The adjusted centers are highly representative, i.e. they are distributed among as many samples as possible, which reduces the local optima common in K-means clustering and improves clustering accuracy. The algorithm is also cost-efficient: the improved accuracy is obtained more efficiently than with the traditional K-means algorithm.
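The abstract does not give the adjustment rule itself, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes one simple heuristic: while the two closest centers are nearer to each other than the mean inner-class distance, one of them is moved to the sample farthest from the remaining centers. The function names (`adjust_centers`, `kmeans`, and the helpers) and the toy data are hypothetical and chosen for illustration.

```python
import math

def nearest(point, centers):
    """Index of the center closest to `point` (lowest index wins ties)."""
    return min(range(len(centers)), key=lambda j: math.dist(point, centers[j]))

def mean_inner_distance(points, centers):
    """Average distance from each sample to its nearest center."""
    return sum(math.dist(p, centers[nearest(p, centers)]) for p in points) / len(points)

def closest_center_pair(centers):
    """Return (distance, i, j) for the two centers that lie closest together."""
    return min((math.dist(centers[i], centers[j]), i, j)
               for i in range(len(centers)) for j in range(i + 1, len(centers)))

def adjust_centers(points, centers, max_rounds=100):
    """Assumed heuristic (not from the paper): spread out centers that sit too close.

    While the smallest inter-center distance is below the mean inner-class
    distance, move one center of that pair to the sample whose distance to
    the nearest remaining center is largest. Requires at least two centers.
    """
    centers = list(centers)
    for _ in range(max_rounds):
        gap, _i, j = closest_center_pair(centers)
        if gap >= mean_inner_distance(points, centers):
            break
        others = centers[:j] + centers[j + 1:]
        centers[j] = max(points, key=lambda p: min(math.dist(p, c) for c in others))
    return centers

def kmeans(points, centers, max_iter=100):
    """Plain Lloyd iterations from the given centers; returns (centers, SSE)."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        groups = [[] for _ in centers]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # An empty cluster keeps its previous center.
        new_centers = [tuple(sum(col) / len(g) for col in zip(*g)) if g else c
                       for g, c in zip(groups, centers)]
        if new_centers == centers:
            break
        centers = new_centers
    sse = sum(sum((a - b) ** 2 for a, b in zip(p, centers[nearest(p, centers)]))
              for p in points)
    return centers, sse

# Toy data: three well-separated 2x2 blobs.
blob = [(0, 0), (0, 1), (1, 0), (1, 1)]
points = [(x + dx, y + dy) for dx, dy in [(0, 0), (10, 0), (0, 10)] for x, y in blob]

# A deliberately poor start: all three centers inside the first blob.
initial = [(0, 0), (0, 1), (1, 0)]
adjusted = adjust_centers(points, initial)
final_centers, sse = kmeans(points, adjusted)
print(adjusted, final_centers, sse)
```

In this sketch all centers are adjusted inside one loop before K-means starts, which loosely mirrors the abstract's point that the centers are obtained together rather than one at a time. The stopping threshold (mean inner-class distance) is an arbitrary illustrative choice.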
Pages: 1661-1667
Page count: 7