Adaptive Initialization Method for K-Means Algorithm

被引:8
|
作者
Yang, Jie [1 ]
Wang, Yu-Kai [1 ]
Yao, Xin [2 ,3 ]
Lin, Chin-Teng [1 ]
机构
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Computat Intelligence & Brain Comp Interface Lab, FEIT, Sydney, NSW, Australia
[2] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen Key Lab Computat Intelligence, Shenzhen, Peoples R China
[3] Univ Birmingham, Sch Comp Sci, CERCIA, Birmingham, W Midlands, England
来源
基金
澳大利亚研究理事会;
关键词
k-means; adaptive; initialization method; initial cluster centers; clustering; CLUSTERING-ALGORITHM; MODEL;
D O I
10.3389/frai.2021.740817
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The K-means algorithm is a widely used clustering algorithm that offers simplicity and efficiency. However, the traditional K-means algorithm uses a random method to determine the initial cluster centers, which make clustering results prone to local optima and then result in worse clustering performance. In this research, we propose an adaptive initialization method for the K-means algorithm (AIMK) which can adapt to the various characteristics in different datasets and obtain better clustering performance with stable results. For larger or higher-dimensional datasets, we even leverage random sampling in AIMK (name as AIMK-RS) to reduce the time complexity. 22 real-world datasets were applied for performance comparisons. The experimental results show AIMK and AIMK-RS outperform the current initialization methods and several well-known clustering algorithms. Specifically, AIMK-RS can significantly reduce the time complexity to O (n). Moreover, we exploit AIMK to initialize K-medoids and spectral clustering, and better performance is also explored. The above results demonstrate superior performance and good scalability by AIMK or AIMK-RS. In the future, we would like to apply AIMK to more partition-based clustering algorithms to solve real-life practical problems.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] AN INTELLIGENT INITIALIZATION METHOD FOR THE K-MEANS CLUSTERING ALGORITHM
    Sheu, Jyh-Jian
    Chen, Wei-Ming
    Tsai, Wen-Bin
    Chu, Ko-Tsung
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (06): : 2551 - 2566
  • [2] Density K-means : A New Algorithm for Centers Initialization for K-means
    Lan, Xv
    Li, Qian
    Zheng, Yi
    [J]. PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 958 - 961
  • [3] An Initialization Method Based on Hybrid Distance for k-Means Algorithm
    Yang, Jie
    Ma, Yan
    Zhang, Xiangfen
    Li, Shunbao
    Zhang, Yuping
    [J]. NEURAL COMPUTATION, 2017, 29 (11) : 3094 - 3117
  • [4] AN INITIALIZATION METHOD OF K-MEANS CLUSTERING ALGORITHM FOR MIXED DATA
    Li, Taoying
    Jin, Zhihong
    Chen, Yan
    Ebonzo, Angelo Dan Menga
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (05): : 1873 - 1883
  • [5] An initialization method for the K-Means algorithm using neighborhood model
    Cao, Fuyuan
    Liang, Jiye
    Jiang, Guang
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 58 (03) : 474 - 483
  • [6] HOW THE INITIALIZATION AFFECTS THE STABILITY OF THE k-MEANS ALGORITHM
    Bubeck, Sebastien
    Meila, Marina
    von Luxburg, Ulrike
    [J]. ESAIM-PROBABILITY AND STATISTICS, 2012, 16 : 436 - 452
  • [7] Cluster center initialization algorithm for K-means clustering
    Khan, SS
    Ahmad, A
    [J]. PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302
  • [8] Adaptive K-Means clustering algorithm
    Chen, Hailin
    Wu, Xiuqing
    Hu, Junhua
    [J]. MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788
  • [9] An Improved Heuristic K-Means Clustering Method Using Genetic Algorithm Based Initialization
    Mustafi, D.
    Sahoo, G.
    Mustafi, A.
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, 2017, 509 : 123 - 132
  • [10] An empirical comparison of four initialization methods for the K-Means algorithm
    Peña, JM
    Lozano, JA
    Larrañaga, P
    [J]. PATTERN RECOGNITION LETTERS, 1999, 20 (10) : 1027 - 1040