A Parameter-free Clustering Algorithm based K-means

被引:0
|
作者
Slaoui, Said [1 ]
Dafir, Zineb [1 ]
机构
[1] Mohammed V Univ, Fac Sci Rabat, Rabat, Morocco
关键词
Data mining; clustering; overlapping clustering; k-means; cluster centre initialization; ENHANCED VERSION;
D O I
10.14569/IJACSA.2021.0120372
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Clustering is one of the relevant data mining tasks, which aims to process data sets in an effective way. This paper introduces a new clustering heuristic combining the E-transitive heuristic adapted to quantitative data and the k-means algorithm with the goal of ensuring the optimal number of clusters and the suitable initial cluster centres for k-means. The suggested heuristic, called PFK-means, is a parameter-free clustering algorithm since it does not require the prior initialization of the number of clusters. Thus, it generates progressively the initial cluster centres until the appropriate number of clusters is automatically detected. Moreover, this paper exposes a thorough comparison between the PFK-means heuristic, its diverse variants, the E-Transitive heuristic for clustering quantitative data and the traditional k-means in terms of the sum of squared errors and accuracy using different data sets. The experiments results reveal that, in general, the proposed heuristic and its variants provide the appropriate number of clusters for different real-world data sets and give good clusters quality related to the traditional k-means. Furthermore, the experiments conducted on synthetic data sets report the performance of this heuristic in terms of processing time.
引用
收藏
页码:612 / 619
页数:8
相关论文
共 50 条
  • [31] A parameter-free affinity based clustering
    Bhaskar Mukhoty
    Ruchir Gupta
    Lakshmanan K.
    Mayank Kumar
    [J]. Applied Intelligence, 2020, 50 : 4543 - 4556
  • [32] Modified k-Means Clustering Algorithm
    Patel, Vaishali R.
    Mehta, Rupa G.
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 307 - +
  • [33] An Improved K-means Clustering Algorithm
    Wang Yintong
    Li Wanlong
    Gao Rujia
    [J]. 2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [34] Granular K-means Clustering Algorithm
    Zhou, Chenglong
    Chen, Yuming
    Zhu, Yidong
    [J]. Computer Engineering and Applications, 2023, 59 (13) : 317 - 324
  • [35] Unsupervised K-Means Clustering Algorithm
    Sinaga, Kristina P.
    Yang, Miin-Shen
    [J]. IEEE ACCESS, 2020, 8 : 80716 - 80727
  • [36] Modified K-means clustering algorithm
    Li, Wei
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 618 - 621
  • [37] The MinMax k-Means clustering algorithm
    Tzortzis, Grigorios
    Likas, Aristidis
    [J]. PATTERN RECOGNITION, 2014, 47 (07) : 2505 - 2516
  • [38] Improved K-means clustering algorithm
    Zhang, Zhe
    Zhang, Junxi
    Xue, Huifeng
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
  • [39] The global k-means clustering algorithm
    Likas, A
    Vlassis, N
    Verbeek, JJ
    [J]. PATTERN RECOGNITION, 2003, 36 (02) : 451 - 461
  • [40] An improved K-means clustering algorithm
    Huang, Xiuchang
    Su, Wei
    [J]. Journal of Networks, 2014, 9 (01) : 161 - 167