A Parameter-free Clustering Algorithm based K-means

被引:0
|
作者
Slaoui, Said [1 ]
Dafir, Zineb [1 ]
机构
[1] Mohammed V Univ, Fac Sci Rabat, Rabat, Morocco
关键词
Data mining; clustering; overlapping clustering; k-means; cluster centre initialization; ENHANCED VERSION;
D O I
10.14569/IJACSA.2021.0120372
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Clustering is one of the relevant data mining tasks, which aims to process data sets in an effective way. This paper introduces a new clustering heuristic combining the E-transitive heuristic adapted to quantitative data and the k-means algorithm with the goal of ensuring the optimal number of clusters and the suitable initial cluster centres for k-means. The suggested heuristic, called PFK-means, is a parameter-free clustering algorithm since it does not require the prior initialization of the number of clusters. Thus, it generates progressively the initial cluster centres until the appropriate number of clusters is automatically detected. Moreover, this paper exposes a thorough comparison between the PFK-means heuristic, its diverse variants, the E-Transitive heuristic for clustering quantitative data and the traditional k-means in terms of the sum of squared errors and accuracy using different data sets. The experiments results reveal that, in general, the proposed heuristic and its variants provide the appropriate number of clusters for different real-world data sets and give good clusters quality related to the traditional k-means. Furthermore, the experiments conducted on synthetic data sets report the performance of this heuristic in terms of processing time.
引用
收藏
页码:612 / 619
页数:8
相关论文
共 50 条
  • [41] An Enhancement of K-means Clustering Algorithm
    Gu, Jirong
    Zhou, Jieming
    Chen, Xianwei
    [J]. 2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 237 - 240
  • [42] Adaptive K-Means clustering algorithm
    Chen, Hailin
    Wu, Xiuqing
    Hu, Junhua
    [J]. MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788
  • [43] Improved Algorithm for the k-means Clustering
    Zhang, Sheng
    Wang, Shouqiang
    [J]. PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4717 - 4720
  • [44] Parameter-Free Minimum Spanning Tree (PFMST) Based Clustering Algorithm
    Raju, B. H. V. S. Ramakrisbnam
    Kumari, V. Valli
    [J]. ADVANCES IN PARALLEL, DISTRIBUTED COMPUTING, 2011, 203 : 552 - +
  • [45] Research On The Regional Division Of Free Surface Machining Based On K-Means Clustering Algorithm
    Han, Xianlong
    [J]. PROCEEDINGS OF THE 2016 7TH INTERNATIONAL CONFERENCE ON MECHATRONICS, CONTROL AND MATERIALS (ICMCM 2016), 2016, 104 : 312 - 317
  • [46] k*-means:: A new generalized k-means clustering algorithm
    Cheung, YM
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (15) : 2883 - 2893
  • [47] K*-Means: An Effective and Efficient K-means Clustering Algorithm
    Qi, Jianpeng
    Yu, Yanwei
    Wang, Lihong
    Liu, Jinglei
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 242 - 249
  • [48] Environment Parameter Rating Evaluation for Smart Museum Based on Improved K-Means Clustering Algorithm
    Guo, Wenqiang
    Huang, Zixuan
    Hou, Yongyan
    Xiao, Qinkun
    Jia, Jia
    Mao, Lingling
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 5449 - 5453
  • [49] A K-means Optimized Clustering Algorithm Based on Improved Genetic Algorithm
    Pu, Qiu-Mei
    Wu, Qiong
    Li, Qian
    [J]. Lecture Notes in Electrical Engineering, 2022, 801 LNEE : 133 - 140
  • [50] Improved rough K-means clustering algorithm based on firefly algorithm
    Ye, Tingyu
    Ye, Jun
    Wang, Lei
    [J]. INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2023, 17 (01) : 1 - 12