A Parameter-free Clustering Algorithm based K-means

Cited by: 0

Authors:

Slaoui, Said [1]

Dafir, Zineb [1]

Affiliations:

[1] Mohammed V Univ, Fac Sci Rabat, Rabat, Morocco

Source:

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2021 / Vol. 12 / No. 03

Keywords:

Data mining; clustering; overlapping clustering; k-means; cluster centre initialization; ENHANCED VERSION;

DOI:

10.14569/IJACSA.2021.0120372

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

Clustering is an important data mining task that aims to process data sets effectively. This paper introduces a new clustering heuristic that combines the E-transitive heuristic, adapted to quantitative data, with the k-means algorithm, with the goal of determining the optimal number of clusters and suitable initial cluster centres for k-means. The suggested heuristic, called PFK-means, is a parameter-free clustering algorithm because it does not require the number of clusters to be specified in advance. Instead, it progressively generates initial cluster centres until the appropriate number of clusters is detected automatically. Moreover, the paper presents a thorough comparison of the PFK-means heuristic, its variants, the E-transitive heuristic for clustering quantitative data, and traditional k-means in terms of the sum of squared errors and accuracy on different data sets. The experimental results show that, in general, the proposed heuristic and its variants provide the appropriate number of clusters for different real-world data sets and yield good cluster quality relative to traditional k-means. Furthermore, experiments conducted on synthetic data sets report the performance of the heuristic in terms of processing time.
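
The abstract evaluates clusterings by the sum of squared errors (SSE) and stresses that k-means depends on the initial cluster centres it is given. The following is a minimal sketch of those two pieces only: standard k-means started from caller-supplied centres, and the SSE of the result. It is not the PFK-means heuristic, whose centre-generation step is not described in this record. The function names `assign`, `update`, `kmeans`, and `sse` are illustrative assumptions, not names from the paper.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def assign(points, centres):
    """Return, for each point, the index of its nearest centre."""
    return [min(range(len(centres)), key=lambda j: dist(p, centres[j]))
            for p in points]


def update(points, labels, centres):
    """Recompute each centre as the mean of its members.

    A centre with no members is kept unchanged (an assumption of this sketch).
    """
    new_centres = []
    for j, centre in enumerate(centres):
        members = [p for p, lab in zip(points, labels) if lab == j]
        if members:
            new_centres.append(
                tuple(sum(coord) / len(members) for coord in zip(*members)))
        else:
            new_centres.append(tuple(centre))
    return new_centres


def kmeans(points, initial_centres, max_iter=100):
    """Standard k-means (Lloyd's iterations) from the given initial centres."""
    centres = [tuple(c) for c in initial_centres]
    for _ in range(max_iter):
        labels = assign(points, centres)
        new_centres = update(points, labels, centres)
        if new_centres == centres:  # converged: centres no longer move
            break
        centres = new_centres
    return centres, assign(points, centres)


def sse(points, labels, centres):
    """Sum of squared Euclidean distances from each point to its centre."""
    return sum(dist(p, centres[lab]) ** 2 for p, lab in zip(points, labels))


if __name__ == "__main__":
    data = [(0, 0), (0, 2), (10, 0), (10, 2)]
    centres, labels = kmeans(data, initial_centres=[(0, 0), (10, 0)])
    print(centres, labels, sse(data, labels, centres))
```

In this sketch the number of clusters is fixed by the length of `initial_centres`; a parameter-free method such as the one the abstract describes would have to produce that list itself.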

Pages: 612-619

Page count: 8
