Comparison of the performance of center-based clustering algorithms

被引:0
|
作者
Zhang, B [1 ]
机构
[1] Hewlett Packard Res Labs, Palo Alto, CA 94304 USA
关键词
clustering; K-means; K-Harmonic Means; expectation-maximization; data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Center-based clustering algorithms like K-means, and EM are one of the most popular classes of clustering algorithms in use today. The author developed another variation in this family K-Harmonic Means (KHM). It has been demonstrated using a small number of "benchmark" datasets that KHM is more robust than K-means and EM. In this paper, we compare their performance statistically. We run K-means, K-Harmonic Means and EM on each of 3600 pairs of (dataset, initialization) to compare the statistical average and variation of the performance of these algorithms. The results axe that, for low dimensional datasets, KHM performs consistently better than KM, and KM performs consistently better than EM over a large variation of clustered-ness of the datasets and a large variation of initializations. Some of the reasons that contributed to this difference are explained.
引用
收藏
页码:63 / 74
页数:12
相关论文
共 50 条
  • [1] MapReduce algorithms for robust center-based clustering in doubling metrics
    Dandolo, Enrico
    Mazzetto, Alessio
    Pietracaprina, Andrea
    Pucci, Geppino
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 194
  • [2] On efficient model selection for sparse hard and fuzzy center-based clustering algorithms
    Gupta, Avisek
    Das, Swagatam
    [J]. INFORMATION SCIENCES, 2022, 590 : 29 - 44
  • [3] Clustering Center-based Differential Evolution
    Khosrowshahli, Rasa
    Rahnamayan, Shahryar
    Bidgoli, Azam Asilian
    [J]. 2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [4] klcluster: Center-based Clustering of Trajectories
    Buchin, Kevin
    Driemel, Anne
    van de L'Isle, Natasja
    Nusser, Andre
    [J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 496 - 499
  • [5] Multiple ellipse fitting by center-based clustering
    Marosevic, Tomislav
    Scitovski, Rudolf
    [J]. CROATIAN OPERATIONAL RESEARCH REVIEW, 2015, 6 (01) : 43 - 53
  • [6] Multiple circle detection based on center-based clustering
    Scitovski, Rudolf
    Marosevic, Tomislav
    [J]. PATTERN RECOGNITION LETTERS, 2015, 52 : 9 - 16
  • [7] Center-based clustering under perturbation stability
    Awasthi, Pranjal
    Blum, Avrim
    Sheffet, Or
    [J]. INFORMATION PROCESSING LETTERS, 2012, 112 (1-2) : 49 - 54
  • [8] Center-Based Sampling for Population-Based Algorithms
    Rahnamayan, Shahryar
    Wang, G. Gary
    [J]. 2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 933 - +
  • [9] CENTER-BASED L1-CLUSTERING METHOD
    Sabo, Kristian
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2014, 24 (01) : 151 - 163
  • [10] IDCUP algorithm to classifying arbitrary shapes and densities for center-based clustering performance analysis
    [J]. Altaf, Saud (saltaf@aut.ac.nz), 1600, Informing Science Institute (15):