A practical comparison of two K-Means clustering algorithms

被引:15
|
作者
Wilkin, Gregory A. [1 ]
Huang, Xiuzhen [1 ]
机构
[1] Arkansas State Univ, Dept Comp Sci, State Univ, AR 72467 USA
关键词
Cluster Algorithm; Cluster Center; Cluster Representative; Close Center; Proximity Relationship;
D O I
10.1186/1471-2105-9-S6-S19
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Data clustering is a powerful technique for identifying data with similar characteristics, such as genes with similar expression patterns. However, not all implementations of clustering algorithms yield the same performance or the same clusters. Results: In this paper, we study two implementations of a general method for data clustering: k-means clustering. Our experimentation compares the running times and distance efficiency of Lloyd's K-means Clustering and the Progressive Greedy K-means Clustering. Conclusion: Based on our implementation, not just in processing time, but also in terms of mean squared-difference (MSD), Lloyd's K-means Clustering algorithm is more efficient. This analysis was performed using both a gene expression level sample and on randomly-generated datasets in three-dimensional space. However, other circumstances may dictate a different choice in some situations.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] A practical comparison of two K-Means clustering algorithms
    Gregory A Wilkin
    Xiuzhen Huang
    [J]. BMC Bioinformatics, 9
  • [2] Comparison of distributed evolutionary k-means clustering algorithms
    Naldi, M. C.
    Campello, R. J. G. B.
    [J]. NEUROCOMPUTING, 2015, 163 : 78 - 93
  • [3] Comparison between K-Means and K-Medoids Clustering Algorithms
    Madhulatha, Tagaram Soni
    [J]. ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, 2011, 198 : 472 - 481
  • [4] Applied Comparison of DBSCAN, OPTICS and K-Means Clustering Algorithms
    Bilgin, Turgay Tugay
    Camurcu, Yilmaz
    [J]. JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2005, 8 (02): : 139 - 145
  • [5] Clustering performance comparison using K-means and expectation maximization algorithms
    Jung, Yong Gyu
    Kang, Min Soo
    Heo, Jun
    [J]. BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2014, 28 : S44 - S48
  • [6] A Survey on Various K-Means algorithms for Clustering
    Singh, Malwinder
    Bansal, Meenakshi
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (06): : 60 - 65
  • [7] The seeding algorithms for spherical k-means clustering
    Min Li
    Dachuan Xu
    Dongmei Zhang
    Juan Zou
    [J]. Journal of Global Optimization, 2020, 76 : 695 - 708
  • [8] Robust Algorithms for Online k-means Clustering
    Bhaskara, Aditya
    Ruwanpathirana, Aravinda Kanchana
    [J]. ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 148 - 173
  • [9] Acceleration of K-means and related clustering algorithms
    Phillips, SJ
    [J]. ALGORITHM ENGINEERING AND EXPERIMENTS, 2002, 2409 : 166 - 177
  • [10] The seeding algorithms for spherical k-means clustering
    Li, Min
    Xu, Dachuan
    Zhang, Dongmei
    Zou, Juan
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2020, 76 (04) : 695 - 708