Fast k-means algorithms with constant approximation

被引:0
|
作者
Song, MJ [1 ]
Rajasekaran, S [1 ]
机构
[1] Univ Connecticut, Storrs, CT 06269 USA
来源
ALGORITHMS AND COMPUTATION | 2005年 / 3827卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we study the k-means clustering problem. It is well-known that the general version of this problem is NP-hard. Numerous approximation algorithms have been proposed for this problem. In this paper, we proposed three constant approximation algorithms for k-means clustering. The first algorithm runs in time O((k/epsilon)(k)nd), where k is the number of clusters, n is the size of input points, d is dimension of attributes. The second algorithm runs in time O(k(3)n(2) log n). This is the first algorithm for k-means clustering that runs in time polynomial in n, k and d. The run time of the third algorithm (O(k(5) log(3) kd)) is independent of n. Though an algorithm whose run time is independent of n is known for the k-median problem, ours is the first such algorithm for the k-means problem.
引用
收藏
页码:1029 / 1038
页数:10
相关论文
共 50 条
  • [41] Faster Algorithms for the Constrained k-means Problem
    Bhattacharya, Anup
    Jaiswal, Ragesh
    Kumar, Amit
    [J]. THEORY OF COMPUTING SYSTEMS, 2018, 62 (01) : 93 - 115
  • [42] Acceleration of K-means and related clustering algorithms
    Phillips, SJ
    [J]. ALGORITHM ENGINEERING AND EXPERIMENTS, 2002, 2409 : 166 - 177
  • [43] The seeding algorithms for spherical k-means clustering
    Li, Min
    Xu, Dachuan
    Zhang, Dongmei
    Zou, Juan
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2020, 76 (04) : 695 - 708
  • [44] Performance Analysis of K-Means Seeding Algorithms
    Ortiz-Bejar, Jose
    Tellez, Eric S.
    Graff, Mario
    Ortiz-Bejar, Jesus
    Jacobo, Jaime Cerda
    Zamora-Mendez, Alejandro
    [J]. 2019 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC 2019), 2019,
  • [45] Robust Algorithms for Online k-means Clustering
    Bhaskara, Aditya
    Ruwanpathirana, Aravinda Kanchana
    [J]. ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 148 - 173
  • [46] COMPARATIVE ANALYSIS OF K-MEANS AND DBSCAN ALGORITHMS
    Zurini, Madalina
    [J]. INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, 2013, : 646 - 651
  • [47] A FAST k-MEANS IMPLEMENTATION USING CORESETS
    Frahling, Gereon
    Sohler, Christian
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2008, 18 (06) : 605 - 625
  • [48] Fast K-means for Large Scale Clustering
    Hu, Qinghao
    Wu, Jiaxiang
    Bai, Lu
    Zhang, Yifan
    Cheng, Jian
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2099 - 2102
  • [49] Fast Noise Removal for k-Means Clustering
    Im, Sungjin
    Qaem, Mahshid Montazer
    Moseley, Benjamin
    Sun, Xiaorui
    Zhou, Rudy
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 456 - 465
  • [50] Streaming k-Means Clustering with Fast Queries
    Zhang, Yu
    Tangwongsan, Kanat
    Tirthapura, Srikanta
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 449 - 460