An Analytic Survey on MapReduce based K-Means and its Hybrid Clustering Algorithms

Cited by: 0
Authors
Bagde, Utkarsha [1 ]
Tripathi, Priyanka [1 ]
Affiliation
[1] NITTTR, Dept Comp Engn & Applicat, Bhopal, India
Keywords
Clustering; K-Means; K-Harmonic Means; MapReduce
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
A central challenge in data clustering today is arranging similar data into groups. Traditional clustering algorithms are effective for handling large amounts of data from sources such as social media, business, and the internet. However, the time complexity of the serial computation in these traditional algorithms is very high. The K-Means algorithm is sensitive to its initial points and prone to local optima, and it is often run many times to determine the value of K. K-Harmonic Means is insensitive to the initialization of the centers and is suitable for large-scale datasets. To overcome these defects of traditional clustering algorithms, this paper suggests a hybrid method. MapReduce is a parallel programming model for distributed processing that processes and generates data sets with a parallel, distributed algorithm on a cluster. In this paper, observations are given on different MapReduce-based algorithms, and a new MapReduce-based hybrid clustering algorithm is proposed on the basis of those observations.
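As an illustration of how a K-Means iteration can be phrased in MapReduce terms, the following is a minimal single-process sketch in Python. It is not the algorithm proposed in the paper, and it does not use Hadoop or any real MapReduce framework. The function names `map_phase`, `reduce_phase`, and `kmeans_iteration`, and the sample points, are hypothetical and chosen only for illustration. The map step assigns each point to its nearest center, and the reduce step averages the points assigned to each center.

```python
from collections import defaultdict
from math import dist  # Euclidean distance, available in Python 3.8+


def map_phase(points, centers):
    """Map: emit (nearest-center index, (point, 1)) for every point."""
    for p in points:
        nearest = min(range(len(centers)), key=lambda i: dist(p, centers[i]))
        yield nearest, (p, 1)


def reduce_phase(pairs, centers):
    """Reduce: sum the points per center index and return the new means."""
    sums = {}
    counts = defaultdict(int)
    for idx, (p, n) in pairs:
        if idx not in sums:
            sums[idx] = list(p)
        else:
            sums[idx] = [s + x for s, x in zip(sums[idx], p)]
        counts[idx] += n
    new_centers = []
    for i, c in enumerate(centers):
        if counts[i]:
            new_centers.append(tuple(s / counts[i] for s in sums[i]))
        else:
            # Assumption: a center with no assigned points is kept unchanged.
            new_centers.append(c)
    return new_centers


def kmeans_iteration(points, centers):
    """One K-Means iteration expressed as a map step followed by a reduce step."""
    return reduce_phase(map_phase(points, centers), centers)


# Hypothetical sample data: two well-separated groups of 2-D points.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0), (10.0, 12.0)]
centers = [(0.0, 0.0), (10.0, 10.0)]
new_centers = kmeans_iteration(points, centers)
print(new_centers)
```

In a distributed setting the map step would typically run in parallel over partitions of the data and the reduce step would run once per center key; here both run sequentially so that the data flow is easy to follow.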
Pages: 32-36
Number of pages: 5
Related papers
50 in total
  • [31] Cuckoo and krill herd-based k-means plus plus hybrid algorithms for clustering
    Aggarwal, Shruti
    Singh, Paramvir
    [J]. EXPERT SYSTEMS, 2019, 36 (04)
  • [32] Robust Algorithms for Online k-means Clustering
    Bhaskara, Aditya
    Ruwanpathirana, Aravinda Kanchana
    [J]. ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 148 - 173
  • [33] Acceleration of K-means and related clustering algorithms
    Phillips, SJ
    [J]. ALGORITHM ENGINEERING AND EXPERIMENTS, 2002, 2409 : 166 - 177
  • [34] The seeding algorithms for spherical k-means clustering
    Li, Min
    Xu, Dachuan
    Zhang, Dongmei
    Zou, Juan
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2020, 76 (04) : 695 - 708
  • [35] K-Means and Fuzzy based Hybrid Clustering Algorithm for WSN
    Angadi, Basavaraj M.
    Kakkasageri, Mahabaleshwar S.
    [J]. INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2023, 69 (04) : 793 - 801
  • [36] Empirical Evaluation of K-Means, Bisecting K-Means, Fuzzy C-Means and Genetic K-Means Clustering Algorithms
    Banerjee, Shreya
    Choudhary, Ankit
    Pal, Somnath
    [J]. 2015 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2015, : 172 - 176
  • [37] Single-pass and linear-time k-means clustering based on MapReduce
    Shahrivari, Saeed
    Jalili, Saeed
    [J]. INFORMATION SYSTEMS, 2016, 60 : 1 - 12
  • [38] Data Categorization Using Hadoop MapReduce-Based Parallel K-Means Clustering
    Ansari Z.
    Afzal A.
    Sardar T.H.
    [J]. Journal of The Institution of Engineers (India): Series B, 2019, 100 (2) : 95 - 103
  • [39] K-Means Parallel Algorithm of Big Data Clustering Based on Mapreduce PCAM Method
    Li, Yongyi
    Yang, Zhongqiang
    Han, Kaixu
    [J]. Engineering Intelligent Systems, 2021, 29 (06) : 411 - 418
  • [40] Experience with a hybrid processor: K-means clustering
    Gokhale, M
    Frigo, J
    Mccabe, K
    Theiler, J
    Wolinski, C
    Lavenier, D
    [J]. JOURNAL OF SUPERCOMPUTING, 2003, 26 (02) : 131 - 148