An Analytic Survey on MapReduce based K-Means and its Hybrid Clustering Algorithms

被引:0
|
作者
Bagde, Utkarsha [1 ]
Tripathi, Priyanka [1 ]
机构
[1] NITTTR, Dept Comp Engn & Applicat, Bhopal, India
关键词
Clustering; K-Means; K-Harmonic Means; MapReduce;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The challenging task of today's era in data clustering is the common technique of arranging similar data into chunks. The traditional clustering algorithm is effective for handling large amount of data which comes from various sources such as social media, business, internet, etc. However, the time complexity of the serial calculation method is very high in these traditional algorithms. The K-Means algorithm is sensitive for initial points and local optimization and many times K-Means runs for K value. K-Harmonic Means is insensitive to the initialization of the centers and suitable for large scale datasets. To overcome these defects of traditional clustering algorithm, a hybrid method is suggested in this paper. MapReduce is a parallel programming model for distributed processing and generates data sets with a parallel, distributed algorithmic program on a cluster. In this paper, observations are given based on the different MapReduce algorithms. A new hybrid clustering algorithm based on MapReduce is proposed on those observations.
引用
收藏
页码:32 / 36
页数:5
相关论文
共 50 条
  • [11] A hybrid MapReduce-based k-means clustering using genetic algorithm for distributed datasets
    Ankita Sinha
    Prasanta K. Jana
    [J]. The Journal of Supercomputing, 2018, 74 : 1562 - 1579
  • [12] Design of K-means clustering algorithm in PGAS based Mapreduce framework
    Shomanov, A. S.
    Mansurova, M. E.
    Nugumanova, A. B.
    [J]. 2018 IEEE 12TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2018, : 158 - 160
  • [13] Parallel K-Means Clustering of Remote Sensing Images Based on MapReduce
    Lv, Zhenhua
    Hu, Yingjie
    Zhong, Haidong
    Wu, Jianping
    Li, Bo
    Zhao, Hui
    [J]. WEB INFORMATION SYSTEMS AND MINING, 2010, 6318 : 162 - +
  • [14] An Effective and Efficient Clustering Based on K-Means Using MapReduce and TLBO
    Pedireddla, Praveen Kumar
    Yadwad, Sunita A.
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 3, 2016, 381 : 619 - 628
  • [15] An Improved parallel K-means Clustering Algorithm with MapReduce
    Liao, Qing
    Yang, Fan
    Zhao, Jingming
    [J]. 2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 764 - 768
  • [16] Improved MapReduce k-Means Clustering Algorithm with Combiner
    Anchalia, Prajesh P.
    [J]. 2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 386 - 391
  • [17] Road extraction based on the algorithms of K-means clustering and hybrid model of SVM and FCM
    Zhu, Daming
    Wen, Xiang
    Xia, Rong
    [J]. ADVANCES IN ENVIRONMENTAL SCIENCE AND ENGINEERING, PTS 1-6, 2012, 518-523 : 5738 - 5743
  • [18] MapReduce Model of Improved K-Means Clustering Algorithm Using Hadoop MapReduce
    Akthar, Nadeem
    Ahamad, Mohd Vasim
    Ahmad, Shahbaaz
    [J]. 2016 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2016, : 192 - 198
  • [19] A Survey on Feature Weighting Based K-Means Algorithms
    Renato Cordeiro de Amorim
    [J]. Journal of Classification, 2016, 33 : 210 - 242
  • [20] A Survey on Feature Weighting Based K-Means Algorithms
    de Amorim, Renato Cordeiro
    [J]. JOURNAL OF CLASSIFICATION, 2016, 33 (02) : 210 - 242