Modified K-means Algorithm for Big Data Clustering

Cited by: 1
Authors
Sengupta, Debapriya [1]
Roy, Sayantan Singha [1]
Ghosh, Sarbani [1]
Dasgupta, Ranjan [1]
Affiliation
[1] NITTTR, Dept CSE, Kolkata, India
Keywords
K-means Clustering; Big Data; Distance Matrix; Social Network
DOI
10.1109/CSCI.2017.252
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering Big Data is a demanding research problem, and efficient clustering of growing data attracts particular attention from researchers because such growth is very common in social networks. Clustering algorithms generally deal with static data; many algorithms exist, each with its own pros and cons and each applicable to particular types of data. We consider the K-means algorithm on one-dimensional data and modify it to handle frequent additions of data without re-clustering the entire set. We further reduce the volume of distance-matrix calculation required for the additional data elements. Theoretical calculations and a case study are presented to establish the benefits of the proposed modified algorithm.
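The abstract does not spell out the modified algorithm, so the following is only a minimal sketch of the general idea it describes: adding a one-dimensional point to existing clusters without re-clustering the whole set. It assumes a nearest-centroid assignment followed by a running-mean update, which may differ from the paper's actual procedure. The names `Cluster` and `add_point` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Cluster:
    """Summary of one 1-D cluster: its centroid and how many points it holds."""
    centroid: float
    count: int


def add_point(clusters: List[Cluster], x: float) -> int:
    """Assign x to the nearest centroid and update only that cluster.

    Assumption (not from the paper): the centroid is maintained as a
    running mean, so no previously clustered point is revisited.
    Only len(clusters) distances are computed for the new point.
    """
    # Find the index of the centroid closest to x (absolute distance in 1-D).
    idx = min(range(len(clusters)), key=lambda i: abs(clusters[i].centroid - x))
    cluster = clusters[idx]
    # Incremental mean update: new_mean = old_mean + (x - old_mean) / new_count.
    cluster.count += 1
    cluster.centroid += (x - cluster.centroid) / cluster.count
    return idx


# Usage: two existing clusters, then two newly arriving points.
clusters = [Cluster(centroid=1.0, count=2), Cluster(centroid=10.0, count=2)]
first = add_point(clusters, 4.0)
second = add_point(clusters, 12.0)
```

Under this assumption each insertion computes one distance per cluster, independent of how many points were clustered earlier. A full K-means pass would instead recompute distances for every stored point.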
Pages: 1443-1448
Page count: 6