Modified K-means Algorithm for Big Data Clustering

被引:1
|
作者
Sengupta, Debapriya [1 ]
Roy, Sayantan Singha [1 ]
Ghosh, Sarbani [1 ]
Dasgupta, Ranjan [1 ]
机构
[1] NITTTR, Dept CSE, Kolkata, India
关键词
K-means Clustering; BigData; Distance Matrix; Social Network;
D O I
10.1109/CSCI.2017.252
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering of Big data is a highly demanding research issue and efficient clustering, particularly for growing data, attracts further attention to the researchers as it is a very common phenomenon for social networks. Clustering algorithms in general deal with static data and various algorithms do exist with their respective pros and cons and are applicable to various types of data. We consider K-means algorithm with one dimensional data and modify it to handle frequent addition of data without re-clustering the entire set. We further improve volume of distance matrix calculation for additional data elements. Theoretical calculation along with case study is placed for establishing the benefits gained by the proposed modified algorithm.
引用
收藏
页码:1443 / 1448
页数:6
相关论文
共 50 条
  • [21] On K-means Data Clustering Algorithm with Genetic Algorithm
    Kapil, Shruti
    Chawla, Meenu
    Ansari, Mohd Dilshad
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 202 - 206
  • [22] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    [J]. 2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [23] MK-means - Modified K-means clustering algorithm
    Dashti, Hesam T.
    Simas, Tiago
    Ribeiro, Rita A.
    Assadi, Amir
    Moitinho, Andre
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [24] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [25] How to Use K-means for Big Data Clustering?
    Mussabayev, Rustam
    Mladenovic, Nenad
    Jarboui, Bassem
    Mussabayev, Ravil
    [J]. PATTERN RECOGNITION, 2023, 137
  • [26] Parallel batch k-means for Big data clustering
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Sukhostat, Lyudmila, V
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 152
  • [27] Application Research of Modified K-Means Clustering Algorithm
    Liu, Guo-li
    Tan, You-qian
    Yu, Li-mei
    Liu, Jia
    Gao, Jin-qiao
    [J]. 19TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT: MANAGEMENT SYSTEM INNOVATION, 2013, : 1269 - 1280
  • [28] A modified K-means algorithm for circular invariant clustering
    Charalampidis, D
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (12) : 1856 - 1865
  • [29] Data Clustering: Integrating Different Distance Measures with Modified k-Means Algorithm
    Patel, Vaishali R.
    Mehta, Rupa G.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2011), VOL 2, 2012, 131 : 691 - +
  • [30] Data Clustering with Cluster Size Constraints Using a Modified k-means Algorithm
    Ganganath, Nuwan
    Cheng, Chi-Tsun
    Tse, Chi K.
    [J]. 2014 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2014, : 158 - 161