Strategies for Big Data Clustering

被引:28
|
作者
Kurasova, Olga [1 ]
Marcinkevicius, Virginijus [1 ]
Medvedev, Viktor [1 ]
Rapecka, Aurimas [1 ]
Stefanovic, Pavel [1 ]
机构
[1] Vilnius State Univ, Inst Math & Informat, LT-08663 Vilnius, Lithuania
关键词
big data; clustering methods; data mining; Hadoop; VISUAL ANALYSIS;
D O I
10.1109/ICTAI.2014.115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the paper, an overview of methods and technologies used for big data clustering is presented. The clustering is one of the important data mining issue especially for big data analysis, where large volume data should be grouped. Here some clustering methods are described, great attention is paid to the k-means method and its modifications, because it still remains one of the popular methods and is implemented in innovative technologies for big data analysis. Neural network-based self-organizing maps and their extensions for big data clustering are reviewed, too. Some strategies for big data clustering are also presented and discussed. It is shown the data of which volume can be clustered in the well known data mining systems WEKA and KNIME and when new sophisticated technologies are needed.
引用
收藏
页码:740 / 747
页数:8
相关论文
共 50 条
  • [1] MapReduce Clustering for Big Data
    Ghattas, Badih
    Pinto, Antoine
    Diao, Sambou
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5116 - 5124
  • [2] Big Data Clustering: A Review
    Shirkhorshidi, Ali Seyed
    Aghabozorgi, Saeed
    Teh, Ying Wah
    Herawan, Tutut
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2014, PT V, 2014, 8583 : 707 - 720
  • [3] Big Data and Clustering Algorithms
    Ajin, V. W.
    Kumar, Lekshmy D.
    [J]. 2016 INTERNATIONAL CONFERENCE ON RESEARCH ADVANCES IN INTEGRATED NAVIGATION SYSTEMS (RAINS), 2016,
  • [4] Consensus Clustering on Big Data
    Liu, Hongfu
    Cheng, Gong
    Wu, Junjie
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2015,
  • [5] Big Data clustering validity
    Tlili, Monia
    Hamdani, Tarek M.
    [J]. 2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 348 - 352
  • [6] Marketing strategies evaluation based on big data analysis: a CLUSTERING-MCDM approach
    Mahdiraji, Hannan Amoozad
    Zavadskas, Edmundas Kazimieras
    Kazeminia, Aliakbar
    Abbasi Kamardi, AliAsghar
    [J]. ECONOMIC RESEARCH-EKONOMSKA ISTRAZIVANJA, 2019, 32 (01): : 2882 - 2898
  • [7] A Review of Clustering Algorithms for Big Data
    Djouzi, Kheyreddine
    Beghdad-Bey, Kadda
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON NETWORKING AND ADVANCED SYSTEMS (ICNAS 2019), 2019, : 117 - 122
  • [8] Iterative Unified Clustering in Big Data
    Misal, Vasundhara
    Janeja, Vandana P.
    Pallaprolu, Sai C.
    Yesha, Yelena
    Chintalapati, Raghu
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3412 - 3421
  • [9] A Hybrid Approach to Clustering in Big Data
    Kumar, Dheeraj
    Bezdek, James C.
    Palaniswami, Marimuthu
    Rajasegarar, Sutharshan
    Leckie, Christopher
    Havens, Timothy Craig
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (10) : 2372 - 2385
  • [10] High Performance Big Data Clustering
    Agrawal, Ankit
    Patwary, Md. Mostofa Ali
    Hendrix, William
    Liao, Wei-keng
    Choudhary, Alok
    [J]. CLOUD COMPUTING AND BIG DATA, 2013, 23 : 192 - 211