MapReduce-based Dragonfly Algorithm for large-scale Data-Clustering

Cited by: 0
Authors
Tripathi, Ashish Kumar [1]
Saxena, Pranav [1]
Gupta, Siddharth [1]
Affiliation
[1] Jaypee Inst Informat Technol, Noida, India
Keywords
Salp swarm algorithm; Metaheuristic method; Spiral search; Convergence; SEGMENTATION;
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
Advances in networking technology and sensors, together with the availability of the internet, have accelerated the growth of electronic data. This immense volume of data has stimulated the development of new data analysis methods for better decision making. Clustering is an influential unsupervised data analysis approach with a wide range of applications. K-Means is a fast and widely used data-clustering approach in the literature. However, the algorithm is sensitive to the initial positions of the cluster centroids and converges to the local optimum nearest to those initial positions. Furthermore, the algorithm cannot process large datasets within a reasonable time. In this work, a novel data-clustering method named MapReduce-based Dragonfly Algorithm (MR-DA) is introduced. The efficiency of MR-DA is compared with four other recent methods. The experimental results demonstrate that MR-DA outperformed the other considered methods on the majority of the datasets.
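The abstract does not describe how MR-DA distributes its work, so the following is only a minimal sketch of one common pattern for MapReduce-based clustering, not the authors' method. It assumes the quantity a metaheuristic would minimize is the total distance from each point to its nearest centroid, and it simulates the map and reduce phases in plain Python over in-memory splits. The names `map_phase`, `reduce_phase`, and `fitness` are hypothetical, as is the toy dataset.

```python
from collections import defaultdict
from math import dist


def map_phase(split, centroids):
    """Map: for each point in one data split, emit (nearest cluster index, distance)."""
    for point in split:
        distances = [dist(point, c) for c in centroids]
        nearest = min(range(len(centroids)), key=distances.__getitem__)
        yield nearest, distances[nearest]


def reduce_phase(pairs):
    """Reduce: sum the emitted distances per cluster index."""
    totals = defaultdict(float)
    for cluster, distance in pairs:
        totals[cluster] += distance
    return dict(totals)


def fitness(splits, centroids):
    """Assumed objective: total point-to-nearest-centroid distance over all splits."""
    pairs = (pair for split in splits for pair in map_phase(split, centroids))
    return sum(reduce_phase(pairs).values())


# Toy data in two splits, standing in for blocks of a large dataset (hypothetical values).
splits = [[(0.0, 0.0), (0.0, 1.0)], [(10.0, 10.0), (10.0, 11.0)]]
good = [(0.0, 0.5), (10.0, 10.5)]   # one centroid per natural group
poor = [(0.0, 0.0), (0.0, 1.0)]     # both centroids placed in the same group

print(fitness(splits, good), fitness(splits, poor))
```

The `poor` candidate scores far worse than `good`, which illustrates why a centroid placement that starts in the wrong region matters: a purely local method refines whatever it is given, while a population-based search can compare many candidate placements using an objective like this one.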
Pages: 171-175
Number of pages: 5
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 923 - 934