MapReduce-based Dragonfly Algorithm for large-scale Data-Clustering

Cited by: 0
Authors
Tripathi, Ashish Kumar [1]
Saxena, Pranav [1]
Gupta, Siddharth [1]
Affiliations
[1] Jaypee Inst Informat Technol, Noida, India
Keywords
Salp swarm algorithm; Metaheuristic method; Spiral search; Convergence; SEGMENTATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Improvements in networking technology and sensors, together with the wide availability of the internet, have accelerated the growth of electronic data. This immense volume of data has stimulated the development of new data analysis methods for better decision making. Clustering is an influential unsupervised data analysis approach with a wide range of applications. K-Means is a fast and widely used data-clustering approach in the literature. However, the algorithm is sensitive to the initial positions of the cluster centroids, and it converges to the local optimum nearest to those initial positions. Furthermore, the algorithm cannot process large datasets within a reasonable time. In this work, a novel data-clustering method named MapReduce-based Dragonfly Algorithm (MR-DA) is introduced. The efficiency of MR-DA is compared with four other recent methods. The experimental results demonstrate that MR-DA outperformed the other considered methods on the majority of the datasets.
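The abstract does not specify how MR-DA scores a candidate set of centroids. As a minimal sketch, assuming the fitness is the within-cluster sum of squared distances (a common choice in metaheuristic clustering, not confirmed by this record), the evaluation can be split into a map step per data partition and a reduce step that adds the partial sums. The function names `map_partition` and `fitness` and the toy data are hypothetical, and the dragonfly position update itself is not shown.

```python
from functools import reduce


def squared_distance(p, c):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, c))


def map_partition(points, centroids):
    """Map step (assumed): for one data split, sum each point's squared
    distance to its nearest centroid."""
    return sum(min(squared_distance(p, c) for c in centroids) for p in points)


def fitness(partitions, centroids):
    """Reduce step (assumed): add the partial sums from all splits."""
    partials = (map_partition(part, centroids) for part in partitions)
    return reduce(lambda a, b: a + b, partials, 0.0)


# Toy data: two splits, as if stored on two worker nodes.
partitions = [
    [(0.0, 0.0), (0.0, 1.0)],
    [(10.0, 10.0), (10.0, 11.0)],
]

# Two candidate solutions, each a set of two centroids.
good = [(0.0, 0.5), (10.0, 10.5)]   # one centroid per natural group
poor = [(0.0, 0.0), (0.0, 1.0)]     # both centroids in the same group

good_fitness = fitness(partitions, good)
poor_fitness = fitness(partitions, poor)
print(good_fitness, poor_fitness)
```

The `poor` candidate illustrates the initialization problem the abstract attributes to K-Means: with both centroids placed in one group, the score is far worse, and a purely local refinement may not recover from such a start. A population-based search evaluates many candidates like these, which is why distributing the fitness computation across splits matters for large datasets.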
Pages: 171-175
Page count: 5