MapReduce-based Dragonfly Algorithm for large-scale Data-Clustering

被引:0
|
作者
Tripathi, Ashish Kumar [1 ]
Saxena, Pranav [1 ]
Gupta, Siddharth [1 ]
机构
[1] Jaypee Inst Informat Technol, Noida, India
关键词
Salp swarm algorithm; Metaheuristic method; Spiral search; Convergence; SEGMENTATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The improvement in networking-technology, sensors, and availability of the internet have accelerated the huge growth to the electronic data. This immense amount of data has stimulated the development of new data analysis methods for better decision making. Clustering is an influential unsupervised data analysis approach, with wide areas of applications. K-Means is a fast and prolific approach for data-clustering present in the literature. However, the algorithm is easily-influenced by the initial positions of cluster-centroids, and the method converges to the local optimum that is nearest to the initial centroid positions. Furthermore, the algorithm cannot process large datasets within a reasonable time-period. In this work, a novel data-clustering method named MapReduce-based Dragonfly Algorithm (MR-DA) is introduced. The efficiency of MR-DA is compared with 4 other recent methods. The experimental results demonstrate that MR-DA surpassed the other considered methods on the majority of the datasets.
引用
收藏
页码:171 / 175
页数:5
相关论文
共 50 条
  • [1] A MapReduce-based artificial bee colony for large-scale data clustering
    Banharnsakun, Anan
    [J]. PATTERN RECOGNITION LETTERS, 2017, 93 : 78 - 84
  • [2] A MapReduce-based parallel K-means clustering for large-scale CIM data verification
    Deng, Chuang
    Liu, Yang
    Xu, Lixiong
    Yang, Jie
    Liu, Junyong
    Li, Siguang
    Li, Maozhen
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (11): : 3096 - 3114
  • [3] MapReduce-Based Crow Search-Adopted Partitional Clustering Algorithms for Handling Large-Scale Data
    Visalakshi, Karthikeyani N.
    Shanthi, S.
    Lakshmi, K.
    [J]. INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (04)
  • [4] One-pass MapReduce-based clustering method for mixed large scale data
    Ben HajKacem, Mohamed Aymen
    Ben N'cir, Chiheb-Eddine
    Essoussi, Nadia
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2019, 52 (03) : 619 - 636
  • [5] One-pass MapReduce-based clustering method for mixed large scale data
    Mohamed Aymen Ben HajKacem
    Chiheb-Eddine Ben N’cir
    Nadia Essoussi
    [J]. Journal of Intelligent Information Systems, 2019, 52 : 619 - 636
  • [6] Scalable Implementation of a MapReduce-based Graph Processing Algorithm for Large-scale Heterogeneous Supercomputers
    Shirahata, Koichi
    Sato, Hitoshi
    Suzumura, Toyotaro
    Matsuoka, Satoshi
    [J]. PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 277 - 284
  • [7] MELT: Mapreduce-based Efficient Large-scale Trajectory Anonymization
    Ward, Katrina
    Lin, Dan
    Madria, Sanjay
    [J]. SSDBM 2017: 29TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2017,
  • [8] MapReduce-based distributed tensor clustering algorithm
    Zhang, Hongjun
    Li, Peng
    Meng, Fanshuo
    Fan, Weibei
    Xue, Zhuangzhuang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (35): : 24633 - 24649
  • [9] MapReduce-based distributed tensor clustering algorithm
    Hongjun Zhang
    Peng Li
    Fanshuo Meng
    Weibei Fan
    Zhuangzhuang Xue
    [J]. Neural Computing and Applications, 2023, 35 : 24633 - 24649
  • [10] ARLS: A MapReduce-based output analysis tool for large-scale simulations
    Lee, Kangsun
    Jung, Kwanghoon
    Park, Joonho
    Kwon, Dongseop
    [J]. ADVANCES IN ENGINEERING SOFTWARE, 2016, 95 : 28 - 37