Scalable Parallel Clustering Approach for Large Data Using Parallel K Means and Firefly Algorithms

被引:0
|
作者
Mathew, Juby [1 ]
Vijayakumar, R. [2 ]
机构
[1] Amaljyothi Coll Engn, Dept MCA, Kanjirappally, Kerala, India
[2] Mahatma Gandhi Univ, Kottayam, Kerala, India
来源
2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND APPLICATIONS (ICHPCA) | 2014年
关键词
Clustering; k-means; parallel k-means; Firefly algorithm; join and fork parallelism;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper mainly focuses in identifying the limitations of the k means algorithm and to propose the parallelization of the k-means using firefly based clustering method. The new parallel architecture can handle large number of clusters. Firefly algorithm to find initial optimal cluster centroid and then k-means algorithm with optimized centroid to refined them and improve clustering accuracy. The final convergence issue is also addressed and solved to a great extent. Finally modified algorithm is compared with parallel k means is demonstrated with experiments and it has been found that the performance of modified algorithm is better than the existing algorithm. Four typical benchmark data sets from the UCI machine learning repository are used to demonstrate the results of the techniques. To achieve this we can use fork/join method in java programming. It is the most effective design method for achieve good parallel performance
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Visual clustering of multidimensional and large data sets using parallel environments
    Blasiak, J
    Dzwinel, W
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1998, 1401 : 403 - 410
  • [42] Network Traffic Anomaly Detection Using Shallow Packet Inspection and Parallel K-means Data Clustering
    Velea, Radu
    Ciobanip, Casian
    Margarit, Laurentiu
    Bica, Ion
    STUDIES IN INFORMATICS AND CONTROL, 2017, 26 (04): : 387 - 395
  • [43] Parallel Implementation of K-Means Algorithm Using MapReduce Approach
    Borlea, Ioan-Daniel
    Precup, Radu-Emil
    Dragan, Florin
    Borlea, Alexandra-Bianca
    2018 IEEE 12TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS (SACI), 2018, : 75 - 80
  • [44] Massively Parallel k -Means Clustering for Perturbation Resilient Instances
    Cohen-Addad, Vincent
    Mirrokni, Vahab
    Zhong, Peilin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [45] Parallel K-means clustering algorithm on DNA dataset
    Othman, F
    Abdullah, R
    Rashid, NA
    Salam, RA
    PARALLEL AND DISTRIBUTED COMPUTING: APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2004, 3320 : 248 - 251
  • [46] An Improved parallel K-means Clustering Algorithm with MapReduce
    Liao, Qing
    Yang, Fan
    Zhao, Jingming
    2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 764 - 768
  • [47] Clustering the Patent Data Using K-Means Approach
    Anuranjana
    Mittas, Nisha
    Mehrotra, Deepti
    SOFTWARE ENGINEERING (CSI 2015), 2019, 731 : 639 - 645
  • [48] Parallel bisecting k-means with prediction clustering algorithm
    Li, Yanjun
    Chung, Soon M.
    JOURNAL OF SUPERCOMPUTING, 2007, 39 (01): : 19 - 37
  • [49] Enhanced Parallel Implementation of the K-Means Clustering Algorithm
    Baydoun, Mohammed
    Dawi, Mohammad
    Ghaziri, Hassan
    2016 3RD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS (ACTEA), 2016, : 7 - 11
  • [50] Parallel bisecting k-means with prediction clustering algorithm
    Yanjun Li
    Soon M. Chung
    The Journal of Supercomputing, 2007, 39 : 19 - 37