Scalable Parallel Clustering Approach for Large Data Using Parallel K Means and Firefly Algorithms

被引:0
|
作者
Mathew, Juby [1 ]
Vijayakumar, R. [2 ]
机构
[1] Amaljyothi Coll Engn, Dept MCA, Kanjirappally, Kerala, India
[2] Mahatma Gandhi Univ, Kottayam, Kerala, India
来源
2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND APPLICATIONS (ICHPCA) | 2014年
关键词
Clustering; k-means; parallel k-means; Firefly algorithm; join and fork parallelism;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper mainly focuses in identifying the limitations of the k means algorithm and to propose the parallelization of the k-means using firefly based clustering method. The new parallel architecture can handle large number of clusters. Firefly algorithm to find initial optimal cluster centroid and then k-means algorithm with optimized centroid to refined them and improve clustering accuracy. The final convergence issue is also addressed and solved to a great extent. Finally modified algorithm is compared with parallel k means is demonstrated with experiments and it has been found that the performance of modified algorithm is better than the existing algorithm. Four typical benchmark data sets from the UCI machine learning repository are used to demonstrate the results of the techniques. To achieve this we can use fork/join method in java programming. It is the most effective design method for achieve good parallel performance
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Scalable parallel clustering approach for large data using genetic possibilistic fuzzy c-means algorithm
    Mathew, Juby
    Vijayakumar, R.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 226 - 232
  • [2] Parallel k/h-means clustering for large data sets
    Stoffel, K
    Belkoniene, A
    EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 1451 - 1454
  • [3] Parallel k-Means Clustering for Quantitative Ecoregion Delineation Using Large Data Sets
    Kumar, Jitendra
    Mills, Richard T.
    Hoffman, Forrest M.
    Hargrove, William W.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 1602 - 1611
  • [4] Data decomposition for parallel K-means clustering
    Gursoy, A
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2004, 3019 : 241 - 248
  • [5] A Parallel Forecasting Approach Using Incremental K-means Clustering Technique
    Sahoo, Swagatika
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM 2016, 2017, 556 : 165 - 172
  • [6] Parallel batch k-means for Big data clustering
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Sukhostat, Lyudmila, V
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 152
  • [7] Scalable Parallel Algorithms for Shared Nearest Neighbor Clustering
    Kumari, Sonal
    Maurya, Saurabh
    Goyal, Poonam
    Balasubramaniam, Sundar S.
    Goyal, Navneet
    PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 72 - 81
  • [8] Parallel BVH construction using k-means clustering
    Daniel Meister
    Jiří Bittner
    The Visual Computer, 2016, 32 : 977 - 987
  • [9] Parallel BVH construction using k-means clustering
    Meister, Daniel
    Bittner, Jiri
    VISUAL COMPUTER, 2016, 32 (6-8): : 977 - 987
  • [10] Performance Analysis of Parallel K-Means with Optimization Algorithms for Clustering on Spark
    Santhi, V.
    Jose, Rini
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY (ICDCIT 2018), 2018, 10722 : 158 - 162