Parallelizing k-means Algorithm for 1-d Data Using MPI

Cited by: 10
Authors
Savvas, Ilias K. [1]
Sofianidou, Georgia N. [1]
Affiliation
[1] TEI Thessaly, Dept Comp Sci & Engn, Larisa, Greece
Keywords
DOI
10.1109/WETICE.2014.13
Chinese Library Classification number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
Nowadays, a colossal amount of information is produced by computational systems and electronic instruments such as telescopes and medical devices. To explore these petabytes of data, new fast algorithms must be developed or existing ones redesigned. Clustering is one of the most popular and useful techniques for discovering and extracting information from data pools, and k-means is an algorithm that clusters data according to its characteristics. Its main disadvantage is its computational complexity, which makes the technique very difficult to apply to big data sets. Although k-means is a very well-studied technique, a fully parallel version of it has not yet been explored. In this work, a parallel version of k-means is presented for 1-d objects. The experimental results are in line with the theoretical outcome and demonstrate both the correctness and the effectiveness of the technique.
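The abstract does not spell out the parallel scheme, so the following is only an illustrative sketch of one common data-parallel formulation of 1-d k-means, not the authors' algorithm. It assumes the data are split across workers, each worker computes per-cluster partial sums and counts, and the partial results are combined to update the centroids. The workers are simulated in plain Python; the names `local_step` and `kmeans_1d_partitioned` and all parameters are hypothetical.

```python
from typing import List, Sequence, Tuple


def local_step(chunk: Sequence[float],
               centroids: Sequence[float]) -> Tuple[List[float], List[int]]:
    """One worker's share of an iteration (hypothetical helper).

    Assigns each 1-d point in the chunk to its nearest centroid and
    returns per-cluster partial sums and partial counts.
    """
    k = len(centroids)
    sums = [0.0] * k
    counts = [0] * k
    for x in chunk:
        nearest = min(range(k), key=lambda c: abs(x - centroids[c]))
        sums[nearest] += x
        counts[nearest] += 1
    return sums, counts


def kmeans_1d_partitioned(data: Sequence[float],
                          centroids: Sequence[float],
                          n_workers: int = 4,
                          max_iter: int = 100,
                          tol: float = 1e-9) -> List[float]:
    """1-d k-means with the data split across simulated workers.

    Assumption: this loop stands in for message passing. In an MPI
    program each chunk would live on a different rank and the
    element-wise summation below would be a collective reduction.
    """
    chunks = [data[r::n_workers] for r in range(n_workers)]
    current = list(centroids)
    k = len(current)
    for _ in range(max_iter):
        # Each simulated worker processes only its own chunk.
        partials = [local_step(chunk, current) for chunk in chunks]
        # Combine partial results (the "reduce" step).
        total_sums = [sum(p[0][j] for p in partials) for j in range(k)]
        total_counts = [sum(p[1][j] for p in partials) for j in range(k)]
        # Empty clusters keep their previous centroid.
        updated = [total_sums[j] / total_counts[j] if total_counts[j]
                   else current[j] for j in range(k)]
        shift = max(abs(a - b) for a, b in zip(updated, current))
        current = updated
        if shift <= tol:
            break
    return current


if __name__ == "__main__":
    points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
    print(kmeans_1d_partitioned(points, [0.0, 5.0], n_workers=2))
```

In an actual MPI program, the combination of partial sums and counts would typically map to an all-reduce operation such as `MPI_Allreduce`, so that every rank ends each iteration with the same centroids; whether the paper uses this pattern is not stated in the abstract.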
Pages: 179-184
Page count: 6
Related papers
50 in total
  • [1] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015: 890 - 893
  • [2] Parallelizing DBSCAN Algorithm Using MPI
    Savvas, Ilias K.
    Tselios, Dimitrios
    2016 IEEE 25TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2016: 77 - 82
  • [3] Implementation of parallel K-means algorithm for image classification using OpenMP and MPI libraries
    Tanovic, Anja
    Vranjkovic, Vuk
    2024 ZOOMING INNOVATION IN CONSUMER TECHNOLOGIES CONFERENCE, ZINC 2024, 2024: 54 - 59
  • [4] ABK-means: an algorithm for data clustering using ABC and K-means algorithm
    Krishnamoorthi, M.
    Natarajan, A. M.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2013, 8 (04) : 383 - 391
  • [5] NEW ALGORITHM FOR CLUSTERING DISTRIBUTED DATA USING K-MEANS
    Khedr, Ahmed M.
    Bhatnagar, Raj K.
    COMPUTING AND INFORMATICS, 2014, 33 (04) : 943 - 964
  • [6] Using K-Means Clustering Algorithm for Handling Data Precision
    Suganthi, P.
    Kala, K.
    Balasubramanian, C.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016
  • [7] A novel near-parallel version of k-means algorithm for n-dimensional data objects using MPI
    Savvas, Ilias K.
    Sofianidou, Georgia N.
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2016, 7 (02) : 80 - 91
  • [8] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
  • [9] On K-means Data Clustering Algorithm with Genetic Algorithm
    Kapil, Shruti
    Chawla, Meenu
    Ansari, Mohd Dilshad
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016: 202 - 206
  • [10] A K-Means Algorithm Application on Big Data
    Eren, Beste
    Karabulut, Ezgi Cilga
    Alptekin, S. Emre
    Alptekin, Gulfem Isiklar
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2015, VOL II, 2015: 814 - 818