Parallelizing k-means Algorithm for 1-d Data Using MPI

Cited by: 10
Authors
Savvas, Ilias K. [1]
Sofianidou, Georgia N. [1]
Affiliation
[1] TEI Thessaly, Dept Comp Sci & Engn, Larisa, Greece
DOI
10.1109/WETICE.2014.13
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Nowadays, a colossal amount of information is produced by computational systems and electronic instruments such as telescopes and medical devices. To explore these petabytes of data, new fast algorithms must be devised or existing ones redesigned. Clustering is one of the most popular and useful techniques for discovering and extracting information from data pools, and k-means is an algorithm that clusters data according to their characteristics. Its main disadvantage is its computational complexity, which makes the technique very difficult to apply to big data sets. Although k-means is a very well-studied technique, a fully parallel version of it has not yet been explored. In this work, a parallel version of k-means is presented for 1-d objects. The experimental results are in line with the theoretical expectations and demonstrate both the correctness and the effectiveness of the technique.
Pages: 179-184
Page count: 6