Parallelizing k-means Algorithm for 1-d Data Using MPI

被引：10

作者：

Savvas, Ilias K. ^{[1
]}

Sofianidou, Georgia N. ^{[1
]}

机构：

[1] TEI Thessaly, Dept Comp Sci & Engn, Larisa, Greece

来源：

2014 IEEE 23RD INTERNATIONAL WETICE CONFERENCE (WETICE) | 2014年

关键词：

D O I：

10.1109/WETICE.2014.13

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Nowadays, colossal amount of information is produced by computational systems and electronic instruments such as telescopes, medical devices and so on. To explore these petabytes of data, new fast algorithms must be discovered or old ones may be redesigned. One of the most popular and useful techniques in order to discover and extract information from data pools is clustering, and k-means is an algorithm which clusters data according its characteristics. Its main disadvantage is its computational complexity which makes the technique very difficult to apply on big data sets. Although k-means is a very well studied technique, a fully parallel version of it has not been explored yet. In this work, a parallel version of the k-means is presented for 1-d objects. The experimental results obtained are inline with the theoretical outcome and prove both the correctness and the effectiveness of the technique.

引用

页码：179 / 184

页数：6

共 50 条

[21] A Modified K-means Algorithm - Two-Layer K-means Algorithm
Liu, Chen-Chung
Chu, Shao-Wei
Chan, Yung-Kuan
Yu, Shyr-Shen
2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 447 - 450
[22] The SKM Algorithm: A K-Means Algorithm for Clustering Sequential Data
Dias, Jose G.
Cortinhal, Maria Joao
ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2008, PROCEEDINGS, 2008, 5290 : 173 - 182
[23] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
Shi Na
Liu Xumin
Guan Yong
2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
[24] Pattern Discovery Using K-Means Algorithm
Ahmed, Almahdi Mohammed
Norwawi, Norita Md
Ishak, Wan Hussain Wan
Alkilany, Ahmed
2014 WORLD CONGRESS ON COMPUTER APPLICATIONS AND INFORMATION SYSTEMS (WCCAIS), 2014,
[25] K and starting means for k-means algorithm
Fahim, Ahmed
JOURNAL OF COMPUTATIONAL SCIENCE, 2021, 55
[26] K-means clustering algorithm using the entropy
Palubinskas, G
IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING IV, 1998, 3500 : 63 - 71
[27] Applying K-Means Clustering Algorithm Using Oracle Data Mining to Banking Data
Hilala, Jafarova
Rovshan, Aliyev
PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2015, 362 : 809 - 816
[28] An efficient K-means clustering algorithm for tall data
Capo, Marco
Perez, Aritz
Lozano, Jose A.
DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (03) : 776 - 811
[29] An extension of the K-means algorithm to clustering skewed data
Volodymyr Melnykov
Xuwen Zhu
Computational Statistics, 2019, 34 : 373 - 394
[30] An efficient K-means clustering algorithm for tall data
Marco Capó
Aritz Pérez
Jose A. Lozano
Data Mining and Knowledge Discovery, 2020, 34 : 776 - 811

← 1 2 3 4 5 →