Study on oceanic big data clustering based on incremental K-means algorithm

Cited by: 0

Authors:

Li Y. [1]

Yang Z. [1]

Han K. [1]

Affiliation:

[1] Key Laboratory for Advanced Technology to Internet of Things, College of Electronics and Information Engineering, Qinzhou University, Guangxi

Source:

International Journal of Innovative Computing and Applications | 2020 / Vol. 11 / No. 2-3

Keywords:

Algorithm; Cluster; Cluster center; Data points; Distance model; Incremental; K-means; MATLAB; Oceanic big; Similarity;

DOI:

10.1504/ijica.2020.107119

Chinese Library Classification number:

Subject classification number:

Abstract:

With the growth of the marine industry in the Beibu Gulf, data clustering has become an important task for intelligent ocean applications. Partition clustering methods are suitable for marine data. However, the traditional K-means algorithm is not suitable for large-scale data. Focusing on the characteristics of oceanic big data, we propose a clustering method based on the incremental K-means (IKM) algorithm. First, a vector model is adopted to represent data sets, and the calculation model for mean values and centres is used to initialise arbitrary numbers of data points. Second, the input data vectors are iteratively calculated in an incremental vector form. Finally, by applying the incremental vector and distance model, the large-scale data are clustered according to a convergence condition. Experiments show that the algorithm can increase clustering efficiency, reduce time and space complexity, and lower the missing data rate. © 2020 Inderscience Enterprises Ltd.
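The abstract does not give the IKM update rules, so the following is only a minimal sketch of a generic sequential (online) K-means pass, not the authors' method. It assumes Euclidean distance as the distance model, seeds the centres with the first k points, and updates a centre with the running-mean rule c ← c + (x − c) / n each time a point is assigned to it. The function name `incremental_kmeans` and all of its parameters are hypothetical.

```python
import numpy as np


def incremental_kmeans(points, k):
    """Generic single-pass online K-means (illustrative, not the paper's IKM)."""
    points = np.asarray(points, dtype=float)
    if len(points) < k:
        raise ValueError("need at least k points to seed the centres")

    # Assumption: the first k points serve as the initial cluster centres.
    centres = points[:k].copy()
    counts = np.ones(k, dtype=int)
    labels = np.empty(len(points), dtype=int)
    labels[:k] = np.arange(k)

    for i in range(k, len(points)):
        x = points[i]
        # Assumed distance model: Euclidean distance to every centre.
        j = int(np.argmin(np.linalg.norm(centres - x, axis=1)))
        counts[j] += 1
        # Incremental mean update: only the affected centre is touched,
        # so previously seen points never need to be revisited.
        centres[j] += (x - centres[j]) / counts[j]
        labels[i] = j

    return centres, labels
```

Because each point is processed once and only running counts and centres are stored, memory use in this sketch is proportional to k rather than to the number of points. The labels it returns are the assignments made at arrival time and are not revised afterwards.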


Pages: 89-95

Number of pages: 6

