Study on oceanic big data clustering based on incremental K-means algorithm

被引：0

作者：

Li Y. ^{[1
]}

Yang Z. ^{[1
]}

Han K. ^{[1
]}

机构：

[1] Key Laboratory for Advanced Technology to Internet of Things, College of Electronics and Information Engineering, Qinzhou University, Guangxi

来源：

International Journal of Innovative Computing and Applications | 2020年 / 11卷 / 2-3期

关键词：

Algorithm; Cluster; Cluster center; Data points; Distance model; Incremental; K-means; MATLAB; Oceanic big; Similarity;

D O I：

10.1504/ijica.2020.107119

中图分类号：

学科分类号：

摘要：

With the increase of marine industry in the Beibu Gulf, data clustering has become an important task of intelligent ocean. Partition clustering methods are suitable for marine data. However, traditional K-means algorithm is not suitable for large scale data. Focusing on the characteristics of oceanic big data, we propose a clustering method based on incremental K-means (IKM) algorithm. First, a vector model is adopted to represent data sets, and the calculation model for mean values and centres is used to initialise arbitrary numbers of data points. Second, the input data vectors are iteratively calculated in an incremental vector form. Finally, by applying incremental vector and distance model, the large-scale data are clustered according to convergence condition. Experiments show that the algorithm can increase the clustering efficiency, reduce time and space complexity, and lower the missing data rate. © 2020 Inderscience Enterprises Ltd.

引用

页码：89 / 95

页数：6

共 50 条

[31] Dynamic Incremental K-means Clustering
Aaron, Bryant
Tamir, Dan E.
Rishe, Naphtali D.
Kandel, Abraham
2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1, 2014, : 308 - 313
[32] HdK-Means: Hadoop Based Parallel K-Means Clustering for Big Data
Bandyopadhyay, Soumyendu Sekhar
Halder, Anup Kumar
Chatterjee, Piyali
Nasipuri, Mita
Basu, Subhadip
2017 IEEE CALCUTTA CONFERENCE (CALCON), 2017, : 452 - 456
[33] Research on parallel association rule mining of big data based on an improved K-means clustering algorithm
Hao, Li
Wang, Tuanbu
Guo, Chaoping
INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2023, 16 (03) : 233 - 247
[34] Improvement Study and Application Based on K-Means Clustering Algorithm
Luo, Yu
Yu, Li
Liu, Xing-hua
FUZZY INFORMATION AND ENGINEERING, VOLUME 2, 2009, 62 : 937 - +
[35] Effective Clustering Analysis Based on New Designed Clustering Validity Index and Revised K-means Algorithm for Big Data
Zhu, Erzhou
Wen, Peng
Zhu, Binbin
Liu, Feng
Wang, Futian
Li, Xuejun
2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 96 - 102
[36] A Clustering Method Based on K-Means Algorithm
Li, Youguo
Wu, Haiyan
INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 1104 - 1109
[37] A Fuzzy Clustering Algorithm Based on K-means
Yan, Zhen
Pi, Dechang
ECBI: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMMERCE AND BUSINESS INTELLIGENCE, PROCEEDINGS, 2009, : 523 - 528
[38] Data clustering using K-Means based on Crow Search Algorithm
K Lakshmi
N Karthikeyani Visalakshi
S Shanthi
Sādhanā, 2018, 43
[39] A fast K-Means clustering algorithm based on grid data reduction
Li, Daqi
Shen, Junyi
Chen, Hongmin
2008 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2008, : 2273 - +
[40] An extended study of the K-means algorithm for data clustering and its applications
Chen, JS
Ching, RKH
Lin, YS
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2004, 55 (09) : 976 - 987

← 1 2 3 4 5 →