Study on oceanic big data clustering based on incremental K-means algorithm

Authors
Li Y. [1 ]
Yang Z. [1 ]
Han K. [1 ]
Affiliation
[1] Key Laboratory for Advanced Technology to Internet of Things, College of Electronics and Information Engineering, Qinzhou University, Guangxi
Keywords
Algorithm; Cluster; Cluster center; Data points; Distance model; Incremental; K-means; MATLAB; Oceanic big; Similarity;
DOI
10.1504/ijica.2020.107119
Abstract
With the growth of the marine industry in the Beibu Gulf, data clustering has become an important task for the intelligent ocean. Partition clustering methods are suitable for marine data. However, the traditional K-means algorithm is not well suited to large-scale data. Focusing on the characteristics of oceanic big data, we propose a clustering method based on the incremental K-means (IKM) algorithm. First, a vector model is adopted to represent the data sets, and a calculation model for mean values and centres is used to initialise an arbitrary number of data points. Second, the input data vectors are processed iteratively in incremental vector form. Finally, by applying the incremental vector and a distance model, the large-scale data are clustered according to a convergence condition. Experiments show that the algorithm increases clustering efficiency, reduces time and space complexity, and lowers the missing data rate. © 2020 Inderscience Enterprises Ltd.
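The three steps in the abstract (initialise centres, update them incrementally per input vector, stop on a convergence condition) can be illustrated with a minimal sketch. This is not the authors' IKM implementation, which the abstract does not specify. It is a generic online K-means with a running-mean centre update, and it makes several assumptions: the first k points seed the centres, Euclidean distance serves as the distance model, and convergence means the largest centre shift over one pass falls below a tolerance. The names incremental_kmeans, tol and max_passes are hypothetical.

```python
import numpy as np


def incremental_kmeans(points, k, tol=1e-6, max_passes=10):
    """Generic online K-means sketch (assumed variant, not the paper's IKM).

    points: sequence of equal-length numeric vectors
    k: number of clusters
    tol: assumed convergence threshold on the largest centre shift per pass
    max_passes: assumed cap on the number of passes over the data
    """
    points = np.asarray(points, dtype=float)
    # Assumption: seed the centres with the first k data vectors.
    centres = points[:k].copy()
    # Number of vectors absorbed by each centre so far.
    counts = np.zeros(k)

    for _ in range(max_passes):
        previous = centres.copy()
        for x in points:
            # Distance model (assumed Euclidean): find the nearest centre.
            j = int(np.argmin(np.linalg.norm(centres - x, axis=1)))
            # Incremental update: the centre becomes the running mean of
            # every vector assigned to it, without storing those vectors.
            counts[j] += 1
            centres[j] += (x - centres[j]) / counts[j]
        # Convergence condition (assumed): centres barely moved this pass.
        if np.linalg.norm(centres - previous, axis=1).max() < tol:
            break

    # Final assignment of every vector to its nearest centre.
    distances = np.linalg.norm(points[:, None, :] - centres[None, :, :], axis=2)
    labels = np.argmin(distances, axis=1)
    return centres, labels
```

Because each update touches only one centre and one counter, the state kept between input vectors is k centres and k counts, which is the usual reason an incremental formulation reduces memory use compared with recomputing every cluster mean from the full data set on each iteration.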
Pages: 89-95
Page count: 6