Study on oceanic big data clustering based on incremental K-means algorithm

被引:0
|
作者
Li Y. [1 ]
Yang Z. [1 ]
Han K. [1 ]
机构
[1] Key Laboratory for Advanced Technology to Internet of Things, College of Electronics and Information Engineering, Qinzhou University, Guangxi
关键词
Algorithm; Cluster; Cluster center; Data points; Distance model; Incremental; K-means; MATLAB; Oceanic big; Similarity;
D O I
10.1504/ijica.2020.107119
中图分类号
学科分类号
摘要
With the increase of marine industry in the Beibu Gulf, data clustering has become an important task of intelligent ocean. Partition clustering methods are suitable for marine data. However, traditional K-means algorithm is not suitable for large scale data. Focusing on the characteristics of oceanic big data, we propose a clustering method based on incremental K-means (IKM) algorithm. First, a vector model is adopted to represent data sets, and the calculation model for mean values and centres is used to initialise arbitrary numbers of data points. Second, the input data vectors are iteratively calculated in an incremental vector form. Finally, by applying incremental vector and distance model, the large-scale data are clustered according to convergence condition. Experiments show that the algorithm can increase the clustering efficiency, reduce time and space complexity, and lower the missing data rate. © 2020 Inderscience Enterprises Ltd.
引用
收藏
页码:89 / 95
页数:6
相关论文
共 50 条
  • [21] An improved K-means algorithm for big data
    Moodi, Fatemeh
    Saadatfar, Hamid
    IET SOFTWARE, 2022, 16 (01) : 48 - 59
  • [22] A K-Means Algorithm Application on Big Data
    Eren, Beste
    Karabulut, Ezgi Cilga
    Alptekin, S. Emre
    Alptekin, Gulfem Isiklar
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2015, VOL II, 2015, : 814 - 818
  • [23] On K-means Data Clustering Algorithm with Genetic Algorithm
    Kapil, Shruti
    Chawla, Meenu
    Ansari, Mohd Dilshad
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 202 - 206
  • [24] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [25] Design of Intelligent K-Means Based on Spark for Big Data Clustering
    Kusuma, Ilham
    Ma'sum, M. Anwar
    Habibie, Novian
    Jatmiko, Wisnu
    Suhartanto, Heru
    2016 INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS), 2016, : 89 - 95
  • [26] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [27] How to Use K-means for Big Data Clustering?
    Mussabayev, Rustam
    Mladenovic, Nenad
    Jarboui, Bassem
    Mussabayev, Ravil
    PATTERN RECOGNITION, 2023, 137
  • [28] Parallel batch k-means for Big data clustering
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Sukhostat, Lyudmila, V
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 152
  • [29] An incremental K-means algorithm
    Pham, DT
    Dimov, SS
    Nguyen, CD
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2004, 218 (07) : 783 - 795
  • [30] Clustering for Binary Data Sets by Using Genetic Algorithm-Incremental K-means
    Saharan, S.
    Baragona, R.
    Nor, M. E.
    Salleh, R. M.
    Asrah, N. M.
    INTERNATIONAL SEMINAR ON MATHEMATICS AND PHYSICS IN SCIENCES AND TECHNOLOGY 2017 (ISMAP 2017), 2018, 995