Incremental clustering algorithm based on representative points and covariance for large data

Cited by: 0
Authors
Li J. [1]
Wu Q. [1]
Li L. [2]
Sun R. [1, 3]
Mu H. [1]
Zhao K. [1]
Affiliations
[1] College of Information and Electrical Engineering, China Agricultural University, Beijing
[2] Computer School, Beijing Information Science and Technology University, Beijing
[3] Scientific Research Base for Integrated Technologies of Precision Agriculture (Animal Husbandry), The Ministry of Agriculture, Beijing
Keywords
clustering algorithms; clustering methods; covariance; density peaks; incremental clustering algorithms; representative points
DOI
10.1504/IJSPM.2023.136478
Abstract
As dynamic data grows, more space is needed to store it. Most traditional clustering methods, however, are time-consuming and suited only to static data, so incremental clustering methods are increasingly used for dynamic data. This study proposes an incremental clustering algorithm based on representative points and covariance for large data (IDPC_RC). First, representative points are selected from the initial data. Then, the similarity between each new data point and the representative points is calculated to find a pre-allocated cluster. Finally, the covariance determinant is used to measure the degree of local imbalance in the pre-allocated cluster after the new data is added, and the number of clusters is adjusted adaptively. The proposed scheme was tested on five benchmark datasets and on real consumption data. The experimental results show that the scheme achieves excellent clustering performance and low time consumption on all datasets, which makes it useful for incremental clustering tasks. Copyright © 2023 Inderscience Enterprises Ltd.
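The three steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' IDPC_RC implementation: the abstract does not give the similarity measure, the imbalance criterion, or the rule for adjusting the number of clusters. The sketch assumes Euclidean distance as the similarity, a hypothetical `ratio_threshold` on the growth of the covariance determinant, and a simple "open a new cluster" adjustment. All function and parameter names are illustrative.

```python
import numpy as np


def nearest_representative(point, reps, rep_labels):
    """Label of the representative closest to `point` (Euclidean distance, an assumption)."""
    dists = np.linalg.norm(reps - point, axis=1)
    return int(rep_labels[int(np.argmin(dists))])


def covariance_determinant(points):
    """Determinant of the sample covariance matrix; 0.0 if fewer than two points."""
    if len(points) < 2:
        return 0.0
    cov = np.atleast_2d(np.cov(points, rowvar=False))
    return float(np.linalg.det(cov))


def add_point(point, clusters, reps, rep_labels, ratio_threshold=2.0):
    """Assign one new point incrementally.

    clusters   : dict mapping label -> (n, d) array of member points
    reps       : (k, d) array of representative points
    rep_labels : length-k integer array, cluster label of each representative
    Returns (label, reps, rep_labels); `clusters` is updated in place.
    """
    point = np.asarray(point, dtype=float)
    # Step 2: pre-allocate the point to the cluster of its nearest representative.
    label = nearest_representative(point, reps, rep_labels)
    # Step 3: compare the covariance determinant before and after adding the point.
    before = covariance_determinant(clusters[label])
    candidate = np.vstack([clusters[label], point])
    after = covariance_determinant(candidate)
    if before > 0 and after / before > ratio_threshold:
        # Hypothetical adjustment rule: the point distorts the cluster too much,
        # so it starts a new cluster and becomes that cluster's representative.
        new_label = max(clusters) + 1
        clusters[new_label] = point.reshape(1, -1)
        reps = np.vstack([reps, point])
        rep_labels = np.append(rep_labels, new_label)
        return new_label, reps, rep_labels
    clusters[label] = candidate
    return label, reps, rep_labels
```

In this sketch a point close to an existing cluster leaves the determinant almost unchanged and is merged, while a distant point inflates the determinant and opens a new cluster. The threshold of 2.0 is arbitrary and would need tuning on real data.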
Pages: 113 - 124
Number of pages: 11
Related papers
50 items in total
  • [31] Fuzzy joint points based clustering algorithms for large data sets
    Nasibov, Efendi
    Atilgan, Can
    Berberler, Murat Ersen
    Nasiboglu, Resmiye
    FUZZY SETS AND SYSTEMS, 2015, 270 : 111 - 126
  • [32] Design and implementation of clustering algorithm using representative data
    Chen, Enhong
    Wang, Shangfei
    Ning, Yan
    Wang, Xufa
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2001, 14 (04):
  • [33] Incremental relative density-based clustering algorithm for mixture data sets
    Huang, De-Cai
    Li, Xiao-Chang
    Kongzhi yu Juece/Control and Decision, 2013, 28 (06) : 815 - 822
  • [34] Incremental Clustering Algorithm for Earth Science Data Mining
    Vatsavi, Ranga Raju
    COMPUTATIONAL SCIENCE - ICCS 2009, 2009, 5545 : 375 - 384
  • [35] PBIRCH: A scalable parallel clustering algorithm for incremental data
    Garg, Ashwani
    Mangla, Ashish
    Gupta, Neelima
    Bhatnagar, Vasudha
    10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 315 - +
  • [36] An incremental irregular grid algorithm for clustering data streams
    College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
    Harbin Gongcheng Daxue Xuebao, 2008, (08) : 846 - 850
  • [37] Split incremental clustering algorithm of mixed data stream
    Siwar Gorrab
    Fahmi Ben Rejab
    Kaouther Nouira
    Progress in Artificial Intelligence, 2024, 13 : 51 - 64
  • [38] Split incremental clustering algorithm of mixed data stream
    Gorrab, Siwar
    Ben Rejab, Fahmi
    Nouira, Kaouther
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2024, 13 (01) : 51 - 64
  • [39] An incremental clustering algorithm based on hyperbolic smoothing
    Bagirov, A. M.
    Ordin, B.
    Ozturk, G.
    Xavier, A. E.
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2015, 61 (01) : 219 - 241
  • [40] ICA: An Incremental Clustering Algorithm Based on OPTICS
    Fu, Jun-Song
    Liu, Yun
    Chao, Han-Chieh
    WIRELESS PERSONAL COMMUNICATIONS, 2015, 84 (03) : 2151 - 2170