Incremental clustering algorithm based on representative points and covariance for large data

被引:0
|
作者
Li J. [1 ]
Wu Q. [1 ]
Li L. [2 ]
Sun R. [1 ,3 ]
Mu H. [1 ]
Zhao K. [1 ]
机构
[1] College of Information and Electrical Engineering, China Agricultural University, Beijing
[2] Computer School, Beijing Information Science and Technology University, Beijing
[3] Scientific Research Base for Integrated Technologies of Precision Agriculture (Animal Husbandry), The Ministry of Agriculture, Beijing
关键词
clustering algorithms; clustering methods; covariance; density peaks; incremental clustering algorithms; representative points;
D O I
10.1504/IJSPM.2023.136478
中图分类号
学科分类号
摘要
As the dynamic data increases, more space is needed to store the data. However, most traditional clustering methods are time-consuming and only suitable for static data. For this problem, incremental clustering methods are increasingly used in dynamic data. The study proposes an incremental clustering algorithm based on representative points and covariance for large data (IDPC_RC). Firstly, the representative points were selected in the initial data. Then, the similarity between new data points and representative points was calculated to find the pre-allocated cluster. Finally, the covariance determinant was used to measure the degree of local imbalance for pre-allocated clusters after new data is added, and the cluster numbers were adjusted adaptively. The performance of the proposed scheme was tested on five benchmark datasets and real consumption data. The experimental results show the scheme achieves excellent clustering performance and low time consumption on all datasets, which is useful for incremental clustering tasks. Copyright © 2023 Inderscience Enterprises Ltd.
引用
收藏
页码:113 / 124
页数:11
相关论文
共 50 条
  • [21] Incremental Clustering for Time Series Data based on an Improved Leader Algorithm
    Huynh Thi Thu Thuy
    Duong Tuan Anh
    Vo Thi Ngoc Chau
    2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 13 - 18
  • [22] An Efficient Density Based Incremental Clustering Algorithm in Data Warehousing Environment
    Goyal, Navneet
    Goyal, Poonam
    Venkatramaiah, K.
    Deepak, P. C.
    Sanoop, P. S.
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS, 2009, : 556 - 560
  • [23] Incremental clustering algorithm of data stream based on artificial immune network
    Yue, Xun
    Chi, Zhongxian
    Hao, Yanyou
    Mo, Hongwei
    Yue, Xun
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4021 - +
  • [24] An incremental data stream clustering algorithm based on dense units detection
    Gao, J
    Li, JZ
    Zhang, ZG
    Tan, PN
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 420 - 425
  • [25] Research on incremental clustering algorithm for big data
    Yang X.
    Applied Mathematics and Nonlinear Sciences, 2023, 8 (02) : 169 - 180
  • [26] A SOM based Incremental Clustering Algorithm
    Lei Chen
    Zhao, Bao-Jin
    Zhao, Li-Na
    JOURNAL OF COMPUTERS, 2014, 9 (03) : 601 - 607
  • [27] Information bottleneck based incremental fuzzy clustering for large biomedical data
    Liu, Yongli
    Wan, Xing
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 : 48 - 58
  • [28] A novel fuzzy-connectedness-based incremental clustering algorithm for large databases
    Dong, YH
    Tai, XY
    Zhao, JY
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 1, PROCEEDINGS, 2005, 3613 : 470 - 474
  • [29] A Supervised Clustering Algorithm Based on Representative Points and Its Application to Fault Diagnosis of Diesel Engine
    Pang, Yanjun
    Pan, Wei
    Liu, Kaidi
    NANOTECHNOLOGY AND COMPUTER ENGINEERING, 2010, 121-122 : 958 - 963
  • [30] RPC: Representative possible world based consistent clustering algorithm for uncertain data
    Liu, Han
    Zhang, Xiaotong
    Zhang, Xianchao
    Li, Qimai
    Wuc, Xiao-Ming
    COMPUTER COMMUNICATIONS, 2021, 176 : 128 - 137