Incremental clustering algorithm based on representative points and covariance for large data

被引:0
|
作者
Li J. [1 ]
Wu Q. [1 ]
Li L. [2 ]
Sun R. [1 ,3 ]
Mu H. [1 ]
Zhao K. [1 ]
机构
[1] College of Information and Electrical Engineering, China Agricultural University, Beijing
[2] Computer School, Beijing Information Science and Technology University, Beijing
[3] Scientific Research Base for Integrated Technologies of Precision Agriculture (Animal Husbandry), The Ministry of Agriculture, Beijing
关键词
clustering algorithms; clustering methods; covariance; density peaks; incremental clustering algorithms; representative points;
D O I
10.1504/IJSPM.2023.136478
中图分类号
学科分类号
摘要
As the dynamic data increases, more space is needed to store the data. However, most traditional clustering methods are time-consuming and only suitable for static data. For this problem, incremental clustering methods are increasingly used in dynamic data. The study proposes an incremental clustering algorithm based on representative points and covariance for large data (IDPC_RC). Firstly, the representative points were selected in the initial data. Then, the similarity between new data points and representative points was calculated to find the pre-allocated cluster. Finally, the covariance determinant was used to measure the degree of local imbalance for pre-allocated clusters after new data is added, and the cluster numbers were adjusted adaptively. The performance of the proposed scheme was tested on five benchmark datasets and real consumption data. The experimental results show the scheme achieves excellent clustering performance and low time consumption on all datasets, which is useful for incremental clustering tasks. Copyright © 2023 Inderscience Enterprises Ltd.
引用
收藏
页码:113 / 124
页数:11
相关论文
共 50 条
  • [1] REPRESENTATIVE POINTS AND CLUSTER ATTRIBUTES BASED INCREMENTAL SEQUENCE CLUSTERING ALGORITHM
    Wu, Di
    Ren, Jiadong
    COMPUTING AND INFORMATICS, 2017, 36 (06) : 1361 - 1384
  • [2] Incremental clustering of dynamic data streams using connectivity based representative points
    Luehr, Sebastian
    Lazarescu, Mihai
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (01) : 1 - 27
  • [3] Fast clustering algorithm based on MST of representative points
    Du, Hui
    Lu, Depeng
    Wang, Zhihe
    Ma, Cuntao
    Shi, Xinxin
    Wang, Xiaoli
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 15830 - 15858
  • [4] Spectral Clustering Algorithm Based on Density Representative Points
    Kong, Weiyu
    Wang, Guoyin
    Xie, Jiang
    Bai, Hao
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 147 - 151
  • [5] Large-Scale Spectral Clustering Based on Representative Points
    Yang, Libo
    Liu, Xuemei
    Nie, Feiping
    Liu, Mingtang
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [6] Overlapping clustering incremental algorithm for large data collections processing
    Janier Gonzalez-Soler, Lazaro
    Perez-Suarez, Airel
    Chang-Fernandez, Leonardo
    GECONTEC-REVISTA INTERNACIONAL DE GESTION DEL CONOCIMIENTO Y LA TECNOLOGIA, 2015, 3 (02): : 1 - 12
  • [7] Representative points clustering algorithm based on density factor and relevant degree
    Wu, Di
    Ren, Jiadong
    Sheng, Long
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (02) : 641 - 649
  • [8] Representative points clustering algorithm based on density factor and relevant degree
    Di Wu
    Jiadong Ren
    Long Sheng
    International Journal of Machine Learning and Cybernetics, 2017, 8 : 641 - 649
  • [9] Data Incremental Clustering Algorithm based on Differential Privacy
    Gao, Qing
    Wang, Xiujun
    Gao, Yan
    Tao, Tao
    2023 IEEE 9TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2023,
  • [10] An Artificial Immune Based Incremental Data Clustering Algorithm
    Xiao, Xin
    ADVANCES IN CIVIL ENGINEERING II, PTS 1-4, 2013, 256-259 : 2935 - 2938