A Density Grid-based Clustering Algorithm for Uncertain Data Streams

被引:2
|
作者
Tu, Li [1 ]
Cui, Peng [1 ]
Tang, Keming [2 ]
机构
[1] Jiangyin Polytech Coll, Dept Comp Sci, Jiangsu Engn R&D Ctr Informat Fus Software, Jiangyin, Peoples R China
[2] Yancheng Teachers Univ, Coll Informat Sci & Technol, Yancheng, Peoples R China
关键词
clustering; uncertain stream; probability center; grid;
D O I
10.1109/WISA.2013.71
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a grid-based clustering algorithm Clu-US which is competent to find clusters of non-convex shapes on uncertain data stream. Clu-US maps the uncertain data tuples to the grid space which could store and update the summary information of stream. The uncertainty of data is taken into account for calculating the probability center of a grid. Then, the distance between the probability centers of two adjacent grids is adopted for measuring whether they are "close enough" in grids merging process. Furthermore, a dynamic outlier deletion mechanism is developed to improve clustering performance. The experimental results show that Clu-US outperforms other algorithms in terms of clustering quality and speed.
引用
收藏
页码:347 / +
页数:2
相关论文
共 50 条
  • [1] A density grid-based uncertain data stream clustering algorithm
    Zhao, J. (jintianzhao@yahoo.com), 1600, Binary Information Press (10):
  • [2] A Systematic Review of Density Grid-Based Clustering for Data Streams
    Tareq, Mustafa
    Sundararajan, Elankovan A.
    Harwood, Aaron
    Abu Bakar, Azuraliza
    IEEE ACCESS, 2022, 10 : 579 - 596
  • [3] A grid-based clustering algorithm for high-dimensional data streams
    Lu, YS
    Sun, YF
    Xu, GP
    Liu, G
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 824 - 831
  • [4] Online Clustering of Evolving Data Streams Using a Density Grid-Based Method
    Tareq, Mustafa
    Sundararajan, Elankovan A.
    Mohd, Masnizah
    Sani, Nor Samsiah
    IEEE ACCESS, 2020, 8 : 166472 - 166490
  • [5] A Grid-Based Density Peaks Clustering Algorithm
    Fang, Xintong
    Xu, Zhen
    Ji, Haifeng
    Wang, Baoliang
    Huang, Zhiyao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (04) : 5476 - 5484
  • [6] A grid-based subspace clustering algorithm for high-dimensional data streams
    Sun, Yufen
    Lu, Yansheng
    WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 37 - 48
  • [7] Clustering data streams using grid-based synopsis
    Bhatnagar, Vasudha
    Kaur, Sharanjit
    Chakravarthy, Sharma
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (01) : 127 - 152
  • [8] Clustering data streams using grid-based synopsis
    Vasudha Bhatnagar
    Sharanjit Kaur
    Sharma Chakravarthy
    Knowledge and Information Systems, 2014, 41 : 127 - 152
  • [9] Statistical grid-based clustering over data streams
    Park, NH
    Lee, WS
    SIGMOD RECORD, 2004, 33 (01) : 32 - 37
  • [10] Grid-based clustering algorithm for muilti-density
    Qiu, BZ
    Zhang, XZ
    Shen, JY
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 1509 - 1512