A Density Grid-based Clustering Algorithm for Uncertain Data Streams

被引:2
|
作者
Tu, Li [1 ]
Cui, Peng [1 ]
Tang, Keming [2 ]
机构
[1] Jiangyin Polytech Coll, Dept Comp Sci, Jiangsu Engn R&D Ctr Informat Fus Software, Jiangyin, Peoples R China
[2] Yancheng Teachers Univ, Coll Informat Sci & Technol, Yancheng, Peoples R China
关键词
clustering; uncertain stream; probability center; grid;
D O I
10.1109/WISA.2013.71
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a grid-based clustering algorithm Clu-US which is competent to find clusters of non-convex shapes on uncertain data stream. Clu-US maps the uncertain data tuples to the grid space which could store and update the summary information of stream. The uncertainty of data is taken into account for calculating the probability center of a grid. Then, the distance between the probability centers of two adjacent grids is adopted for measuring whether they are "close enough" in grids merging process. Furthermore, a dynamic outlier deletion mechanism is developed to improve clustering performance. The experimental results show that Clu-US outperforms other algorithms in terms of clustering quality and speed.
引用
收藏
页码:347 / +
页数:2
相关论文
共 50 条
  • [1] A density grid-based uncertain data stream clustering algorithm
    [J]. Zhao, J. (jintianzhao@yahoo.com), 1600, Binary Information Press (10):
  • [2] A Systematic Review of Density Grid-Based Clustering for Data Streams
    Tareq, Mustafa
    Sundararajan, Elankovan A.
    Harwood, Aaron
    Abu Bakar, Azuraliza
    [J]. IEEE ACCESS, 2022, 10 : 579 - 596
  • [3] A grid-based clustering algorithm for high-dimensional data streams
    Lu, YS
    Sun, YF
    Xu, GP
    Liu, G
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 824 - 831
  • [4] Online Clustering of Evolving Data Streams Using a Density Grid-Based Method
    Tareq, Mustafa
    Sundararajan, Elankovan A.
    Mohd, Masnizah
    Sani, Nor Samsiah
    [J]. IEEE ACCESS, 2020, 8 : 166472 - 166490
  • [5] A Grid-Based Density Peaks Clustering Algorithm
    Fang, Xintong
    Xu, Zhen
    Ji, Haifeng
    Wang, Baoliang
    Huang, Zhiyao
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (04) : 5476 - 5484
  • [6] A grid-based subspace clustering algorithm for high-dimensional data streams
    Sun, Yufen
    Lu, Yansheng
    [J]. WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 37 - 48
  • [7] Clustering data streams using grid-based synopsis
    Bhatnagar, Vasudha
    Kaur, Sharanjit
    Chakravarthy, Sharma
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (01) : 127 - 152
  • [8] Clustering data streams using grid-based synopsis
    Vasudha Bhatnagar
    Sharanjit Kaur
    Sharma Chakravarthy
    [J]. Knowledge and Information Systems, 2014, 41 : 127 - 152
  • [9] Statistical grid-based clustering over data streams
    Park, NH
    Lee, WS
    [J]. SIGMOD RECORD, 2004, 33 (01) : 32 - 37
  • [10] EDACluster: An evolutionary density and grid-based clustering algorithm
    De Oliveira, Cisar S.
    Godinho, Paulo Igor
    Meiguins, Aruanda S. G.
    Meiguins, Bianchi S.
    Freitas, Alex A.
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 143 - +