A Density Grid-based Clustering Algorithm for Uncertain Data Streams

被引:2
|
作者
Tu, Li [1 ]
Cui, Peng [1 ]
Tang, Keming [2 ]
机构
[1] Jiangyin Polytech Coll, Dept Comp Sci, Jiangsu Engn R&D Ctr Informat Fus Software, Jiangyin, Peoples R China
[2] Yancheng Teachers Univ, Coll Informat Sci & Technol, Yancheng, Peoples R China
关键词
clustering; uncertain stream; probability center; grid;
D O I
10.1109/WISA.2013.71
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a grid-based clustering algorithm Clu-US which is competent to find clusters of non-convex shapes on uncertain data stream. Clu-US maps the uncertain data tuples to the grid space which could store and update the summary information of stream. The uncertainty of data is taken into account for calculating the probability center of a grid. Then, the distance between the probability centers of two adjacent grids is adopted for measuring whether they are "close enough" in grids merging process. Furthermore, a dynamic outlier deletion mechanism is developed to improve clustering performance. The experimental results show that Clu-US outperforms other algorithms in terms of clustering quality and speed.
引用
收藏
页码:347 / +
页数:2
相关论文
共 50 条
  • [21] A deflected grid-based algorithm for clustering analysis
    Department of Computer Science and Information Engineering, Tamkang University, 151 Ying-Chuan Road, Tamsui, Taipei County, Taiwan
    WSEAS Trans. Comput., 2008, 3 (125-132):
  • [22] Clustering Algorithm Based on Grid and Density for Data Stream
    Wang, Lang
    Li, Haiqing
    MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839
  • [23] Clustering over data streams based on grid density and index tree
    Ren J.
    Cai B.
    Hu C.
    Journal of Convergence Information Technology, 2011, 6 (01) : 83 - 93
  • [24] Data Streams Clustering Algorithm Based on Grid and Particle Swarm Optimization
    Ke, Luo
    Lin, Wang
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 93 - 96
  • [25] Non-parametric grid-based clustering algorithm for remote sensing data
    Pestunov, IA
    Sinyavsky, YN
    Proceedings of the Second IASTED International Multi-Conference on Automation, Control, and Information Technology - Signal and Image Processing, 2005, : 5 - 9
  • [26] Density Grid-Based Clustering for Wireless Sensors Networks
    Abdullah, Manal
    Eldin, Hend Nour
    Al-Moshadak, Tahani
    Alshaik, Rawan
    Al-Anesi, Inas
    INTERNATIONAL CONFERENCE ON COMMUNICATIONS, MANAGEMENT, AND INFORMATION TECHNOLOGY (ICCMIT'2015), 2015, 65 : 35 - 47
  • [27] GACH: a grid-based algorithm for hierarchical clustering of high-dimensional data
    Mansoori, Eghbal G.
    SOFT COMPUTING, 2014, 18 (05) : 905 - 922
  • [28] GACH: a grid-based algorithm for hierarchical clustering of high-dimensional data
    Eghbal G. Mansoori
    Soft Computing, 2014, 18 : 905 - 922
  • [29] A real-time grid-based clustering algorithm for large data set
    Yu, Zhiwen
    Wong, Hau-San
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 740 - +
  • [30] PGMCLU: A Novel Parallel Grid-based Clustering Algorithm for Multi-density Datasets
    Chen Xiaoyun
    Chen Yi
    Qi Xiaoli
    Yue Min
    He Yanshan
    2009 1ST IEEE SYMPOSIUM ON WEB SOCIETY, PROCEEDINGS, 2009, : 166 - 171