A Density Grid-based Clustering Algorithm for Uncertain Data Streams

被引:2
|
作者
Tu, Li [1 ]
Cui, Peng [1 ]
Tang, Keming [2 ]
机构
[1] Jiangyin Polytech Coll, Dept Comp Sci, Jiangsu Engn R&D Ctr Informat Fus Software, Jiangyin, Peoples R China
[2] Yancheng Teachers Univ, Coll Informat Sci & Technol, Yancheng, Peoples R China
关键词
clustering; uncertain stream; probability center; grid;
D O I
10.1109/WISA.2013.71
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a grid-based clustering algorithm Clu-US which is competent to find clusters of non-convex shapes on uncertain data stream. Clu-US maps the uncertain data tuples to the grid space which could store and update the summary information of stream. The uncertainty of data is taken into account for calculating the probability center of a grid. Then, the distance between the probability centers of two adjacent grids is adopted for measuring whether they are "close enough" in grids merging process. Furthermore, a dynamic outlier deletion mechanism is developed to improve clustering performance. The experimental results show that Clu-US outperforms other algorithms in terms of clustering quality and speed.
引用
收藏
页码:347 / +
页数:2
相关论文
共 50 条
  • [31] An improved algorithm for clustering uncertain traffic data streams based on Hadoop platform
    Xu, Weixiang
    Li, Jiaojiao
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 2019, 33 (19):
  • [32] An incremental irregular grid algorithm for clustering data streams
    College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
    [J]. Harbin Gongcheng Daxue Xuebao, 2008, 8 (846-850):
  • [33] A grid-based clustering algorithm for wild bird distribution
    Wang, Yuwei
    Zhou, Yuanchun
    Liu, Ying
    Luo, Ze
    Guo, Danhuai
    Shao, Jing
    Tan, Fei
    Wu, Liang
    Li, Jianhui
    Yan, Baoping
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2013, 7 (04) : 475 - 485
  • [34] A grid-based clustering algorithm for wild bird distribution
    Yuwei WANG
    Yuanchun ZHOU
    Ying LIU
    Ze LUO
    Danhuai GUO
    Jing SHAO
    Fei TAN
    Liang WU
    Jianhui LI
    Baoping YAN
    [J]. Frontiers of Computer Science, 2013, 7 (04) : 475 - 485
  • [35] Grid-based clustering algorithm using fractal dimension
    Xiong, Xiao
    Zhang, Jie
    [J]. Journal of Information and Computational Science, 2007, 4 (03): : 997 - 1002
  • [36] A grid-based clustering algorithm for wild bird distribution
    Yuwei Wang
    Yuanchun Zhou
    Ying Liu
    Ze Luo
    Danhuai Guo
    Jing Shao
    Fei Tan
    Liang Wu
    Jianhui Li
    Baoping Yan
    [J]. Frontiers of Computer Science, 2013, 7 : 475 - 485
  • [37] A grid-based clustering algorithm for network anomaly detection
    Wei, Xiaotao
    Huang, Houkuan
    Tian, Shengfeng
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL SYMPOSIUM ON DATA, PRIVACY, AND E-COMMERCE, 2007, : 104 - +
  • [38] A grid-based clustering algorithm with referential value of parameters
    Yantao, Zhou
    Xingdong, Yi
    Zhengguo, Wu
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2007, : 210 - 214
  • [39] A Density Granularity Grid Clustering Algorithm Based on Data Stream
    Wang, Li-fang
    Han, Xie
    [J]. EMERGING RESEARCH IN WEB INFORMATION SYSTEMS AND MINING, 2011, 238 : 113 - 120
  • [40] A Clustering Algorithm Based on Density-Grid for Stream Data
    Zhang, Dandan
    Tian, Hui
    Sang, Yingpeng
    Li, Yidong
    Wu, Yanbo
    Wu, Jun
    Shen, Hong
    [J]. 2012 13TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS, AND TECHNOLOGIES (PDCAT 2012), 2012, : 398 - 403