A Clustering Algorithm Based on Density-Grid for Stream Data

被引:5
|
作者
Zhang, Dandan [1 ]
Tian, Hui [1 ]
Sang, Yingpeng [1 ]
Li, Yidong [1 ]
Wu, Yanbo [1 ]
Wu, Jun [1 ]
Shen, Hong [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
关键词
Clustering; stream data; density-grid; Index Tree;
D O I
10.1109/PDCAT.2012.13
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many real applications, such as network traffic monitoring, intrusion detection, satellite remote sensing, and electronic business, generate data in the form of a stream arriving continuously at high speed. Clustering is an important data analysis tool for knowledge discovery. Compared with traditional clustering algorithms, clustering stream data is an improtant and challenging problem which has attracted many researchers. Clustering stream data is facing two main challenges. First, as the data is continuously arriving with high rate and the computer storage capacity is limited, raw data can only be scaned in one pass. Second, stream data is always changing with time, so viewing a data stream as a set of static data can deteriorate the clustering quality. In fact, users are more concerned with the evolving behaviors of clusters which can help people making correct decisions. This paper proposes a density-grid based clustering algorithm, PKS-Stream-I, for stream data. It is an optimization of PKS-Stream in density detection period selection, sporadic grid detection and removal. Empirical results show the proposed method yields out better performance.
引用
收藏
页码:398 / 403
页数:6
相关论文
共 50 条
  • [21] A clustering algorithm for data stream based on grid-tree and similarity
    Huang, Guoyan
    Guo, Wenyan
    Ren, Jiadong
    Chen, Lijuan
    [J]. International Journal of Advancements in Computing Technology, 2011, 3 (09) : 17 - 24
  • [22] FGCH: a fast and grid based clustering algorithm for hybrid data stream
    Chen, Jinyin
    Lin, Xiang
    Xuan, Qi
    Xiang, Yun
    [J]. APPLIED INTELLIGENCE, 2019, 49 (04) : 1228 - 1244
  • [23] A Grid and Fractal Dimension-Based Data Stream Clustering Algorithm
    Lin, Guoping
    Chen, Leisong
    [J]. ISISE 2008: INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING, VOL 1, 2008, : 66 - +
  • [24] FGCH: a fast and grid based clustering algorithm for hybrid data stream
    Jinyin Chen
    Xiang Lin
    Qi Xuan
    Yun Xiang
    [J]. Applied Intelligence, 2019, 49 : 1228 - 1244
  • [25] An Adaptive Density Data Stream Clustering Algorithm
    Shifei Ding
    Jian Zhang
    Hongjie Jia
    Jun Qian
    [J]. Cognitive Computation, 2016, 8 : 30 - 38
  • [26] An Adaptive Density Data Stream Clustering Algorithm
    Ding, Shifei
    Zhang, Jian
    Jia, Hongjie
    Qian, Jun
    [J]. COGNITIVE COMPUTATION, 2016, 8 (01) : 30 - 38
  • [27] Data Stream Clustering Algorithm Based on Bucket Density for Intrusion Detection
    Yin, Chunyong
    Xia, Lian
    Wang, Jin
    [J]. ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2018, 474 : 846 - 850
  • [28] A Multi Density-based Clustering Algorithm for Data Stream with Noise
    Amini, Amineh
    Saboohi, Hadi
    Teh, Ying Wah
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 1105 - 1112
  • [29] Data Stream Clustering Based on Grid Coupling
    Zhang, Dong-Yue
    Zhou, Li-Hua
    Wu, Xiang-Yun
    Zhao, Li-Hong
    [J]. Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 667 - 683
  • [30] A Density Grid-based Clustering Algorithm for Uncertain Data Streams
    Tu, Li
    Cui, Peng
    Tang, Keming
    [J]. 2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 347 - +