A Clustering Algorithm Based on Density-Grid for Stream Data

Cited by: 5
Authors
Zhang, Dandan [1]
Tian, Hui [1]
Sang, Yingpeng [1]
Li, Yidong [1]
Wu, Yanbo [1]
Wu, Jun [1]
Shen, Hong [1]
Affiliations
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
Keywords
Clustering; stream data; density-grid; index tree
DOI
10.1109/PDCAT.2012.13
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Many real applications, such as network traffic monitoring, intrusion detection, satellite remote sensing, and electronic business, generate data as a stream that arrives continuously at high speed. Clustering is an important data analysis tool for knowledge discovery. Compared with traditional clustering, clustering stream data is an important and challenging problem that has attracted many researchers. It faces two main challenges. First, because the data arrives continuously at a high rate and computer storage capacity is limited, the raw data can be scanned only once. Second, stream data changes over time, so treating a data stream as a static data set can degrade clustering quality. In fact, users are more concerned with the evolving behavior of clusters, which can help them make correct decisions. This paper proposes a density-grid-based clustering algorithm for stream data, PKS-Stream-I. It optimizes PKS-Stream in the selection of the density detection period and in the detection and removal of sporadic grids. Empirical results show that the proposed method yields better performance.
Pages: 398-403
Page count: 6