Incremental subspace clustering over multiple data streams

被引:4
|
作者
Zhang, Qi [1 ]
Liu, Jinze [1 ]
Wang, Wei [1 ]
机构
[1] Univ N Carolina, Chapel Hill, NC 27599 USA
关键词
D O I
10.1109/ICDM.2007.100
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data streams are often locally correlated, with a subset of streams exhibiting coherent patterns over a subset of time points. Subspace clustering can discover clusters of objects in different subspaces. However, traditional subspace clustering algorithms for static data sets are not readily used for incremental clustering, and is very expensive for frequent re-clustering over dynamically changing stream data. In this paper, we present an efficient incremental subspace clustering algorithm for multiple streams over sliding windows. Our algorithm detects all the delta-CC-Clusters, which capture the coherent changing patterns among a set of streams over a set of time points. delta-CC-Clusters are incrementally generated by traversing a directed acyclic graph pDAG. We propose efficient insertion and deletion operations to update the pDAG dynamically. In addition, effective pruning techniques are applied to reduce the search space. Experiments on real data sets demonstrate the performance of our algorithm.
引用
收藏
页码:727 / 732
页数:6
相关论文
共 50 条
  • [1] Efficient incremental subspace clustering in data streams
    Kontaki, Maria
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    [J]. 10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 53 - 60
  • [2] Subspace Clustering and Visualization of Data Streams
    Louhi, Ibrahim
    Boudjeloud-Assala, Lydia
    Tamisier, Thomas
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 3, 2017, : 259 - 265
  • [3] SPARSE SUBSPACE CLUSTERING FOR EVOLVING DATA STREAMS
    Sui, Jinping
    Liu, Zhen
    Liu, Li
    Jung, Alexander
    Liu, Tianpeng
    Peng, Bo
    Li, Xiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7455 - 7459
  • [4] Subspace clustering of high dimensional data streams
    Wang, Shuyun
    Fan, Yingjie
    Zhang, Chenghong
    Xu, HeXiang
    Hao, Xiulan
    Hu, Yunfa
    [J]. 7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 165 - +
  • [5] Combining Multiple Interrelated Streams for Incremental Clustering
    Siddiqui, Zaigham Faraz
    Spiliopoulou, Myra
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2009, 5566 : 535 - 552
  • [6] Incremental density-based ensemble clustering over evolving data streams
    Khan, Imran
    Huang, Joshua Z.
    Ivanov, Kamen
    [J]. NEUROCOMPUTING, 2016, 191 : 34 - 43
  • [7] Efficient discovering and maintenance algorithm of subspace clustering over high dimensional data streams
    Department of Computer Science and Engineering, Southeast University, Nanjing 210096, China
    [J]. Jisuanji Yanjiu yu Fazhan, 2006, 5 (834-840):
  • [8] Clustering Multiple Data Streams
    Balzanella, Antonio
    Lechevallier, Yves
    Verde, Rosanna
    [J]. NEW PERSPECTIVES IN STATISTICAL MODELING AND DATA ANALYSIS, 2011, : 247 - 254
  • [9] An incremental irregular grid algorithm for clustering data streams
    College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
    [J]. Harbin Gongcheng Daxue Xuebao, 2008, 8 (846-850):
  • [10] Clustering on demand for multiple data streams
    Dai, BR
    Huang, JW
    Yeh, MY
    Chen, MS
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 367 - 370