Generalized projected clustering in high-dimensional data streams

被引:0
|
作者
Wang, T [1 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1W5, Canada
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Consider the problem of identifying dense subgroups of data points exhibiting strong correlations in data stream. Such correlation connected clusters are meaningful in many applications. However, the inherent sparsity of high-dimensional space means that the correlations are local for specific subspace, and moreover, the correlation itself can be of arbitrarily complex direction, which blinds most traditional methods. We present ACID, a framework that can effectively detect correlation connected clusters in high dimensional stream. It has high scalability on both the size of stream and the dimension of data, and is robust against noise. Experiments on synthetic and real datasets are done to show its effectiveness and efficiency.
引用
收藏
页码:772 / 778
页数:7
相关论文
共 50 条
  • [1] On high dimensional projected clustering of data streams
    Aggarwal, CC
    Han, JW
    Wang, JY
    Yu, PS
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 10 (03) : 251 - 273
  • [2] On High Dimensional Projected Clustering of Data Streams
    Charu C. Aggarwal
    Jiawei Han
    Jianyong Wang
    Philip S. Yu
    [J]. Data Mining and Knowledge Discovery, 2005, 10 : 251 - 273
  • [3] Detecting Projected Outliers in High-Dimensional Data Streams
    Zhang, Ji
    Gao, Qigang
    Wang, Hai
    Liu, Qing
    Xu, Kai
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 629 - +
  • [4] On High Dimensional Projected Clustering of Uncertain Data Streams
    Aggarwal, Charu C.
    [J]. ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1152 - 1154
  • [5] Parallel Clustering of High-Dimensional Social Media Data Streams
    Gao, Xiaoming
    Ferrara, Emilio
    Qiu, Judy
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 323 - 332
  • [6] SPOT: A system for detecting projected outliers from high-dimensional data streams
    Zhang, Ji
    Gao, Qigang
    Wang, Hai
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1628 - +
  • [7] Dynamic Sparse Subspace Clustering for Evolving High-Dimensional Data Streams
    Sui, Jinping
    Liu, Zhen
    Liu, Li
    Jung, Alexander
    Li, Xiang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 4173 - 4186
  • [8] A grid-based clustering algorithm for high-dimensional data streams
    Lu, YS
    Sun, YF
    Xu, GP
    Liu, G
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 824 - 831
  • [9] NGPCA: Clustering of high-dimensional and non-stationary data streams
    Migenda, Nico
    Moeller, Ralf
    Schenck, Wolfram
    [J]. SOFTWARE IMPACTS, 2024, 20
  • [10] Subspace Clustering in High-Dimensional Data Streams: A Systematic Literature Review
    Ghani, Nur Laila Ab
    Aziz, Izzatdin Abdul
    AbdulKadir, Said Jadid
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (02): : 4649 - 4668