A grid-based subspace clustering algorithm for high-dimensional data streams

被引:0
|
作者
Sun, Yufen [1 ]
Lu, Yansheng [1 ]
机构
[1] Huazhong Univ Sci & Technol, Coll Comp Sci & Technol, Wuhan 430074, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications require the clustering of high-dimensional data streams. We propose a subspace clustering algorithm that can find clusters in different subspaces through one pass over a data stream. The algorithm combines the bottom-up grid-based method and top-down grid-based method. A uniformly partitioned grid data structure is used to summarize the data stream online. The top-down grid partition method is used o find the subspaces in which clusters locate. The errors made by the top-down partition procedure are eliminated by a mergence step in our algorithm. Our performance study with real datasets and synthetic dataset demonstrates the efficiency and effectiveness of our proposed algorithm.
引用
收藏
页码:37 / 48
页数:12
相关论文
共 50 条
  • [1] A grid-based clustering algorithm for high-dimensional data streams
    Lu, YS
    Sun, YF
    Xu, GP
    Liu, G
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 824 - 831
  • [2] GACH: a grid-based algorithm for hierarchical clustering of high-dimensional data
    Mansoori, Eghbal G.
    SOFT COMPUTING, 2014, 18 (05) : 905 - 922
  • [3] GACH: a grid-based algorithm for hierarchical clustering of high-dimensional data
    Eghbal G. Mansoori
    Soft Computing, 2014, 18 : 905 - 922
  • [4] Evolutionary Subspace Clustering Algorithm for High-Dimensional Data
    Nourashrafeddin, S. N.
    Arnold, Dirk V.
    Milios, Evangelos
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION COMPANION (GECCO'12), 2012, : 1497 - 1498
  • [5] A Density Grid-based Clustering Algorithm for Uncertain Data Streams
    Tu, Li
    Cui, Peng
    Tang, Keming
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 347 - +
  • [6] Dynamic Sparse Subspace Clustering for Evolving High-Dimensional Data Streams
    Sui, Jinping
    Liu, Zhen
    Liu, Li
    Jung, Alexander
    Li, Xiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 4173 - 4186
  • [7] Subspace Clustering in High-Dimensional Data Streams: A Systematic Literature Review
    Ghani, Nur Laila Ab
    Aziz, Izzatdin Abdul
    AbdulKadir, Said Jadid
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (02): : 4649 - 4668
  • [8] A Fast Subspace Partition Clustering Algorithm for High Dimensional Data Streams
    Zhang, Zhongping
    Wang, Hao
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1, 2009, : 491 - 495
  • [9] A fast subspace partition clustering algorithm for high dimensional data streams
    College of Information Science and Engineering, Yanshan University, Qinhuangdao City, China
    Proc. - IEEE Int. Conf. Intelligent Comput. Intelligent Syst., ICIS, 1600, (491-495):
  • [10] Subspace clustering of high dimensional data streams
    Wang, Shuyun
    Fan, Yingjie
    Zhang, Chenghong
    Xu, HeXiang
    Hao, Xiulan
    Hu, Yunfa
    7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 165 - +