DBSC: A dependency-based subspace clustering algorithm for high dimensional numerical datasets

被引:0
|
作者
Wang, Xufei [1 ]
Li, Chunping [1 ]
机构
[1] Tsinghua Univ, Sch Software, China MOE Key Lab Informat Syst Secur, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel algorithm called DBSC, which finds subspace clusters in numerical datasets based on the concept of "dependency". This algorithm uses a depth-first search strategy to find out the maximal subspaces: a new dimension is added to current k-subspace and its validity as a (k+1)-subspace is evaluated. The clusters within those maximal subspaces are mined in a similar fashion as maximal subspace mining does. With the experiments on synthetic and real datasets, our algorithm is shown to be both effective and efficient for high dimensional datasets.
引用
收藏
页码:832 / 837
页数:6
相关论文
共 50 条
  • [21] DPM: Fast and scalable Clustering Algorithm for Large Scale High Dimensional Datasets
    Ghanem, Tamer F.
    Elkilani, Wail S.
    Ahmed, Hatem S.
    Hadhoud, Mohiy M.
    [J]. 2014 10TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2014, : 26 - 35
  • [22] DPM: Fast and scalable Clustering Algorithm for Large Scale High Dimensional Datasets
    Ghanem, Tamer F.
    Elkilani, Wail S.
    Ahmed, Hatem S.
    Hadhoud, Mohiy M.
    [J]. 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2014, : 71 - 79
  • [23] High-Dimensional Text Datasets Clustering Algorithm Based on Cuckoo Search and Latent Semantic Indexing
    Boushaki, Saida Ishak
    Kamel, Nadjet
    Bendjeghaba, Omar
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2018, 17 (03)
  • [24] Feature-Gathering Dependency-Based Software Clustering Using Dedication and Modularity
    Kobayashi, Kenichi
    Kamimura, Manabu
    Kato, Koki
    Yano, Keisuke
    Matsuo, Akihiko
    [J]. 2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2012, : 462 - 471
  • [25] A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets
    Ahmad, Amir
    Dey, Lipika
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (07) : 1062 - 1069
  • [26] A rough set based subspace clustering technique for high dimensional data
    Lakshmi, B. Jaya
    Shashi, M.
    Madhuri, K. B.
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (03) : 329 - 334
  • [27] Automatic Subspace Clustering of High Dimensional Data
    Rakesh Agrawal
    Johannes Gehrke
    Dimitrios Gunopulos
    Prabhakar Raghavan
    [J]. Data Mining and Knowledge Discovery, 2005, 11 : 5 - 33
  • [28] Automatic subspace clustering of high dimensional data
    Agrawal, R
    Gehrke, J
    Gunopulos, D
    Raghavan, P
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 11 (01) : 5 - 33
  • [29] Subspace clustering of high dimensional data streams
    Wang, Shuyun
    Fan, Yingjie
    Zhang, Chenghong
    Xu, HeXiang
    Hao, Xiulan
    Hu, Yunfa
    [J]. 7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 165 - +
  • [30] A Survey on High-Dimensional Subspace Clustering
    Qu, Wentao
    Xiu, Xianchao
    Chen, Huangyue
    Kong, Lingchen
    [J]. MATHEMATICS, 2023, 11 (02)