DBSC: A dependency-based subspace clustering algorithm for high dimensional numerical datasets

被引:0
|
作者
Wang, Xufei [1 ]
Li, Chunping [1 ]
机构
[1] Tsinghua Univ, Sch Software, China MOE Key Lab Informat Syst Secur, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel algorithm called DBSC, which finds subspace clusters in numerical datasets based on the concept of "dependency". This algorithm uses a depth-first search strategy to find out the maximal subspaces: a new dimension is added to current k-subspace and its validity as a (k+1)-subspace is evaluated. The clusters within those maximal subspaces are mined in a similar fashion as maximal subspace mining does. With the experiments on synthetic and real datasets, our algorithm is shown to be both effective and efficient for high dimensional datasets.
引用
收藏
页码:832 / 837
页数:6
相关论文
共 50 条
  • [31] Clustering High-Dimensional Data: A Survey on Subspace Clustering, Pattern-Based Clustering, and Correlation Clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Zimek, Arthur
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (01)
  • [32] Clustering high dimensional massive scientific datasets
    Otoo, EJ
    Shoshani, A
    Hwang, S
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2001, : 147 - 157
  • [33] ASCRClu: an adaptive subspace combination and reduction algorithm for clustering of high-dimensional data
    Kavan Fatehi
    Mohsen Rezvani
    Mansoor Fateh
    [J]. Pattern Analysis and Applications, 2020, 23 : 1651 - 1663
  • [34] ASCRClu: an adaptive subspace combination and reduction algorithm for clustering of high-dimensional data
    Fatehi, Kavan
    Rezvani, Mohsen
    Fateh, Mansoor
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (04) : 1651 - 1663
  • [35] Adaptive Hybridized Meta-Heuristic Algorithm for Subspace Clustering on High Dimensional Data
    Kumar, Pradeep D.
    Sowmya, B. J.
    Kanavalli, Anita
    Amaresh, T. N.
    Anish, S.
    Nadgir, Chinmay S.
    Nischal, Gagan A.
    Supreeth, S.
    Shruthi, G.
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (05) : 1103 - 1116
  • [36] Clustering high dimensional massive scientific datasets
    Otoo, EJ
    Shoshani, A
    Hwang, SW
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2001, 17 (2-3) : 147 - 168
  • [37] Clustering High Dimensional Massive Scientific Datasets
    Ekow J. Otoo
    Arie Shoshani
    Seung-Won Hwang
    [J]. Journal of Intelligent Information Systems, 2001, 17 : 147 - 168
  • [38] A modified relationship based clustering framework for density based clustering and outlier filtering on high dimensional datasets
    Bilgin, Turgay Tugay
    Camurcu, A. Yilmaz
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 409 - +
  • [39] An Efficient Density Biased Sampling Algorithm for Clustering Large High-Dimensional Datasets
    Qian, Xue-Zhong
    Deng, Jie
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (08)
  • [40] Clustering Based - A FAST Algorithm on High Dimensional Data
    Kadam, Sonali P.
    Varsha, Naikwadi S.
    Kaveree, Belamkar S.
    Aruna, Andhare S.
    Mayuri, Mohite M.
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 485 - 489