ASCRClu: an adaptive subspace combination and reduction algorithm for clustering of high-dimensional data

被引:5
|
作者
Fatehi, Kavan [1 ]
Rezvani, Mohsen [2 ]
Fateh, Mansoor [2 ]
机构
[1] Yazd Univ, Yazd, Iran
[2] Shahrood Univ Technol, Shahrood, Iran
关键词
High-dimensional data; Subspace clustering; Cluster similarity; DENSITY;
D O I
10.1007/s10044-020-00884-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The curse of dimensionality in high-dimensional data is one of the major challenges in data clustering. Recently, a considerable amount of literature has been published on subspace clustering to address this challenge. The main objective of the subspace clustering is to discover clusters embedded in any possible combination of the attributes. Previous studies have mostly been generating redundant subspace clusters, leading to clustering accuracy loss and also increasing the running time. In this paper, a bottom-up density-based approach is proposed for clustering of high-dimensional data. We employ the cluster structure as a similarity measure to generate the optimal subspaces which result in raising the accuracy of the subspace clustering. Using this idea, we propose an iterative algorithm to discover similar subspaces using the similarity in the features of subspaces. At each iteration of this algorithm, it first determines similar subspaces, then combines them to generate higher-dimensional subspaces, and finally re-clusters the subspaces. The algorithm repeats these steps and converges to the final clusters. Experiments on various synthetic and real datasets show that the results of the proposed approach are significantly better in both quality and runtime comparing to the state of the art on clustering high-dimensional data. The accuracy of the proposed method is around 34% higher than the CLIQUE algorithm and around 6% higher than DiSH.
引用
收藏
页码:1651 / 1663
页数:13
相关论文
共 50 条
  • [21] A meta-heuristic density-based subspace clustering algorithm for high-dimensional data
    Parul Agarwal
    Shikha Mehta
    Ajith Abraham
    Soft Computing, 2021, 25 : 10237 - 10256
  • [22] A meta-heuristic density-based subspace clustering algorithm for high-dimensional data
    Agarwal, Parul
    Mehta, Shikha
    Abraham, Ajith
    SOFT COMPUTING, 2021, 25 (15) : 10237 - 10256
  • [23] An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data
    Jing, Liping
    Ng, Michael K.
    Huang, Joshua Zhexue
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (08) : 1026 - 1041
  • [24] Adaptive Hybridized Meta-Heuristic Algorithm for Subspace Clustering on High Dimensional Data
    Kumar, Pradeep D.
    Sowmya, B. J.
    Kanavalli, Anita
    Amaresh, T. N.
    Anish, S.
    Nadgir, Chinmay S.
    Nischal, Gagan A.
    Supreeth, S.
    Shruthi, G.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (05) : 1103 - 1116
  • [25] Adaptive Clustering for Outlier Identification in High-Dimensional Data
    Thudumu, Srikanth
    Branch, Philip
    Jin, Jiong
    Singh, Jugdutt
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2019, PT II, 2020, 11945 : 215 - 228
  • [26] An entropy weighting mixture model for subspace clustering of high-dimensional data
    Peng, Liuqing
    Zhang, Junying
    PATTERN RECOGNITION LETTERS, 2011, 32 (08) : 1154 - 1161
  • [27] Dynamic Sparse Subspace Clustering for Evolving High-Dimensional Data Streams
    Sui, Jinping
    Liu, Zhen
    Liu, Li
    Jung, Alexander
    Li, Xiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 4173 - 4186
  • [28] Subspace Clustering for High-Dimensional Data Using Cluster Structure Similarity
    Fatehi, Kavan
    Rezvani, Mohsen
    Fateh, Mansoor
    Pajoohan, Mohammad-Reza
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2018, 14 (03) : 38 - 55
  • [29] Accelerating Density-Based Subspace Clustering in High-Dimensional Data
    Prinzbach, Juergen
    Lauer, Tobias
    Kiefer, Nicolas
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 474 - 481
  • [30] Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
    Cai, Xiaosha
    Huang, Dong
    Wang, Chang-Dong
    Kwoh, Chee-Keong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 330 - 342