A similarity measure based on subspace distance for spectral clustering

Cited by: 0
Authors
Naseri, Nadimeh [1]
Eftekhari, Mahdi [2]
Saberi-Movahed, Farid [3]
Radjabalipour, Mehdi [1, 4]
Belanche, Lluis A. [5]
Affiliations
[1] Shahid Bahonar Univ Kerman, Fac Math & Comp, Dept Pure Math, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Dept Comp Engn, Kerman, Iran
[3] Grad Univ Adv Technol, Fac Sci & Modern Technol, Dept Appl Math, Kerman, Iran
[4] Iranian Acad Sci, Tehran, Iran
[5] Univ Politecn Cataluna, Dept Comp Sci, Barcelona, Catalonia, Spain
Keywords
Subspace learning; Similarity learning; Subspace distance; Unsupervised learning; Spectral clustering; ALGORITHM;
DOI
10.1016/j.neucom.2024.129187
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The performance of Spectral Clustering (SC) relies heavily on the choice of similarity matrix used to compute pairwise similarities between data points, especially when handling data distributed across multiple subspaces. Despite the effectiveness of subspace learning methods in identifying clusters within high-dimensional data, their integration into SC is often limited. Specifically, a majority of SC techniques rooted in subspace learning either lack efficient similarity metrics or encounter difficulties in uncovering clusters within datasets that share common subspaces. To address these concerns, this paper introduces a novel similarity metric, termed Similarity Measure based on the Distance of Subspaces (SMDS). The proposed SMDS criterion yields three key advantages. Firstly, SMDS involves identifying the local neighborhood of each sample, which typically exerts a stronger influence than global factors. Secondly, it employs subspace learning, leveraging the fact that estimating small linear subspaces is computationally more tractable than handling larger and more complex ones. Thirdly, it introduces a novel subspace clustering approach by establishing a similarity matrix based on subspace distance. This property effectively addresses the challenges posed by overlapping subspaces and facilitates their merging. The SMDS similarity matrix is then used within SC, leading to the proposal of SC-SMDS, a new method tailored for clustering tasks. The SC-SMDS method is evaluated through various experiments on a number of real-world benchmark datasets, demonstrating its superior performance over several competing clustering methods.
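The abstract gives no formulas, so the following is only an illustrative sketch of the general idea it describes (local neighborhoods, small linear subspaces, a similarity built from subspace distance, then a spectral step). It is not the authors' SMDS definition. Everything specific here is an assumption: the function names, the parameters `k`, `d`, and `sigma`, the use of an uncentered SVD to fit each local subspace, the projection distance between subspaces, the Gaussian kernel, and the two-cluster split by the sign of the second eigenvector.

```python
import numpy as np

def local_bases(X, k=5, d=1):
    """Fit a d-dimensional linear subspace to each sample's k-NN neighborhood."""
    sq = np.sum(X ** 2, axis=1)
    dist2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T  # pairwise squared distances
    bases = []
    for i in range(X.shape[0]):
        idx = np.argsort(dist2[i])[:k + 1]  # the sample plus its k nearest neighbors
        # Uncentered SVD: assumes the subspaces pass through the origin.
        _, _, vt = np.linalg.svd(X[idx], full_matrices=False)
        bases.append(vt[:d].T)  # orthonormal basis, shape (n_features, d)
    return bases

def subspace_similarity(bases, sigma=0.5):
    """Gaussian kernel on the squared projection distance between local subspaces."""
    n, d = len(bases), bases[0].shape[1]
    S = np.eye(n)
    for i in range(n):
        for j in range(i + 1, n):
            # d - ||U_i^T U_j||_F^2 equals the sum of squared sines of the principal angles.
            overlap = np.linalg.norm(bases[i].T @ bases[j], "fro") ** 2
            dist2 = max(d - overlap, 0.0)
            S[i, j] = S[j, i] = np.exp(-dist2 / (2.0 * sigma ** 2))
    return S

def two_way_spectral_split(S):
    """Split into two clusters by the sign of the second eigenvector of the normalized Laplacian."""
    d_inv_sqrt = 1.0 / np.sqrt(S.sum(axis=1))
    L = np.eye(len(S)) - d_inv_sqrt[:, None] * S * d_inv_sqrt[None, :]
    _, vecs = np.linalg.eigh(L)  # eigenvalues in ascending order
    fiedler = vecs[:, 1] * d_inv_sqrt
    return (fiedler > 0).astype(int)

# Toy data: 20 points on each of two orthogonal lines through the origin in R^3.
t = np.linspace(1.0, 2.0, 20)
X = np.vstack([np.outer(t, [1.0, 0.0, 0.0]), np.outer(t, [0.0, 1.0, 0.0])])
S = subspace_similarity(local_bases(X, k=5, d=1))
labels = two_way_spectral_split(S)
```

On this toy data, samples on the same line get a similarity close to 1 and samples on different lines get a much lower one, so the spectral step separates the two lines. A general implementation would replace the sign-based split with k-means on several eigenvectors.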
Pages: 15
Related papers
50 in total
  • [41] An adaptive distance measure for similarity based playlist generation
    Gaertner, D.
    Kraft, F.
    Schaaf, T.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 229 - +
  • [42] Unsupervised feature selection based on adaptive similarity learning and subspace clustering
    Parsa, Mohsen Ghassemi
    Zare, Hadi
    Ghatee, Mehdi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 95 (95)
  • [43] Fast Spectral Clustering with Landmark-based Subspace Iteration
    Gan, Zejun
    Sha, Chaofeng
    Niu, Junyu
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 773 - 779
  • [44] Ensemble dimension reduction based on spectral disturbance for subspace clustering
    Chen, Xiaoyun
    Wang, Qiaoping
    Zhuang, Shanshan
    KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [45] DISTANCE AS A MEASURE OF TAXONOMIC SIMILARITY
    SOKAL, RR
    SYSTEMATIC ZOOLOGY, 1961, 10 (02): : 70 - 79
  • [46] Robust Spectral Subspace Clustering Based on Least Square Regression
    Wu, Zongze
    Yin, Ming
    Zhou, Yajing
    Fang, Xiaozhao
    Xie, Shengli
    NEURAL PROCESSING LETTERS, 2018, 48 (03) : 1359 - 1372
  • [47] Robust Spectral Subspace Clustering Based on Least Square Regression
    Zongze Wu
    Ming Yin
    Yajing Zhou
    Xiaozhao Fang
    Shengli Xie
    Neural Processing Letters, 2018, 48 : 1359 - 1372
  • [48] CUR Decompositions, Similarity Matrices, and Subspace Clustering
    Aldroubi, Akram
    Hamm, Keaton
    Koku, Ahmet Bugra
    Sekmen, Ali
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2019, 4
  • [49] A fast algorithm for subspace clustering by pattern similarity
    Wang, HX
    Chu, F
    Fan, W
    Yu, PS
    Pei, J
    16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2004, : 51 - 60
  • [50] Similarity Measures of Pythagorean Fuzzy Sets Based on Combination of Cosine Similarity Measure and Euclidean Distance Measure
    Mohd, Wan Rosanisah Wan
    Abdullah, Lazim
    PROCEEDING OF THE 25TH NATIONAL SYMPOSIUM ON MATHEMATICAL SCIENCES (SKSM25): MATHEMATICAL SCIENCES AS THE CORE OF INTELLECTUAL EXCELLENCE, 2018, 1974