Spectral clustering with linear embedding: A discrete clustering method for large-scale data

被引:2
|
作者
Gao, Chenhui [1 ]
Chen, Wenzhi [1 ]
Nie, Feiping [2 ]
Yu, Weizhong [2 ]
Wang, Zonghui [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Northwestern Polytech Univ, Xian 710072, Peoples R China
关键词
Spectral clustering; Graph embedding; Unsupervised learning;
D O I
10.1016/j.patcog.2024.110396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent decades, spectral clustering has found widespread applications in various real -world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out -of -sample data. SCLE initiates the process with balanced and hierarchical K -means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Granular-ball-based Fast Spectral Embedding Clustering Algorithm for Large-Scale Data
    Liu, Shushu
    Cheng, Dongdong
    Xie, Jiang
    ACM International Conference Proceeding Series, : 16 - 20
  • [2] Large-Scale Clustering through Functional Embedding
    Ratle, Frederic
    Weston, Jason
    Miller, Matthew L.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 266 - +
  • [3] LSEC: Large-scale spectral ensemble clustering
    Li, Hongmin
    Ye, Xiucai
    Imakura, Akira
    Sakurai, Tetsuya
    INTELLIGENT DATA ANALYSIS, 2023, 27 (01) : 59 - 77
  • [4] Large-scale parallel data clustering
    Judd, D
    McKinley, PK
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (08) : 871 - 876
  • [5] A study of large-scale data clustering based on fuzzy clustering
    Li, Yangyang
    Yang, Guoli
    He, Haiyang
    Jiao, Licheng
    Shang, Ronghua
    SOFT COMPUTING, 2016, 20 (08) : 3231 - 3242
  • [6] A study of large-scale data clustering based on fuzzy clustering
    Yangyang Li
    Guoli Yang
    Haiyang He
    Licheng Jiao
    Ronghua Shang
    Soft Computing, 2016, 20 : 3231 - 3242
  • [7] Compressed constrained spectral clustering framework for large-scale data sets
    Liu, Wenfen
    Ye, Mao
    Wei, Jianghong
    Hu, Xuexian
    KNOWLEDGE-BASED SYSTEMS, 2017, 135 : 77 - 88
  • [8] Spectral Clustering of Large-scale Data by Directly Solving Normalized Cut
    Chen, Xiaojun
    Hong, Weijun
    Nie, Feiping
    He, Dan
    Yang, Min
    Huang, Joshua Zhexue
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1206 - 1215
  • [9] Sparse kernel spectral clustering models for large-scale data analysis
    Alzate, Carlos
    Suykens, Johan A. K.
    NEUROCOMPUTING, 2011, 74 (09) : 1382 - 1390
  • [10] A Data Cleansing Method for Clustering Large-Scale Transaction Databases
    Loh, Woong-Kee
    Moon, Yang-Sae
    Kang, Jun-Gyu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (11) : 3120 - 3123