Scalable Spectral Clustering With Nystrom Approximation: Practical and Theoretical Aspects

Cited by: 7
Authors
Pourkamali-Anaraki, Farhad [1]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Lowell, MA 01854 USA
Keywords
Approximation methods; clustering algorithms; computational complexity; sampling methods; MATRIX; ALGORITHMS
DOI
10.1109/OJSP.2020.3039330
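Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract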
Spectral clustering techniques are valuable tools in signal processing and machine learning for partitioning complex data sets. The effectiveness of spectral clustering stems from constructing a non-linear embedding based on creating a similarity graph and computing the spectral decomposition of the Laplacian matrix. However, spectral clustering methods fail to scale to large data sets because of high computational cost and memory usage. A popular approach for addressing these problems utilizes the Nystrom method, an efficient sampling-based algorithm for computing low-rank approximations to large positive semi-definite matrices. This paper demonstrates how the previously popular approach of Nystrom-based spectral clustering has severe limitations. Existing time-efficient methods ignore critical information by prematurely reducing the rank of the similarity matrix associated with sampled points. Also, current understanding is limited regarding how utilizing the Nystrom approximation will affect the quality of spectral embedding approximations. To address the limitations, this work presents a principled spectral clustering algorithm that exploits spectral properties of the similarity matrix associated with sampled points to regulate accuracy-efficiency trade-offs. We provide theoretical results to reduce the current gap and present numerical experiments with real and synthetic data. Empirical results demonstrate the efficacy and efficiency of the proposed method compared to existing spectral clustering techniques based on the Nystrom method and other efficient methods. The overarching goal of this work is to provide an improved baseline for future research directions to accelerate spectral clustering.
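To make the pipeline described in the abstract concrete, here is a minimal sketch of a generic Nystrom-based spectral embedding. It is not the algorithm proposed in the paper. The Gaussian similarity, uniform landmark sampling, the eigenvalue cutoff, and the names `gaussian_kernel`, `nystrom_factor`, and `approx_spectral_embedding` are all assumptions chosen for illustration.

```python
import numpy as np

def gaussian_kernel(A, B, sigma=1.0):
    """Pairwise Gaussian similarities exp(-||a - b||^2 / (2 sigma^2))."""
    sq = (A**2).sum(axis=1)[:, None] + (B**2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * sigma**2))

def nystrom_factor(X, m, sigma=1.0, seed=0):
    """Return G such that the n x n similarity matrix K is approximated by G @ G.T.

    Assumption: m landmarks are sampled uniformly without replacement.
    """
    rng = np.random.default_rng(seed)
    idx = rng.choice(X.shape[0], size=m, replace=False)
    C = gaussian_kernel(X, X[idx], sigma)   # n x m: similarities to landmarks
    W = C[idx]                              # m x m: similarities among landmarks
    evals, evecs = np.linalg.eigh(W)
    # Drop numerically negligible eigenvalues before inverting (pseudo-inverse of W).
    keep = evals > 1e-10 * evals.max()
    return C @ (evecs[:, keep] / np.sqrt(evals[keep]))

def approx_spectral_embedding(G, k):
    """Top-k eigenvectors of D^{-1/2} (G G^T) D^{-1/2} without forming the n x n matrix."""
    d = G @ (G.T @ np.ones(G.shape[0]))     # approximate degrees (row sums of G G^T)
    d = np.maximum(d, 1e-12)                # guard: approximate degrees can be tiny or negative
    M = G / np.sqrt(d)[:, None]             # normalized similarity is approximately M @ M.T
    U, _, _ = np.linalg.svd(M, full_matrices=False)
    return U[:, :k]

# Toy data (assumption): two well-separated 2-D blobs of 30 points each.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.3, (30, 2)), rng.normal(10.0, 0.3, (30, 2))])
G = nystrom_factor(X, m=20)
U = approx_spectral_embedding(G, k=2)
Z = U / np.linalg.norm(U, axis=1, keepdims=True)  # row-normalized embedding for k-means
```

The cost is dominated by the n x m kernel block and a thin SVD, rather than an eigendecomposition of the full n x n matrix. The rows of `Z` would typically be passed to k-means to obtain cluster labels.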
Pages: 242-256
Page count: 15