Scalable Kernel K-Means Clustering with Nystrom Approximation: Relative-Error Bounds

被引:0
|
作者
Wang, Shusen [1 ]
Gittens, Alex [2 ]
Mahoney, Michael W. [3 ,4 ]
机构
[1] Stevens Inst Technol, Dept Comp Sci, Hoboken, NJ 07030 USA
[2] Rensselaer Polytech Inst, Comp Sci Dept, Troy, NY 12180 USA
[3] Univ Calif Berkeley, Int Comp Sci Inst, Berkeley, CA 94720 USA
[4] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
kernel k-means clustering; the Nystrom method; randomized linear algebra; RANDOMIZED ALGORITHMS; MATRIX; SEGMENTATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Kernel k-means clustering can correctly identify and extract a far more varied collection of cluster structures than the linear k-means clustering algorithm. However, kernel k-means clustering is computationally expensive when the non-linear feature map is high-dimensional and there are many input points. Kernel approximation, e.g., the Nystrom method, has been applied in previous works to approximately solve kernel learning problems when both of the above conditions are present. This work analyzes the application of this paradigm to kernel k-means clustering, and shows that applying the linear k-means clustering algorithm to k/c(1 + o(1)) features constructed using a so-called rank-restricted Nystrom approximation results in cluster assignments that satisfy a 1+ epsilon approximation ratio in terms of the kernel k-means cost function, relative to the guarantee provided by the same algorithm without the use of the Nystrom method. As part of the analysis, this work establishes a novel 1+ epsilon relative-error trace norm guarantee for low-rank approximation using the rank-restricted Nystrom approximation. Empirical evaluations on the 8 : 1 million instance MNIST8M dataset demonstrate the scalability and usefulness of kernel k-means clustering with Nystrom approximation. This work argues that spectral clustering using Nystrom approximation a popular and computationally efficient, but theoretically unsound approach to non-linear clustering-should be replaced with the efficient and theoretically sound combination of kernel k-means clustering with Nystrom approximation. The superior performance of the latter approach is empirically verified.
引用
收藏
页数:49
相关论文
共 50 条
  • [1] Scalable kernel K-means clustering with nystrom approximation: Relative-error bounds
    Wang, Shusen
    Gittens, Alex
    Mahoney, Michael W.
    [J]. Journal of Machine Learning Research, 2019, 20
  • [2] Kernel K-Means Sampling for Nystrom Approximation
    He, Li
    Zhang, Hong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) : 2108 - 2120
  • [3] Scalable Multiple Kernel k-means Clustering
    Lu, Yihang
    Xin, Haonan
    Wang, Rong
    Nie, Feiping
    Li, Xuelong
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4279 - 4283
  • [4] Beyond the Nystrom Approximation: Speeding up Spectral Clustering using Uniform Sampling and Weighted Kernel k-means
    Mohan, Mahesh
    Monteleoni, Claire
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2494 - 2500
  • [5] Kernel Probabilistic K-Means Clustering
    Liu, Bowen
    Zhang, Ting
    Li, Yujian
    Liu, Zhaoying
    Zhang, Zhilin
    [J]. SENSORS, 2021, 21 (05) : 1 - 16
  • [6] Sparse kernel k-means clustering
    Park, Beomjin
    Park, Changyi
    Hong, Sungchul
    Choi, Hosik
    [J]. JOURNAL OF APPLIED STATISTICS, 2024,
  • [7] Kernel K-Means Low Rank Approximation for Spectral Clustering and Diffusion Maps
    Alaiz, Carlos M.
    Fernandez, Angela
    Gala, Yvonne
    Dorronsoro, Jose R.
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2014, 2014, 8669 : 239 - 246
  • [8] Uniform Deviation Bounds for k-Means Clustering
    Bachem, Olivier
    Lucic, Mario
    Hassani, S. Hamed
    Krause, Andreas
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [9] Approximation of Kernel k-Means for Streaming Data
    Havens, Timothy C.
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 509 - 512
  • [10] Scalable Fast Evolutionary k-means Clustering
    de Oliveira, Gilberto Viana
    Naldi, Murilo Coelho
    [J]. 2015 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2015), 2015, : 74 - 79