Improving Spectral Clustering Using the Asymptotic Value of the Normalized Cut

被引:5
|
作者
Hofnneyr, David P. [1 ]
机构
[1] Stellenbosch Univ, Dept Stat & Actuarial Sci, Cnr Bosman & Victoria St, ZA-7602 Stellenbosch, South Africa
关键词
Cluster number determination; Low density separation; Self-tuning clustering;
D O I
10.1080/10618600.2019.1593180
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Spectral clustering (SC) is a popular and versatile clustering method based on a relaxation of the normalized graph cut objective. Despite its popularity, selecting the number of clusters and tuning the important scaling parameter remain challenging problems in practical applications of SC. Popular heuristics have been proposed, but corresponding theoretical results are scarce. In this article, we investigate the asymptotic value of the normalized cut for an increasing sample assumed to arise from an underlying probability distribution. Based on this, we find strong connections between spectral and density clustering. This enables us to provide recommendations for selecting the number of clusters and setting the scaling parameter in a data driven manner. An algorithm inspired by these recommendations is proposed, whichwe have found to exhibit strong performance in a range of applied domains. AnRimplementation of the algorithm is available from https://github. com/DavidHofmeyr/spuds. Supplementary materials for this article are available online.
引用
收藏
页码:980 / 992
页数:13
相关论文
共 50 条
  • [21] Relevant Gene Selection Using Normalized Cut Clustering with Maximal Compression Similarity Measure
    Bala, Rajni
    Agrawal, R. K.
    Sardana, Manju
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PROCEEDINGS, 2010, 6119 : 81 - +
  • [22] VEHICLE ORIENTATION ANALYSIS USING EIGEN COLOR, EDGE MAP, AND NORMALIZED CUT CLUSTERING
    Wu, Jui-Chen
    Hsieh, Jun-Wei
    Chen, Sin-Yu
    Tu, Cheng-Min
    Chen, Yung-Sheng
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2010, 24 (05) : 823 - 846
  • [23] Quadratic Problem Formulation with Linear Constraints for Normalized Cut Clustering
    Peluffo-Ordonez, D. H.
    Castro-Hoyos, C.
    Acosta-Medina, Carlos D.
    Castellanos-Dominguez, German
    PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 408 - 415
  • [24] On the convergence of spectral clustering on random samples: The normalized case
    von Luxburg, U
    Bousquet, O
    Belkin, M
    LEARNING THEORY, PROCEEDINGS, 2004, 3120 : 457 - 471
  • [25] Improving Spectral Clustering Using Path-Based Connectivity
    Guzel, Kadir
    Kursun, Olcay
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 2110 - 2113
  • [26] Sonar image spectral matting segmentation based on normalized cut
    Liu, Guangyu
    Bian, Hongyu
    Shen, Zhengyan
    Shi, Hong
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2012, 33 (03): : 308 - 312
  • [27] Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap
    Park, Tae Jin
    Han, Kyu J.
    Kumar, Manoj
    Narayanan, Shrikanth
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 381 - 385
  • [28] Asymptotic Gaussian Fluctuations of Eigenvectors in Spectral Clustering
    Lebeau, Hugo
    Chatelain, Florent
    Couillet, Romain
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1920 - 1924
  • [29] ASYMPTOTIC GAUSSIAN FLUCTUATIONS OF SPECTRAL CLUSTERING EIGENVECTORS
    Kadavankandy, Arun
    Couilleet, Romain
    2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 694 - 698
  • [30] A VNS heuristic for escaping local extrema entrapment in normalized cut clustering
    Hansen, Pierre
    Ruiz, Manuel
    Aloise, Daniel
    PATTERN RECOGNITION, 2012, 45 (12) : 4337 - 4345