Improving Spectral Clustering Using the Asymptotic Value of the Normalized Cut

被引:5
|
作者
Hofnneyr, David P. [1 ]
机构
[1] Stellenbosch Univ, Dept Stat & Actuarial Sci, Cnr Bosman & Victoria St, ZA-7602 Stellenbosch, South Africa
关键词
Cluster number determination; Low density separation; Self-tuning clustering;
D O I
10.1080/10618600.2019.1593180
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Spectral clustering (SC) is a popular and versatile clustering method based on a relaxation of the normalized graph cut objective. Despite its popularity, selecting the number of clusters and tuning the important scaling parameter remain challenging problems in practical applications of SC. Popular heuristics have been proposed, but corresponding theoretical results are scarce. In this article, we investigate the asymptotic value of the normalized cut for an increasing sample assumed to arise from an underlying probability distribution. Based on this, we find strong connections between spectral and density clustering. This enables us to provide recommendations for selecting the number of clusters and setting the scaling parameter in a data driven manner. An algorithm inspired by these recommendations is proposed, whichwe have found to exhibit strong performance in a range of applied domains. AnRimplementation of the algorithm is available from https://github. com/DavidHofmeyr/spuds. Supplementary materials for this article are available online.
引用
收藏
页码:980 / 992
页数:13
相关论文
共 50 条
  • [31] Developments on Solutions of the Normalized-Cut-Clustering Problem Without Eigenvectors
    Leonardo Lorente-Leyva, Leandro
    David Herrera-Granda, Israel
    Rosero-Montalvo, Paul D.
    Ponce-Guevara, Karina L.
    Eduardo Castro-Ospina, Andres
    Becerra, Miguel A.
    Hernan Peluffo-Ordonez, Diego
    Luis Rodriguez-Sotelo, Jose
    ADVANCES IN NEURAL NETWORKS - ISNN 2018, 2018, 10878 : 318 - 328
  • [32] Normalized Cut Group Clustering of Resting-State fMRI Data
    van den Heuvel, Martijn
    Mandl, Rene
    Pol, Hilleke Hulshoff
    PLOS ONE, 2008, 3 (04):
  • [33] Network control by fuzzy normalized cut clustering strategies for critical infrastructures
    Quirk, Mihaela D.
    Climescu-Haulica, Adriana
    Saeger, Kevin J.
    3RD INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, PROCEEDINGS, 2005, : 173 - 177
  • [34] Multi-way spectral clustering based on normalized partitions
    Peluffo Ordonez, Diego Hernan
    CUADERNO ACTIVA, 2013, (05): : 39 - 49
  • [35] Improving Spectral Clustering Using Spectrum-Preserving Node Aggregation
    Wang, Yongyu
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3063 - 3068
  • [36] Design of manufacturable variable stiffness composite laminates using spectral Chebyshev and normalized cut segmentation methods
    Sheikhi, Mohammad
    Anamagh, Mirmeysam Rafiei
    Bediz, Bekir
    Tunc, Lutfi Taner
    COMPOSITE STRUCTURES, 2024, 330
  • [37] NEW SPECTRAL METHODS FOR RATIO CUT PARTITIONING AND CLUSTERING
    HAGEN, L
    KAHNG, AB
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1992, 11 (09) : 1074 - 1085
  • [38] SolidBin: improving metagenome binning with semi-supervised normalized cut
    Wang, Ziye
    Wang, Zhengyang
    Lu, Yang Young
    Sun, Fengzhu
    Zhu, Shanfeng
    BIOINFORMATICS, 2019, 35 (21) : 4229 - 4238
  • [39] On the asymptotic spectral distribution of increasing size matrices: test functions, spectral clustering, and asymptotic estimates of outliers
    Bianchi, Davide
    Garoni, Carlo
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2024, 697 : 615 - 638
  • [40] Data Ranking and Clustering via Normalized Graph Cut Based on Asymmetric Affinity
    Kyrgyzov, Olexiy
    Bloch, Isabelle
    Yang, Yuan
    Wiart, Joe
    Souloumiac, Antoine
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2013), PT II, 2013, 8157 : 562 - 571