Strong consistency guarantees for clustering high-dimensional bipartite graphs with the spectral method

被引:0
|
作者
Braun, Guillaume [1 ]
机构
[1] RIKEN AIP, Tokyo, Japan
来源
ELECTRONIC JOURNAL OF STATISTICS | 2024年 / 18卷 / 02期
关键词
Spectral method; clustering; bipartite stochastic block model; MATRICES;
D O I
10.1214/24-EJS2271
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We investigate the problem of clustering bipartite graphs using a simple spectral method within the framework of the Bipartite Stochastic Block Model (BiSBM), a popular model for bipartite graphs having a community structure. Our focus lies in the high-dimensional setting where the number n 1 of rows, and n 2 of columns, of the associated adjacency matrix differ significantly. A recent study by [4] has established a sufficient and necessary condition related to the sparsity level p max of the bipartite graph, enabling the recovery of the latent partition of the rows. In their work, [4] introduces an iterative method that extends the approach proposed by [26] to achieve the stated recovery goal. However, empirical results suggest that the subsequent refinement algorithm does not significantly enhance the performance of the spectral method, indicating that the spectral method achieves exact recovery within the same regime as the refinement method. We establish this claim by deriving new entrywise bounds on the eigenvectors of the similarity matrix utilized by the spectral method. Our analysis extends the framework of [23], which is limited to symmetric matrices with restricted dependencies. As a critical technical step, we also derive an improved concentration inequality tailored for similarity matrices.
引用
收藏
页码:2798 / 2823
页数:26
相关论文
共 50 条
  • [31] A Survey on High-Dimensional Subspace Clustering
    Qu, Wentao
    Xiu, Xianchao
    Chen, Huangyue
    Kong, Lingchen
    [J]. MATHEMATICS, 2023, 11 (02)
  • [32] Adaptive quantum tomography of high-dimensional bipartite systems
    Struchalin, G., I
    Kovlakov, E., V
    Straupe, S. S.
    Kulik, S. P.
    [J]. PHYSICAL REVIEW A, 2018, 98 (03)
  • [33] On super quantum discord for high-dimensional bipartite state
    Zhou, Jianming
    Hu, Xiaoli
    Jing, Naihuan
    [J]. QUANTUM INFORMATION PROCESSING, 2023, 22 (12)
  • [34] Minimax Optimal Clustering of Bipartite Graphs with a Generalized Power Method
    Braun, Guillaume
    Tyagi, Hemant
    [J]. arXiv, 2022,
  • [35] Minimax optimal clustering of bipartite graphs with a generalized power method
    Braun, Guillaume
    Tyagi, Hemant
    [J]. INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2023, 12 (03)
  • [36] Redefining clustering for high-dimensional applications
    Aggarwal, CC
    Yu, PS
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (02) : 210 - 225
  • [37] A Clustering-Based Bipartite Graph Privacy-Preserving Approach for Sharing High-Dimensional Data
    Wang, Li-e
    Li, Xianxian
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2014, 24 (07) : 1091 - 1111
  • [38] On super quantum discord for high-dimensional bipartite state
    Jianming Zhou
    Xiaoli Hu
    Naihuan Jing
    [J]. Quantum Information Processing, 22
  • [39] Correlation and high-dimensional consistency in pattern recognition
    Ge, NX
    Simpson, DG
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1998, 93 (443) : 995 - 1006
  • [40] High-dimensional posterior consistency of the Bayesian lasso
    Dasgupta, Shibasish
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (22) : 6700 - 6708