Estimating the number of communities in the stochastic block model with outliers

Cited by: 0
Authors
Xiao, Jingsong [1]
Ye, Fei [2]
Ma, Weidong [1]
Yang, Ying [1]
Affiliations
[1] Tsinghua Univ, Dept Math Sci, Beijing 100084, Peoples R China
[2] Capital Univ Econ & Business, Sch Stat, 121 Zhangjialukou, Beijing 100070, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
stochastic block model; community detection; Matrix-Forest index; regularized and normalized adjacency matrix; consistency; CONSISTENCY; BLOCKMODELS; NETWORKS; GRAPHS;
DOI
10.1093/comnet/cnac042
Chinese Library Classification
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
The stochastic block model (SBM) is a popular model for community detection. Many community detection approaches have been proposed, and most of them assume that the number of communities is known in advance. In practice, however, the number of communities is often unknown. Many approaches have been proposed to estimate the number of communities, but most of them are computationally intensive. Moreover, no existing approach consistently estimates the number of communities when outliers are present. In this article, we propose a fast method, based on the eigenvalues of the regularized and normalized adjacency matrix, to estimate the number of communities under the SBM with outliers. We show that our method consistently estimates the number of communities when outliers exist. We also extend our method to the degree-corrected SBM. Simulations show that our approach is comparable to other existing approaches. Finally, we illustrate our approach on four real-world networks.
Pages: 23
Related papers
50 in total
  • [11] Estimating the Number of Communities in a Network
    Newman, M. E. J.
    Reinert, Gesine
    PHYSICAL REVIEW LETTERS, 2016, 117 (07)
  • [12] Stochastic model for estimating the annual number of storm overflow discharges
    Szelag, B.
    ENVIRONMENTAL ENGINEERING V, 2017: 43-51
  • [13] Consistent estimation of the number of communities in stochastic block models using cross-validation
    Qin, Jining
    Lei, Jing
    STAT, 2022, 11 (01)
  • [14] Recovering Communities in the General Stochastic Block Model Without Knowing the Parameters
    Abbe, Emmanuel
    Sandon, Colin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [15] Estimating the number of communities by spectral methods
    Le, Can M.
    Levina, Elizaveta
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01): 3315-3342
  • [16] Estimating the Number of Communities in Weighted Networks
    Qing, Huan
    ENTROPY, 2023, 25 (04)
  • [17] A GENERAL-MODEL FOR ESTIMATING THE NUMBER OF TERTIARY ESTABLISHMENTS IN COMMUNITIES - AN ARIZONA PERSPECTIVE
    MULLIGAN, GF
    WALLACE, ML
    PLANE, DA
    SOCIAL SCIENCE JOURNAL, 1985, 22 (02): 77-93
  • [18] Estimating Effective Reproduction Number for SIR Compartmental Model: A Stochastic Evolutionary Approach
    Wong W.K.
    Juwono F.H.
    Journal of Social Computing, 2022, 3 (02): 182-189
  • [19] Recovering Unbalanced Communities in the Stochastic Block Model with Application to Clustering with a Faulty Oracle
    Mukherjee, Chandra Sekhar
    Peng, Pan
    Zhang, Jiapeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [20] Efficient method for estimating the number of communities in a network
    Riolo, Maria A.
    Cantwell, George T.
    Reinert, Gesine
    Newman, M. E. J.
    PHYSICAL REVIEW E, 2017, 96 (03)