Estimating the number of communities in the stochastic block model with outliers

被引:0
|
作者
Xiao, Jingsong [1 ]
Ye, Fei [2 ]
Ma, Weidong [1 ]
Yang, Ying [1 ]
机构
[1] Tsinghua Univ, Dept Math Sci, Beijing 100084, Peoples R China
[2] Capital Univ Econ & Business, Sch Stat, 121 Zhangjialukou, Beijing 100070, Peoples R China
基金
中国国家自然科学基金;
关键词
stochastic block model; community detection; Matrix-Forest index; regularized and normalized adjacency matrix; consistency; CONSISTENCY; BLOCKMODELS; NETWORKS; GRAPHS;
D O I
10.1093/comnet/cnac042
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The stochastic block model (SBM) is a popular model for community detecting problems. Many community detecting approaches have been proposed, and most of them assume that the number of communities is given previously. However, in practice, the number of communities is often unknown. Plenty of approaches were proposed to estimate the number of communities, but most of them were computationally intensive. Moreover, when outliers exist, there are no approaches to consistently estimate the number of communities. In this article, we propose a fast method based on the eigenvalues of the regularized and normalized adjacency matrix to estimate the number of communities under the SBM with outliers. We show that our method can consistently estimate the number of communities when outliers exist. Moreover, we extend our method to the degree-corrected SBM. We show that our approach is comparable to the other existing approaches in simulations. We also illustrate our approach on four real-world networks.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Estimation of the Number of Communities in the Stochastic Block Model
    Cerqueira, Andressa
    Leonardi, Florencia
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2020, 66 (10) : 6403 - 6412
  • [2] A PROCEDURE FOR ESTIMATING THE NUMBER OF OUTLIERS
    JAIN, RB
    PINGEL, LA
    COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1981, 10 (10): : 1029 - 1041
  • [3] A Stochastic Block Model Based Approach to Detect Outliers in Networks
    Angiulli, Fabrizio
    Fassetti, Fabio
    Serrao, Cristina
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 : 149 - 154
  • [4] On the Chromatic Number in the Stochastic Block Model
    Isaev, Mikhail
    Kang, Mihyun
    ELECTRONIC JOURNAL OF COMBINATORICS, 2023, 30 (02):
  • [5] Estimating the number of species in a Stochastic abundance model
    Chao, A
    Bunge, J
    BIOMETRICS, 2002, 58 (03) : 531 - 539
  • [6] Recovering asymmetric communities in the stochastic block model
    Caltagirone, Francesco
    Lelarge, Marc
    Miolane, Leo
    2016 54TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2016, : 9 - 16
  • [7] Recovering Asymmetric Communities in the Stochastic Block Model
    Caltagirone, Francesco
    Lelarge, Marc
    Miolane, Leo
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2018, 5 (03): : 237 - 246
  • [8] Determining the number of communities in degree-corrected stochastic block models
    Ma, Shujie
    Su, Liangjun
    Zhang, Yichong
    Journal of Machine Learning Research, 2021, 22
  • [9] Determining the Number of Communities in Degree-corrected Stochastic Block Models
    Ma, Shujie
    Su, Liangjun
    Zhang, Yichong
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [10] Unraveling the Detectability of Stochastic Block Model With Overlapping Communities
    Wu, Huaying
    Fu, Luoyi
    Long, Huan
    Meng, Guie
    Gan, Xiaoying
    Wu, Yuanhao
    Zhang, Haisong
    Wang, Xinbing
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 1443 - 1455