A scalable community detection algorithm for large graphs using stochastic block models

被引:4
|
作者
Peng, Chengbin [1 ,2 ]
Zhang, Zhihua [3 ]
Wong, Ka-Chun [4 ]
Zhang, Xiangliang [1 ]
Keyes, David E. [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Post Box 2925, Thuwal 239556900, Saudi Arabia
[2] Ningbo Inst Ind Technol, Ningbo, Zhejiang, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] City Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
Stochastic block model; parallel computing; community detection; MULTI;
D O I
10.3233/IDA-163156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection in graphs is widely used in social and biological networks, and the stochastic block model is a powerful probabilistic tool for describing graphs with community structures. However, in the era of "big data", traditional inference algorithms for such a model are increasingly limited due to their high time complexity and poor scalability. In this paper, we propose a multi-stage maximum likelihood approach to recover the latent parameters of the stochastic block model, in time linear with respect to the number of edges. We also propose a parallel algorithm based on message passing. Our algorithm can overlap communication and computation, providing speedup without compromising accuracy as the number of processors grows. For example, to process a real-world graph with about 1.3 million nodes and 10 million edges, our algorithm requires about 6 seconds on 64 cores of a contemporary commodity Linux cluster. Experiments demonstrate that the algorithm can produce high quality results on both benchmark and real-world graphs. An example of finding more meaningful communities is illustrated consequently in comparison with a popular modularity maximization algorithm.
引用
收藏
页码:1463 / 1485
页数:23
相关论文
共 50 条
  • [31] CONSISTENCY OF COMMUNITY DETECTION IN NETWORKS UNDER DEGREE-CORRECTED STOCHASTIC BLOCK MODELS
    Zhao, Yunpeng
    Levina, Elizaveta
    Zhu, Ji
    ANNALS OF STATISTICS, 2012, 40 (04): : 2266 - 2292
  • [32] Community detection in general stochastic block models: fundamental limits and efficient algorithms for recovery
    Abbe, Emmanuel
    Sandon, Colin
    2015 IEEE 56TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, 2015, : 670 - 688
  • [33] Profile-pseudo likelihood methods for community detection of multilayer stochastic block models
    Fu, Kang
    Hu, Jianwei
    STAT, 2023, 12 (01):
  • [34] Large deviations for empirical measures of dense stochastic block graphs
    Zheng Wenhua
    Liu Qun
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020,
  • [35] A distributed overlapping community detection model for large graphs using autoencoder
    Bhatia, Vandana
    Rani, Rinkle
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 94 : 16 - 26
  • [36] MDPCluster: a swarm-based community detection algorithm in large-scale graphs
    Shirjini, Mahsa Fozuni
    Farzi, Saeed
    Nikanjam, Amin
    COMPUTING, 2020, 102 (04) : 893 - 922
  • [37] MDPCluster: a swarm-based community detection algorithm in large-scale graphs
    Mahsa Fozuni Shirjini
    Saeed Farzi
    Amin Nikanjam
    Computing, 2020, 102 : 893 - 922
  • [38] A Genetic Algorithm for Community Detection in Attributed Graphs
    Pizzuti, Clara
    Socievole, Annalisa
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 159 - 170
  • [39] A Dynamic Algorithm for Local Community Detection in Graphs
    Zakrzewska, Anita
    Bader, David A.
    PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 559 - 564
  • [40] An Efficient Block Mode Detection Algorithm for Scalable Video Coding using Probability Model
    Balaji, L.
    Thyagharajan, K. K.
    Dhanalakshmi, A.
    INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2015, 4 (02) : 42 - 55