A scalable community detection algorithm for large graphs using stochastic block models

被引:4
|
作者
Peng, Chengbin [1 ,2 ]
Zhang, Zhihua [3 ]
Wong, Ka-Chun [4 ]
Zhang, Xiangliang [1 ]
Keyes, David E. [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Post Box 2925, Thuwal 239556900, Saudi Arabia
[2] Ningbo Inst Ind Technol, Ningbo, Zhejiang, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] City Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
Stochastic block model; parallel computing; community detection; MULTI;
D O I
10.3233/IDA-163156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection in graphs is widely used in social and biological networks, and the stochastic block model is a powerful probabilistic tool for describing graphs with community structures. However, in the era of "big data", traditional inference algorithms for such a model are increasingly limited due to their high time complexity and poor scalability. In this paper, we propose a multi-stage maximum likelihood approach to recover the latent parameters of the stochastic block model, in time linear with respect to the number of edges. We also propose a parallel algorithm based on message passing. Our algorithm can overlap communication and computation, providing speedup without compromising accuracy as the number of processors grows. For example, to process a real-world graph with about 1.3 million nodes and 10 million edges, our algorithm requires about 6 seconds on 64 cores of a contemporary commodity Linux cluster. Experiments demonstrate that the algorithm can produce high quality results on both benchmark and real-world graphs. An example of finding more meaningful communities is illustrated consequently in comparison with a popular modularity maximization algorithm.
引用
收藏
页码:1463 / 1485
页数:23
相关论文
共 50 条
  • [41] BNEM: a fast community detection algorithm using generative models
    Hafez A.I.
    Hassanien A.E.
    Fahmy A.A.
    Hafez, Ahmed Ibrahem (ah.hafez@gmail.com), 1600, Springer-Verlag Wien (04): : 1 - 20
  • [42] Towards a Scalable Parallel Infomap Algorithm for Community Detection
    Santos, Gabriel G.
    Lakhotia, Kartik
    De Rose, Cesar A. F.
    2024 32ND EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PDP 2024, 2024, : 116 - 123
  • [43] A SPECTRAL METHOD FOR COMMUNITY DETECTION IN MODERATELY SPARSE DEGREE-CORRECTED STOCHASTIC BLOCK MODELS
    Gulikers, Lennart
    Lelarge, Marc
    Massoulie, Laurent
    ADVANCES IN APPLIED PROBABILITY, 2017, 49 (03) : 686 - 721
  • [44] A generalized stochastic block model for overlapping community detection
    Liu, Xuan-Chen
    Zhang, Li-Jie
    Xu, Xin-Jian
    EPL, 2024, 146 (04)
  • [45] Distributed Community Detection on Overlapping Stochastic Block Model
    Xu, Jiasheng
    Fu, Luoyi
    Gan, Xiaoying
    Zhu, Bo
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 201 - 206
  • [46] Efficient Distributed Community Detection in the Stochastic Block Model
    Fathi, Reza
    Molla, Anisur Rahaman
    Pandurangan, Gopal
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 409 - 419
  • [47] Community detection in the sparse hypergraph stochastic block model
    Pal, Soumik
    Zhu, Yizhe
    RANDOM STRUCTURES & ALGORITHMS, 2021, 59 (03) : 407 - 463
  • [48] Triangle-Driven Community Detection in Large Graphs Using Propositional Satisfiability
    Jabbour, Said
    Mhadhbi, Nizar
    Radaoui, Badran
    Sais, Lakhdar
    PROCEEDINGS 2018 IEEE 32ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2018, : 437 - 444
  • [49] EXPONENTIAL RANDOM GRAPHS BEHAVE LIKE MIXTURES OF STOCHASTIC BLOCK MODELS
    Eldan, Ronen
    Gross, Renan
    ANNALS OF APPLIED PROBABILITY, 2018, 28 (06): : 3698 - 3735
  • [50] Scalable Generation of Graphs for Benchmarking HPC Community-Detection Algorithms
    Slota, George M.
    Berry, Jonathan W.
    Hammond, Simon D.
    Olivier, Stephen L.
    Phillips, Cynthia A.
    Rajamanickam, Siva
    PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,