A scalable community detection algorithm for large graphs using stochastic block models

Cited by: 4
Authors
Peng, Chengbin [1 ,2 ]
Zhang, Zhihua [3 ]
Wong, Ka-Chun [4 ]
Zhang, Xiangliang [1 ]
Keyes, David E. [1 ]
Affiliations
[1] King Abdullah Univ Sci & Technol, Post Box 2925, Thuwal 239556900, Saudi Arabia
[2] Ningbo Inst Ind Technol, Ningbo, Zhejiang, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] City Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
Keywords
Stochastic block model; parallel computing; community detection; MULTI;
DOI
10.3233/IDA-163156
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Community detection in graphs is widely used in social and biological networks, and the stochastic block model is a powerful probabilistic tool for describing graphs with community structures. However, in the era of "big data", traditional inference algorithms for such a model are increasingly limited due to their high time complexity and poor scalability. In this paper, we propose a multi-stage maximum likelihood approach to recover the latent parameters of the stochastic block model, in time linear with respect to the number of edges. We also propose a parallel algorithm based on message passing. Our algorithm can overlap communication and computation, providing speedup without compromising accuracy as the number of processors grows. For example, to process a real-world graph with about 1.3 million nodes and 10 million edges, our algorithm requires about 6 seconds on 64 cores of a contemporary commodity Linux cluster. Experiments demonstrate that the algorithm can produce high-quality results on both benchmark and real-world graphs. We also illustrate a case in which the algorithm finds more meaningful communities than a popular modularity maximization algorithm.
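As a rough illustration of the model the abstract refers to, the sketch below scores a community assignment under a Bernoulli stochastic block model, plugging in the maximum likelihood block probabilities (edges observed divided by node pairs possible for each pair of communities). It is not the paper's multi-stage or parallel algorithm. The function names `block_counts` and `sbm_log_likelihood` are hypothetical, and the graph is assumed to be undirected, without self-loops or repeated edges, with nodes numbered from 0. The edge counting takes a single pass over the edge list, which hints at why likelihood-based scoring can be linear in the number of edges.

```python
import math
from collections import Counter


def block_counts(edges, labels):
    """Count observed edges and possible node pairs per community pair."""
    sizes = Counter(labels)
    edge_counts = Counter()
    for u, v in edges:  # single pass over the edge list
        r, s = sorted((labels[u], labels[v]))
        edge_counts[(r, s)] += 1

    pair_counts = {}
    communities = sorted(sizes)
    for i, r in enumerate(communities):
        for s in communities[i:]:
            if r == s:
                # unordered pairs inside one community
                pair_counts[(r, s)] = sizes[r] * (sizes[r] - 1) // 2
            else:
                pair_counts[(r, s)] = sizes[r] * sizes[s]
    return edge_counts, pair_counts


def sbm_log_likelihood(edges, labels):
    """Log-likelihood with each block probability set to its MLE, m / n."""
    edge_counts, pair_counts = block_counts(edges, labels)
    total = 0.0
    for key, n in pair_counts.items():
        m = edge_counts.get(key, 0)
        if n == 0 or m == 0 or m == n:
            continue  # estimated probability is 0 or 1: contributes zero
        p = m / n
        total += m * math.log(p) + (n - m) * math.log(1.0 - p)
    return total


# Toy graph (made up for illustration): two triangles joined by one edge.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
good = sbm_log_likelihood(edges, [0, 0, 0, 1, 1, 1])
bad = sbm_log_likelihood(edges, [0, 1, 0, 1, 0, 1])
print(good > bad)
```

On this toy graph, the assignment that groups each triangle together scores higher than the alternating assignment, which is the behaviour a likelihood-based community detector relies on.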
Pages: 1463-1485
Page count: 23