Scalable distributed Louvain algorithm for community detection in large graphs

被引:5
|
作者
Sattar, Naw Safrin [1 ]
Arifuzzaman, Shaikh [1 ]
机构
[1] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
来源
JOURNAL OF SUPERCOMPUTING | 2022年 / 78卷 / 07期
关键词
Community detection; Louvain method; Parallel algorithms; MPI; OpenMP; Load balancing; Graph mining;
D O I
10.1007/s11227-021-04224-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Community detection (or clustering) in large-scale graphs is an important problem in graph mining. Communities reveal interesting organizational and functional characteristics of a network. Louvain algorithm is an efficient sequential algorithm for community detection. However, such sequential algorithms fail to scale for emerging large-scale data. Scalable parallel algorithms are necessary to process large graph datasets. In this work, we show a comparative analysis of our different parallel implementations of Louvain algorithm. We design parallel algorithms for Louvain method in shared memory and distributed memory settings. Developing distributed memory parallel algorithms is challenging because of inter-process communication and load balancing issues. We incorporate dynamic load balancing in our final algorithm DPLAL (Distributed Parallel Louvain Algorithm with Load-balancing). DPLAL overcomes the performance bottleneck of the previous algorithms and shows around 12-fold speedup scaling to a larger number of processors. We also compare the performance of our algorithm with some other prominent algorithms in the literature and get better or comparable performance . We identify the challenges in developing distributed memory algorithm and provide an optimized solution DPLAL showing performance analysis of the algorithm on large-scale real-world networks from different domains.
引用
收藏
页码:10275 / 10309
页数:35
相关论文
共 50 条
  • [31] Shared-Memory Parallel Dynamic Louvain Algorithm for Community Detection
    Sahu, Subhajit
    Kothapalli, Kishore
    Banerjee, Dip Sankar
    2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 1204 - 1205
  • [32] EigenSpokes: Surprising patterns and Scalable Community Chipping in Large Graphs
    Prakash, B. Aditya
    Seshadri, Mukund
    Sridharan, Aswin
    Machiraju, Sridhar
    Faloutsos, Christos
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 290 - +
  • [33] EigenSpokes: Surprising Patterns and Scalable Community Chipping in Large Graphs
    Prakash, B. Aditya
    Sridharan, Ashwin
    Seshadri, Mukund
    Machiraju, Sridhar
    Faloutsos, Christos
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PROCEEDINGS, 2010, 6119 : 435 - +
  • [34] Scalable Distributed Memory Community Detection Using Vite
    Ghosh, Sayan
    Halappanavar, Mahantesh
    Tumeo, Antonino
    Kalyanaraman, Ananth
    Gebremedhin, Assefaw H.
    2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
  • [35] A new distributed graph coloring algorithm for large graphs
    Assia Brighen
    Hachem Slimani
    Abdelmounaam Rezgui
    Hamamache Kheddouci
    Cluster Computing, 2024, 27 : 875 - 891
  • [36] A Distributed Graph Partitioning Algorithm for Processing Large Graphs
    Chen, Tefeng
    Li, Bo
    PROCEEDINGS 2016 IEEE SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING SOSE 2016, 2016, : 71 - 77
  • [37] Distributed Algorithm for Tip Decomposition on Large Bipartite Graphs
    Zhou X.
    Weng T.-F.
    Yang Z.-B.
    Li B.-R.
    Zhang J.
    Li K.-L.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 1043 - 1056
  • [38] A new distributed graph coloring algorithm for large graphs
    Brighen, Assia
    Slimani, Hachem
    Rezgui, Abdelmounaam
    Kheddouci, Hamamache
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (01): : 875 - 891
  • [39] Parallelizing Louvain Algorithm: Distributed Memory Challenges
    Sattar, Naw Safrin
    Arifuzzaman, Shaikh
    2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 695 - 701
  • [40] MDPCluster: a swarm-based community detection algorithm in large-scale graphs
    Shirjini, Mahsa Fozuni
    Farzi, Saeed
    Nikanjam, Amin
    COMPUTING, 2020, 102 (04) : 893 - 922