Practical Parallel Algorithms for Near-Optimal Densest Subgraphs on Massive Graphs

Cited by: 0
Authors
Sukprasert, Pattara [1, 2]
Liu, Quanquan C. [3]
Dhulipala, Laxman [4]
Shun, Julian [5]
Affiliations
[1] Databricks, San Francisco, CA 94105 USA
[2] Northwestern Univ, Evanston, IL 60208 USA
[3] Univ Calif Berkeley, Simons Inst, Berkeley, CA USA
[4] Univ Maryland, College Pk, MD 20742 USA
[5] MIT, CSAIL, 77 Massachusetts Ave, Cambridge, MA 02139 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
The densest subgraph problem has received significant attention, both in theory and in practice, due to its applications in problems such as community detection, social network analysis, and spam detection. Because exact solutions are expensive to compute, much attention has focused on designing approximate densest subgraph algorithms. However, existing approaches do not scale to massive graphs with billions of edges. In this paper, we introduce a new framework that combines approximate densest subgraph algorithms with a pruning optimization. We design new parallel variants of the state-of-the-art sequential Greedy++ algorithm and plug them into our framework, together with a parallel pruning technique based on k-core decomposition, to obtain parallel (1+ε)-approximate densest subgraph algorithms. On a single thread, our algorithms achieve a 2.6-34x speedup over Greedy++, and they obtain up to a 22.37x self-relative parallel speedup on a 30-core machine with two-way hyper-threading. Compared with the state-of-the-art parallel algorithm by Harb et al. [NeurIPS'22], we achieve up to a 114x speedup on the same machine. Finally, against the recent sequential algorithm of Xu et al. [PACMMOD'23], we achieve up to a 25.9x speedup. The scalability of our algorithms enables us to obtain near-optimal density statistics on the hyperlink2012 (with roughly 113 billion edges) and clueweb (with roughly 37 billion edges) graphs for the first time in the literature.
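To make the idea in the abstract concrete, the following is a minimal sequential sketch of load-based greedy peeling (in the style of Greedy++) combined with k-core pruning. It is not the authors' parallel implementation. The function names (`build_adj`, `k_core`, `peel_once`, `approx_densest_subgraph`), the `iterations` parameter, and the choice to carry loads across pruning rounds are all assumptions made for illustration. The pruning step relies on a standard fact: every vertex of a densest subgraph has degree at least the optimal density inside that subgraph, so the densest subgraph lies in the k-core for k equal to the ceiling of any lower bound on the optimal density.

```python
import heapq
from math import ceil


def build_adj(edges):
    """Build an undirected simple graph as a dict of neighbor sets."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def density(adj, nodes):
    """Edges divided by vertices in the subgraph induced by `nodes`."""
    nodes = set(nodes)
    if not nodes:
        return 0.0
    m = sum(1 for u in nodes for v in adj[u] if v in nodes) // 2
    return m / len(nodes)


def k_core(adj, nodes, k):
    """Vertices of the k-core of the subgraph induced by `nodes`."""
    alive = set(nodes)
    deg = {u: sum(1 for v in adj[u] if v in alive) for u in alive}
    stack = [u for u in alive if deg[u] < k]
    while stack:
        u = stack.pop()
        if u not in alive:
            continue  # already removed via an earlier stack entry
        alive.discard(u)
        for v in adj[u]:
            if v in alive:
                deg[v] -= 1
                if deg[v] < k:
                    stack.append(v)
    return alive


def peel_once(adj, nodes, load):
    """One peeling pass: repeatedly remove the vertex minimizing
    load + current degree, and remember the densest set seen."""
    alive = set(nodes)
    deg = {u: sum(1 for v in adj[u] if v in alive) for u in alive}
    m = sum(deg.values()) // 2
    heap = [(load[u] + deg[u], u) for u in alive]
    heapq.heapify(heap)
    order, best_start, best_density = [], 0, -1.0
    while alive:
        key, u = heapq.heappop(heap)
        if u not in alive or key != load[u] + deg[u]:
            continue  # stale heap entry (lazy deletion)
        d = m / len(alive)
        if d > best_density:
            best_density, best_start = d, len(order)
        alive.remove(u)
        order.append(u)
        load[u] += deg[u]  # the load grows by the degree at removal time
        m -= deg[u]
        for v in adj[u]:
            if v in alive:
                deg[v] -= 1
                heapq.heappush(heap, (load[v] + deg[v], v))
    # The best set is everything not yet removed when the best density was seen.
    return best_density, set(order[best_start:])


def approx_densest_subgraph(edges, iterations=10):
    """Alternate peeling passes with k-core pruning (illustrative only)."""
    adj = build_adj(edges)
    nodes = set(adj)
    load = {u: 0 for u in nodes}
    best_density, best_set = 0.0, set()
    for _ in range(iterations):
        if not nodes:
            break
        d, s = peel_once(adj, nodes, load)
        if d > best_density:
            best_density, best_set = d, s
        # Prune: the densest subgraph lies inside the ceil(lower bound)-core.
        nodes = k_core(adj, nodes, ceil(best_density))
    return best_density, best_set


# Example: a 4-clique on {0, 1, 2, 3} with a short tail 3-4-5.
edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (3, 4), (4, 5)]
best_density, best_set = approx_densest_subgraph(edges)
print(best_density, sorted(best_set))
```

In this toy graph the first peeling pass already finds the 4-clique, and the pruning step then shrinks the working graph to the 2-core, so later passes touch only four vertices. The sketch makes no claim about the approximation guarantees or the performance figures reported in the abstract.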
Pages: 59-73
Page count: 15