Practical Parallel Algorithms for Near-Optimal Densest Subgraphs on Massive Graphs

Cited by: 0
Authors
Sukprasert, Pattara [1, 2]
Liu, Quanquan C. [3]
Dhulipala, Laxman [4]
Shun, Julian [5]
Affiliations
[1] Databricks, San Francisco, CA 94105 USA
[2] Northwestern Univ, Evanston, IL 60208 USA
[3] Univ Calif Berkeley, Simons Inst, Berkeley, CA USA
[4] Univ Maryland, College Pk, MD 20742 USA
[5] MIT, CSAIL, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and Methods];
Discipline classification code
081202;
Abstract
The densest subgraph problem has received significant attention, both in theory and in practice, due to its applications in problems such as community detection, social network analysis, and spam detection. Due to the high cost of obtaining exact solutions, much attention has focused on designing approximate densest subgraph algorithms. However, existing approaches cannot scale to massive graphs with billions of edges. In this paper, we introduce a new framework that combines approximate densest subgraph algorithms with a pruning optimization. We design new parallel variants of the state-of-the-art sequential Greedy++ algorithm and plug them into our framework, together with a parallel pruning technique based on k-core decomposition, to obtain parallel (1+epsilon)-approximate densest subgraph algorithms. On a single thread, our algorithms achieve 2.6-34x speedup over Greedy++, and they obtain up to 22.37x self-relative parallel speedup on a 30-core machine with two-way hyper-threading. Compared with the state-of-the-art parallel algorithm by Harb et al. [NeurIPS'22], we achieve up to a 114x speedup on the same machine. Finally, against the recent sequential algorithm of Xu et al. [PACMMOD'23], we achieve up to a 25.9x speedup. The scalability of our algorithms enables us to obtain near-optimal density statistics on the hyperlink2012 (with roughly 113 billion edges) and clueweb (with roughly 37 billion edges) graphs for the first time in the literature.
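To make the combination of peeling and pruning in the abstract concrete, the following is a minimal sequential sketch in Python. It is not the paper's parallel algorithm. It assumes an undirected simple graph given as an adjacency dictionary with integer vertex ids, and the function names (`k_core`, `greedy_pp`, `densest_subgraph_sketch`) are hypothetical. The pruning step relies on the standard fact that every vertex of a densest subgraph has degree at least the optimal density inside it, so a density lower bound d lets us restrict the search to the ceil(d)-core. The peeling loop follows the usual description of Greedy++: each round removes vertices in increasing order of load plus current degree, and a removed vertex adds its degree at removal time to its load.

```python
import heapq
from math import ceil

def k_core(adj, k):
    """Return the vertex set of the k-core by repeatedly deleting vertices of degree < k."""
    deg = {v: len(ns) for v, ns in adj.items()}
    alive = set(adj)
    stack = [v for v in adj if deg[v] < k]
    while stack:
        v = stack.pop()
        if v not in alive:
            continue
        alive.remove(v)
        for u in adj[v]:
            if u in alive:
                deg[u] -= 1
                if deg[u] < k:
                    stack.append(u)
    return alive

def greedy_pp(adj, iterations=1):
    """Load-based iterative peeling; one iteration is plain greedy peeling.

    Returns (best density, vertex set) over all subgraphs seen while peeling,
    where density is edges divided by vertices.
    """
    load = {v: 0 for v in adj}
    best_density, best_set = 0.0, set(adj)
    for _ in range(iterations):
        deg = {v: len(adj[v]) for v in adj}
        alive = set(adj)
        m = sum(deg.values()) // 2  # each undirected edge is counted twice
        heap = [(load[v] + deg[v], v) for v in adj]
        heapq.heapify(heap)
        while alive:
            density = m / len(alive)
            if density > best_density:
                best_density, best_set = density, set(alive)
            # Pop until we find an entry that is still current (lazy deletion).
            while True:
                key, v = heapq.heappop(heap)
                if v in alive and key == load[v] + deg[v]:
                    break
            alive.remove(v)
            load[v] += deg[v]  # degree at removal time is added to the load
            m -= deg[v]
            for u in adj[v]:
                if u in alive:
                    deg[u] -= 1
                    heapq.heappush(heap, (load[u] + deg[u], u))
    return best_density, best_set

def densest_subgraph_sketch(adj, iterations=10):
    """One peeling pass for a lower bound, prune to a core, then iterate on the core."""
    lower, first_set = greedy_pp(adj, 1)
    core = k_core(adj, ceil(lower))
    sub = {v: [u for u in adj[v] if u in core] for v in core}
    density, nodes = greedy_pp(sub, iterations)
    return (density, nodes) if density >= lower else (lower, first_set)
```

For example, on a 4-clique with a two-edge tail attached (`{0: [1, 2, 3], 1: [0, 2, 3], 2: [0, 1, 3], 3: [0, 1, 2, 4], 4: [3, 5], 5: [4]}`), the first pass finds the clique at density 1.5, the 2-core prunes the tail, and later iterations run only on the four clique vertices. The set copy on every improvement is kept for readability; an implementation aimed at large graphs would record the removal order instead.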
Pages: 59-73
Page count: 15
Related papers
50 in total
  • [41] Near-Optimal Quantum Algorithms for Multivariate Mean Estimation
    Cornelissen, Arjan
    Hamoudi, Yassine
    Jerbi, Sofiene
    PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022, : 33 - 43
  • [42] Fast and Near-Optimal Algorithms for Approximating Distributions by Histograms
    Acharya, Jayadev
    Diakonikolas, Ilias
    Hegde, Chinmay
    Li, Jerry
    Schmidt, Ludwig
    PODS'15: PROCEEDINGS OF THE 33RD ACM SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2015, : 249 - 263
  • [43] Near-Optimal Algorithms for Maximum Constraint Satisfaction Problems
    Charikar, Moses
    Makarychev, Konstantin
    Makarychev, Yury
    ACM TRANSACTIONS ON ALGORITHMS, 2009, 5 (03)
  • [44] Near-Optimal Algorithms for Maximum Constraint Satisfaction Problems
    Charikar, Moses
    Makarychev, Konstantin
    Makarychev, Yury
    PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2007, : 62 - 68
  • [45] Adaptive Reconnaissance Attacks with Near-Optimal Parallel Batching
    Li, Xiang
    Smith, J. David
    Thai, My T.
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 699 - 709
  • [46] A NEAR-OPTIMAL PARALLEL ALGORITHM FOR JOINING BINARY RELATIONS
    Ketsman, Bas
    Suciu, Dan
    Tao, Yufei
    LOGICAL METHODS IN COMPUTER SCIENCE, 2022, 18 (02) : 1 - 6
  • [47] Optimal and near-optimal test sequencing algorithms with realistic test models
    Raghavan, V
    Shakeri, M
    Pattipati, K
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 1999, 29 (01) : 11 - 26
  • [48] OPTIMAL AND NEAR-OPTIMAL SCHEDULING ALGORITHMS FOR BATCHED PROCESSING IN LINEAR STORAGE
    BITNER, JR
    WONG, CK
    SIAM JOURNAL ON COMPUTING, 1979, 8 (04) : 479 - 498
  • [49] Optimal and near-optimal algorithms for generalized intersection reporting on pointer machines
    Shi, QM
    JaJa, J
    INFORMATION PROCESSING LETTERS, 2005, 95 (03) : 382 - 388
  • [50] Parallel Incremental Clustering Algorithms for Massive Dynamic Graphs
    Langguth, Johannes
    PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2023, CF 2023, 2023, : 264 - 264