RECEIPT: REfine CoarsE-grained IndePendent Tasks for Parallel Tip decomposition of Bipartite Graphs

被引：6

作者：

Lakhotia, Kartik ^{[1
]}

Kannan, Rajgopal ^{[2
]}

Prasanna, Viktor ^{[1
]}

De Rose, Cesar A. F. ^{[3
]}

机构：

[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90007 USA

[2] USA Res Lab, Los Angeles, CA 90094 USA

[3] Pontificia Univ Catolica Rio Grande do Sul, Sch Technol, Porto Alegre, RS, Brazil

来源：

PROCEEDINGS OF THE VLDB ENDOWMENT | 2020年 / 14卷 / 03期

基金：

美国国家科学基金会;

关键词：

ALGORITHMS;

D O I：

10.14778/3430915.3430929

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Tip decomposition is a crucial kernel for mining dense subgraphs in bipartite networks, with applications in spam detection, analysis of affiliation networks etc. It creates a hierarchy of vertex-induced subgraphs with varying densities determined by the participation of vertices in butterflies (2, 2-bicliques). To build the hierarchy, existing algorithms iteratively follow a delete-update(peeling) process: deleting vertices with the minimum number of butterflies and correspondingly updating the butterfly count of their 2-hop neighbors. The need to explore 2-hop neighborhood renders tip-decomposition computationally very expensive. Furthermore, the inherent sequentiality in peeling only minimum butterfly vertices makes derived parallel algorithms prone to heavy synchronization. In this paper, we propose a novel parallel tip-decomposition algorithm - REfine CoarsE-grained Independent Tasks (RECEIPT) that relaxes the peeling order restrictions by partitioning the vertices into multiple independent subsets that can be concurrently peeled. This enables RECEIPT to simultaneously achieve a high degree of parallelism and dramatic reduction in synchronizations. Further, RECEIPT employs a hybrid peeling strategy along with other optimizations that drastically reduce the amount of wedge exploration and execution time. We perform detailed experimental evaluation of RECEIPT on a shared-memory multicore server. It can process some of the largest publicly available bipartite datasets orders of magnitude faster than the state-of-the-art algorithms - achieving up to 1100x and 64x reduction in the number of thread synchronizations and traversed wedges, respectively. Using 36 threads, RECEIPT can provide up to 17.1x self-relative speedup.

引用

页码：404 / 417

页数：14

共 50 条

[31] A Coarse-grained Parallel Algorithm for the Matrix Chain Order Problem
Higa, Dilson R.
Stefanes, Marco A.
HIGH PERFORMANCE COMPUTING SYMPOSIUM 2012 (HPC 2012), 2012, 44 (06): : 58 - 65
[32] Coarse-grained distributed parallel programming interface for grid computing
Wu, YW
Wang, Q
Yang, GW
Zheng, WM
GRID AND COOPERATIVE COMPUTING, PT 1, 2004, 3032 : 255 - 258
[33] Polygonal approximation based on coarse-grained parallel genetic algorithm
Wu, Zhaobin
Zhao, Chunxia
Liu, Bin
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71 (71)
[34] Mixed-granularity Parallel Coarse-grained Reconfigurable Architecture
Deng, Jinyi
Zhang, Linyun
Wang, Lei
Liu, Jiawei
Deng, Kexiang
Tang, Shibin
Gu, Jiangyuan
Han, Boxiao
Xu, Fei
Liu, Leibo
Wei, Shaojun
Yin, Shouyi
PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 343 - 348
[35] C3: A Parallel Model for Coarse-Grained Machines
J Parallel Distrib Comput, 2 (139):
[36] Analysis of crossovers and selections in a coarse-grained parallel genetic algorithm
Katayama, K
Hirabayashi, H
Narihisa, H
MATHEMATICAL AND COMPUTER MODELLING, 2003, 38 (11-13) : 1275 - 1282
[37] Efficient Parallel Graph Algorithms for Coarse-Grained Multicomputers and BSP
F. Dehne
A. Ferreira
E. Cáceres
S. W. Song
A. Roncato
Algorithmica, 2002, 33 : 183 - 200
[38] A coarse-grained parallel algorithm for spanning tree and connected components
Cáceres, EN
Dehne, F
Mongelli, H
Song, SW
Szwarcfiter, JL
EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 828 - 831
[39] PACK/UNPACK on coarse-grained distributed memory parallel machines
Bae, SJ
Ranka, S
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 38 (02) : 204 - 216
[40] C-3: A parallel model for coarse-grained machines
Hambrusch, SE
Khokhar, AA
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 32 (02) : 139 - 154

← 1 2 3 4 5 →