RECEIPT: REfine CoarsE-grained IndePendent Tasks for Parallel Tip decomposition of Bipartite Graphs

被引:6
|
作者
Lakhotia, Kartik [1 ]
Kannan, Rajgopal [2 ]
Prasanna, Viktor [1 ]
De Rose, Cesar A. F. [3 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90007 USA
[2] USA Res Lab, Los Angeles, CA 90094 USA
[3] Pontificia Univ Catolica Rio Grande do Sul, Sch Technol, Porto Alegre, RS, Brazil
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020年 / 14卷 / 03期
基金
美国国家科学基金会;
关键词
ALGORITHMS;
D O I
10.14778/3430915.3430929
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tip decomposition is a crucial kernel for mining dense subgraphs in bipartite networks, with applications in spam detection, analysis of affiliation networks etc. It creates a hierarchy of vertex-induced subgraphs with varying densities determined by the participation of vertices in butterflies (2, 2-bicliques). To build the hierarchy, existing algorithms iteratively follow a delete-update(peeling) process: deleting vertices with the minimum number of butterflies and correspondingly updating the butterfly count of their 2-hop neighbors. The need to explore 2-hop neighborhood renders tip-decomposition computationally very expensive. Furthermore, the inherent sequentiality in peeling only minimum butterfly vertices makes derived parallel algorithms prone to heavy synchronization. In this paper, we propose a novel parallel tip-decomposition algorithm - REfine CoarsE-grained Independent Tasks (RECEIPT) that relaxes the peeling order restrictions by partitioning the vertices into multiple independent subsets that can be concurrently peeled. This enables RECEIPT to simultaneously achieve a high degree of parallelism and dramatic reduction in synchronizations. Further, RECEIPT employs a hybrid peeling strategy along with other optimizations that drastically reduce the amount of wedge exploration and execution time. We perform detailed experimental evaluation of RECEIPT on a shared-memory multicore server. It can process some of the largest publicly available bipartite datasets orders of magnitude faster than the state-of-the-art algorithms - achieving up to 1100x and 64x reduction in the number of thread synchronizations and traversed wedges, respectively. Using 36 threads, RECEIPT can provide up to 17.1x self-relative speedup.
引用
收藏
页码:404 / 417
页数:14
相关论文
共 50 条
  • [31] A Coarse-grained Parallel Algorithm for the Matrix Chain Order Problem
    Higa, Dilson R.
    Stefanes, Marco A.
    HIGH PERFORMANCE COMPUTING SYMPOSIUM 2012 (HPC 2012), 2012, 44 (06): : 58 - 65
  • [32] Coarse-grained distributed parallel programming interface for grid computing
    Wu, YW
    Wang, Q
    Yang, GW
    Zheng, WM
    GRID AND COOPERATIVE COMPUTING, PT 1, 2004, 3032 : 255 - 258
  • [33] Polygonal approximation based on coarse-grained parallel genetic algorithm
    Wu, Zhaobin
    Zhao, Chunxia
    Liu, Bin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71 (71)
  • [34] Mixed-granularity Parallel Coarse-grained Reconfigurable Architecture
    Deng, Jinyi
    Zhang, Linyun
    Wang, Lei
    Liu, Jiawei
    Deng, Kexiang
    Tang, Shibin
    Gu, Jiangyuan
    Han, Boxiao
    Xu, Fei
    Liu, Leibo
    Wei, Shaojun
    Yin, Shouyi
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 343 - 348
  • [35] C3: A Parallel Model for Coarse-Grained Machines
    J Parallel Distrib Comput, 2 (139):
  • [36] Analysis of crossovers and selections in a coarse-grained parallel genetic algorithm
    Katayama, K
    Hirabayashi, H
    Narihisa, H
    MATHEMATICAL AND COMPUTER MODELLING, 2003, 38 (11-13) : 1275 - 1282
  • [37] Efficient Parallel Graph Algorithms for Coarse-Grained Multicomputers and BSP
    F. Dehne
    A. Ferreira
    E. Cáceres
    S. W. Song
    A. Roncato
    Algorithmica, 2002, 33 : 183 - 200
  • [38] A coarse-grained parallel algorithm for spanning tree and connected components
    Cáceres, EN
    Dehne, F
    Mongelli, H
    Song, SW
    Szwarcfiter, JL
    EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 828 - 831
  • [39] PACK/UNPACK on coarse-grained distributed memory parallel machines
    Bae, SJ
    Ranka, S
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 38 (02) : 204 - 216
  • [40] C-3: A parallel model for coarse-grained machines
    Hambrusch, SE
    Khokhar, AA
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 32 (02) : 139 - 154