Summarized bit batch-based triangle listing in massive graphs

Cited by: 3
Authors
Rasel, Mostofa Kamal [1]
Elena, En [1]
Lee, Young-Koo [1]
Affiliations
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul, South Korea
Funding
National Research Foundation of Singapore
Keywords
Bitmap compression; CPU parallelism; Triangle listing
DOI
10.1016/j.ins.2018.02.018
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Triangles in massive graphs are used by many graph algorithms, for example to find highly relevant vertices for dense subgraph mining, to measure the clustering coefficient, and to compute the transitivity for network analysis. In-memory algorithms cannot be used for triangle listing in massive graphs because the graphs are too large to fit into memory. External memory-based techniques address this problem by focusing on I/O efficiency to improve performance. However, triangulation is a CPU-intensive process that iteratively joins lists of neighbors to determine the adjacent vertices in each triangle. Therefore, the cost of a triangle listing algorithm on a massive graph is dominated by the join operations among the lists of neighbors. In this paper, we propose a disk-based triangle listing approach that joins the lists of neighbors efficiently by exploiting CPU parallelism through bitwise operations. We represent the lists of neighbors as bit vectors and compress them using our proposed summarized bit batch, which allows the bitwise operations to be performed directly on the compressed data. Our proposed technique slices a bit vector into a series of word-length bit batches and summarizes it by pruning the bit batches that contain only 0-bits. Our proposed triangle listing approach then accesses the summarized bit batches asynchronously and joins them using bitwise operations. Our proposed technique achieves 40% higher compression for some real-world datasets compared to the classic compression technique. The triangulation technique using the summarized bit batches also significantly outperforms the existing solutions in terms of wall clock time. © 2018 Elsevier Inc. All rights reserved.
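The idea described in the abstract (slice a neighbor bit vector into word-length batches, drop the all-zero batches, and join two lists with bitwise AND on the batches that remain) can be illustrated with a small in-memory sketch. This is not the authors' implementation: the 64-bit batch width, the dictionary layout keyed by batch index, and the names `summarize`, `join`, and `list_triangles` are assumptions made for illustration, and the disk-based and asynchronous parts of the approach are left out.

```python
WORD = 64  # assumed batch width in bits (one machine word)


def summarize(neighbors):
    """Return {batch index: non-zero WORD-bit batch}; all-zero batches are never stored."""
    batches = {}
    for v in neighbors:
        idx, bit = divmod(v, WORD)
        batches[idx] = batches.get(idx, 0) | (1 << bit)
    return batches


def join(a, b):
    """Intersect two summarized vectors with one bitwise AND per shared batch."""
    common = []
    for idx in a.keys() & b.keys():      # batches pruned on either side are skipped
        word = a[idx] & b[idx]
        while word:
            low = word & -word           # isolate the lowest set bit
            common.append(idx * WORD + low.bit_length() - 1)
            word ^= low                  # clear that bit
    return sorted(common)


def list_triangles(edges):
    """List each triangle once as (u, v, w) with u < v < w."""
    higher = {}                          # vertex -> neighbors with a larger id
    for u, v in edges:
        if u == v:
            continue                     # ignore self-loops
        u, v = min(u, v), max(u, v)
        higher.setdefault(u, set()).add(v)
    summary = {u: summarize(ns) for u, ns in higher.items()}
    triangles = []
    for u in sorted(higher):
        for v in sorted(higher[u]):
            if v in summary:             # v needs higher neighbors to close a triangle
                for w in join(summary[u], summary[v]):
                    triangles.append((u, v, w))
    return triangles
```

In this sketch a vertex whose neighbors are 1, 2, and 200 is stored as two batches (index 0 and index 3) rather than four, and each AND compares up to 64 candidate neighbors at once, which is the kind of word-level parallelism the abstract refers to.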
Pages: 1-17
Number of pages: 17