DPTL plus : Efficient Parallel Triangle Listing on Batch -Dynamic Graphs

被引:3
|
作者
Yu, Michael [1 ]
Qin, Lu [2 ]
Zhang, Ying [2 ]
Zhang, Wenjie [1 ]
Lin, Xuemin [1 ]
机构
[1] Univ New South Wales, Sydney, NSW, Australia
[2] Univ Technol Sydney, AAII, Ultimo, Australia
关键词
D O I
10.1109/ICDE51399.2021.00119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Triangle listing is an important topic in many practical applications. We have observed that this problem has not yet been studied systematically in the context of batchdynamic graphs. In this paper, we aim to fill this gap by developing novel and efficient parallel solutions. Specifically, given a graph G and a batch-update of edges B, we report the updated triangles (deleted triangles and new triangles) resulting from the batch of updates. We notice that it is cost expensive to directly apply state-of-the-art triangle listing algorithms because they are designed to enumerate the complete set of triangles from a given graph, whereas only the updated ones are the relevant output for our problem setting. In this paper, we developed an efficient algorithm, namely DPTL, based on a newly designed orientation technique, which only outputs the updated triangles while ensuring that each triangle solution is identified without any duplicate solutions. We follow up by taking advantage of a graph's degree distributions and designed a more sophisticated algorithm, namely DPTL+. We show that DPTL+ can achieve the best performance in terms of both practical performance and theoretical time complexity. Our comprehensive experiments over 28 real-life large graphs show the superior performance of the DPTL+ algorithm when compared against DPTL and two baseline solutions. Theoretically, we also show that DPTL+ has a time complexity of 8(E min{deg (u), deg (v)} m) where deg (x) is the degree of a vertex x, and m is the number of edges adjacent to the vertices in the batch-update. This time complexity is more promising than that of other solutions.
引用
收藏
页码:1332 / 1343
页数:12
相关论文
共 50 条
  • [1] PDTL: Parallel and Distributed Triangle Listing for Massive Graphs
    Giechaskiel, Ilias
    Panagopoulos, George
    Yoneki, Eiko
    [J]. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 370 - 379
  • [2] An efficient exact algorithm for triangle listing in large graphs
    Lagraa, Sofiane
    Seba, Hamida
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (05) : 1350 - 1369
  • [3] Efficient Triangle Listing for Billion-Scale Graphs
    Zhang, Hao
    Zhu, Yuanyuan
    Qin, Lu
    Cheng, Hong
    Yu, Jeffrey Xu
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 813 - 822
  • [4] An efficient exact algorithm for triangle listing in large graphs
    Sofiane Lagraa
    Hamida Seba
    [J]. Data Mining and Knowledge Discovery, 2016, 30 : 1350 - 1369
  • [5] Summarized bit batch-based triangle listing in massive graphs
    Rasel, Mostofa Kamal
    Elena, En
    Lee, Young-Koo
    [J]. INFORMATION SCIENCES, 2018, 441 : 1 - 17
  • [6] Exact and Parallel Triangle Counting in Dynamic Graphs
    Makkar, Devavret
    Bader, David A.
    Green, Oded
    [J]. 2017 IEEE 24TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2017, : 2 - 12
  • [7] Efficient MapReduce algorithms for triangle listing in billion-scale graphs
    Yuanyuan Zhu
    Hao Zhang
    Lu Qin
    Hong Cheng
    [J]. Distributed and Parallel Databases, 2017, 35 : 149 - 176
  • [8] Efficient MapReduce algorithms for triangle listing in billion-scale graphs
    Zhu, Yuanyuan
    Zhang, Hao
    Qin, Lu
    Cheng, Hong
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2017, 35 (02) : 149 - 176
  • [9] DDSL: Efficient Subgraph Listing on Distributed and Dynamic Graphs
    Jian, Xun
    Wang, Yue
    Lei, Xiayu
    Shen, Yanyan
    Chen, Lei
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT II, 2020, 12113 : 632 - 640
  • [10] Bermuda: An Efficient MapReduce Triangle Listing Algorithm for Web-Scale Graphs
    Xiao, Dongqing
    Eltabakh, Mohamed
    Kong, Xiangnan
    [J]. 28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM) 2016), 2016,