Balanced parallel triangle enumeration with an adaptive algorithm

被引:0
|
作者
Abir Farouzi
Xiantian Zhou
Ladjel Bellatreche
Mimoun Malki
Carlos Ordonez
机构
[1] ISAE-ENSMA,
[2] University of Houston,undefined
[3] Ecole Supérieure en Informatique,undefined
来源
关键词
Triangle enumeration; Triangle counting; Graph analytics; Adaptive algorithm; Intelligent data distribution;
D O I
暂无
中图分类号
学科分类号
摘要
Triangle enumeration is a foundation brick for solving harder graph problems related to social networks, the Internet and transportation, to name a few applications. This problem is well studied in the theory literature, but remains an open problem with big data. In this paper, we defend the idea of solving triangle enumeration with SQL queries evaluating the steps of a new adaptive algorithm with linear speedup. Such SQL approach provides scalability beyond RAM limits, automatic parallel processing and more importantly: linear speedup as more machines are added. We present theory results and experimental validation showing our solution works well with large graphs analyzed on a parallel cluster with many machines, producing a balanced workload even with highly skewed degree vertices. We consider two types of distributed systems: (1) a parallel DBMS that evaluates SQL queries, and (2) a parallel HPC cluster calling the MPI library (called via Python). Extensive benchmark experiments with large graphs show our SQL solution offers many advantages over MPI and competing graph analytic systems.
引用
收藏
页码:103 / 141
页数:38
相关论文
共 50 条
  • [1] Balanced parallel triangle enumeration with an adaptive algorithm
    Farouzi, Abir
    Zhou, Xiantian
    Bellatreche, Ladjel
    Malki, Mimoun
    Ordonez, Carlos
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2024, 42 (01) : 103 - 141
  • [2] An Efficient Triangle Enumeration on Parallel and Distributed Frameworks
    Kim, Hongyeon
    Kim, Seunghee
    Min, Jun-Ki
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 545 - 548
  • [3] An Efficient Parallel Triangle Enumeration on the MapReduce Framework
    Kim, Hongyeon
    Min, Jun-Ki
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (10) : 1902 - 1915
  • [4] BEEP: Balanced Efficient subgraph Enumeration in Parallel
    Kawtikwar, Samiran
    Almasri, Mohammad
    Hwu, Wen-mei
    Nagi, Rakesh
    Xiong, Jinjun
    [J]. PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 142 - 152
  • [5] PandaSQL: Parallel Randomized Triangle Enumeration with SQL Queries
    Farouzi, Abir
    Bellatreche, Ladjel
    Ordonez, Carlos
    Pandurangan, Gopal
    Malki, Mimoun
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3377 - 3380
  • [6] Parallel Triangle Counting and Enumeration using Matrix Algebra
    Azad, Ariful
    Buluc, Aydin
    Gilbert, John
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 804 - 811
  • [7] A parallel algorithm for the enumeration of benzenoid hydrocarbons
    Jensen, Iwan
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2009,
  • [8] Balanced Hierarchical Face Clustering Algorithm on Triangle Meshes
    Wang, Zhi
    Wu, Zhongke
    Zhou, Mingquan
    Luo, Yanlin
    [J]. 2009 11TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS, PROCEEDINGS, 2009, : 528 - 531
  • [9] A scalable, parallel algorithm for maximal clique enumeration
    Schmidt, Matthew C.
    Samatova, Nagiza F.
    Thomas, Kevin
    Park, Byung-Hoon
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2009, 69 (04) : 417 - 428
  • [10] A Scalable Parallel Algorithm for Balanced Sampling
    Lee, Alexander
    Walzer-Goldfeld, Stefan
    Zablah, Shukry
    Riondato, Matteo
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12991 - 12992