GraphMineSuite: Enabling High-Performance and Programmable Graph Mining Algorithms with Set Algebra

被引:5
|
作者
Besta, Maciej [1 ]
Vonarburg-Shmaria, Zur [1 ]
Schaffner, Yannick [1 ]
Schwarz, Leonardo [1 ]
Kwasniewski, Grzegorz [1 ]
Gianinazzi, Lukas [1 ]
Beranek, Jakub [2 ]
Janda, Kacper [3 ]
Holenstein, Tobias [1 ]
Leisinger, Sebastian [1 ]
Tatkowski, Peter [1 ]
Ozdemir, Esref [1 ]
Balla, Adrian [1 ]
Copik, Marcin [1 ]
Lindenberger, Philipp [1 ]
Konieczny, Marek [3 ]
Mutlu, Onur [1 ]
Hoefler, Torsten [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] VSB, Ostrava, Czech Republic
[3] AGH Univ Sci & Technol, Krakow, Poland
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 14卷 / 11期
基金
欧洲研究理事会;
关键词
LINK-PREDICTION; MAXIMAL CLIQUES; BENCHMARK SUITE; NETWORKS; SIMILARITY;
D O I
10.14778/3476249.3476252
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose GraphMineSuite (GMS): the first benchmarking suite for graph mining that facilitates evaluating and constructing high-performance graph mining algorithms. First, GMS comes with a benchmark specification based on extensive literature review, prescribing representative problems, algorithms, and datasets. Second, GMS offers a carefully designed software platform for seamless testing of different fine-grained elements of graph mining algorithms, such as graph representations or algorithm subroutines. The platform includes parallel implementations of more than 40 considered baselines, and it facilitates developing complex and fast mining algorithms. High modularity is possible by harnessing set algebra operations such as set intersection and difference, which enables breaking complex graph mining algorithms into simple building blocks that can be separately experimented with. GMS is supported with a broad concurrency analysis for portability in performance insights, and a novel performance metric to assess the throughput of graph mining algorithms, enabling more insightful evaluation. As use cases, we harness GMS to rapidly redesign and accelerate state-of-the-art baselines of core graph mining problems: degeneracy reordering (by up to >2x), maximal clique listing (by up to >9x), k-clique listing (by 1.1x), and subgraph isomorphism (by up to 2.5x), also obtaining better theoretical performance bounds.
引用
收藏
页码:1922 / 1935
页数:14
相关论文
共 50 条
  • [21] PROGRAMMABLE HIGH-PERFORMANCE IIR FILTER CHIP
    WOODS, RF
    FLOYD, G
    WOOD, K
    EVANS, R
    MCCANNY, JV
    IEE PROCEEDINGS-CIRCUITS DEVICES AND SYSTEMS, 1995, 142 (03): : 179 - 185
  • [22] Mixing LU and QR factorization algorithms to design high-performance dense linear algebra solvers
    Faverge, Mathieu
    Herrmann, Julien
    Langou, Julien
    Lowery, Bradley
    Robert, Yves
    Dongarra, Jack
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 85 : 32 - 46
  • [23] High-performance field programmable VLSI processor based on a direct allocation of a control/data flow graph
    Ohsawa, N
    Hariyama, M
    Kameyama, M
    ISVLSI 2000: IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI - NEW PARADIGMS FOR VLSI SYSTEMS DESIGN, 2002, : 95 - 100
  • [24] Enabling Java']Java for high-performance computing
    Kielmann, T
    Hatcher, P
    Bougé, L
    Bal, HE
    COMMUNICATIONS OF THE ACM, 2001, 44 (10) : 110 - 117
  • [25] Reliable Generation of High-Performance Matrix Algebra
    Nelson, Thomas
    Belter, Geoffrey
    Siek, Jeremy G.
    Jessup, Elizabeth
    Norris, Boyana
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2015, 41 (03):
  • [26] Towards High-Performance Computational Algebra with GAP
    Behrends, Reimer
    Konovalov, Alexander
    Linton, Steve
    Luebeck, Frank
    Neunhoeffer, Max
    MATHEMATICAL SOFTWARE - ICMS 2010, 2010, 6327 : 58 - +
  • [27] Graph analysis with high-performance computing
    Hendrickson, Bruce
    Berry, JonatHan W.
    COMPUTING IN SCIENCE & ENGINEERING, 2008, 10 (02) : 14 - 19
  • [28] Graphlt: A High-Performance Graph DSL
    Zhang, Yunming
    Yang, Mengjiao
    Baghdadi, Riyadh
    Kamil, Shoaib
    Shun, Julian
    Amarasinghe, Saman
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2018, 2
  • [29] GraphIt: A High-Performance Graph DSL
    Zhang, Yunming
    Yang, Mengjiao
    Baghdadi, Riyadh
    Kamil, Shoaib
    Shun, Julian
    Amarasinghe, Saman
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2018, 2
  • [30] HIGH-PERFORMANCE PARALLEL GRAPH REDUCTION
    JONES, SLP
    CLACK, C
    SALKILD, J
    LECTURE NOTES IN COMPUTER SCIENCE, 1989, 365 : 193 - 206