GraphMineSuite: Enabling High-Performance and Programmable Graph Mining Algorithms with Set Algebra

被引:5
|
作者
Besta, Maciej [1 ]
Vonarburg-Shmaria, Zur [1 ]
Schaffner, Yannick [1 ]
Schwarz, Leonardo [1 ]
Kwasniewski, Grzegorz [1 ]
Gianinazzi, Lukas [1 ]
Beranek, Jakub [2 ]
Janda, Kacper [3 ]
Holenstein, Tobias [1 ]
Leisinger, Sebastian [1 ]
Tatkowski, Peter [1 ]
Ozdemir, Esref [1 ]
Balla, Adrian [1 ]
Copik, Marcin [1 ]
Lindenberger, Philipp [1 ]
Konieczny, Marek [3 ]
Mutlu, Onur [1 ]
Hoefler, Torsten [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] VSB, Ostrava, Czech Republic
[3] AGH Univ Sci & Technol, Krakow, Poland
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 14卷 / 11期
基金
欧洲研究理事会;
关键词
LINK-PREDICTION; MAXIMAL CLIQUES; BENCHMARK SUITE; NETWORKS; SIMILARITY;
D O I
10.14778/3476249.3476252
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose GraphMineSuite (GMS): the first benchmarking suite for graph mining that facilitates evaluating and constructing high-performance graph mining algorithms. First, GMS comes with a benchmark specification based on extensive literature review, prescribing representative problems, algorithms, and datasets. Second, GMS offers a carefully designed software platform for seamless testing of different fine-grained elements of graph mining algorithms, such as graph representations or algorithm subroutines. The platform includes parallel implementations of more than 40 considered baselines, and it facilitates developing complex and fast mining algorithms. High modularity is possible by harnessing set algebra operations such as set intersection and difference, which enables breaking complex graph mining algorithms into simple building blocks that can be separately experimented with. GMS is supported with a broad concurrency analysis for portability in performance insights, and a novel performance metric to assess the throughput of graph mining algorithms, enabling more insightful evaluation. As use cases, we harness GMS to rapidly redesign and accelerate state-of-the-art baselines of core graph mining problems: degeneracy reordering (by up to >2x), maximal clique listing (by up to >9x), k-clique listing (by 1.1x), and subgraph isomorphism (by up to 2.5x), also obtaining better theoretical performance bounds.
引用
收藏
页码:1922 / 1935
页数:14
相关论文
共 50 条
  • [1] Playa: High-performance programmable linear algebra
    Howle, Victoria E.
    Kirby, Robert C.
    Long, Kevin
    Brennan, Brian
    Kennedy, Kimberly
    SCIENTIFIC PROGRAMMING, 2012, 20 (03) : 257 - 273
  • [2] ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations
    Besta, Maciej
    Miglioli, Cesare
    Labini, Paolo Sylos
    Tetek, Jakub
    Iff, Patrick
    Kanakagiri, Raghavendra
    Ashkboos, Saleh
    Janda, Kacper
    Podstawski, Michal
    Kwasniewski, Grzegorz
    Gleinig, Niels
    Vella, Flavio
    Mutlu, Onur
    Hoefler, Torsten
    SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [3] Automatic Code Generation for High-Performance Graph Algorithms
    Peng, Zhen
    Ashraf, Rizwan A.
    Guo, Luanzheng
    Tian, Ruiqin
    Kestor, Gokcen
    2023 32ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT, 2023, : 14 - 26
  • [5] Graph BLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
    Yang, Carl
    Buluc, Aydin
    Owens, John D.
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2022, 48 (01):
  • [6] Exploring PIM Architecture for High-Performance Graph Pattern Mining
    Su, Jiya
    He, Linfeng
    Jiang, Peng
    Wang, Rujia
    IEEE COMPUTER ARCHITECTURE LETTERS, 2021, 20 (02) : 114 - 117
  • [7] High-performance graph algorithms from parallel sparse matrices
    Gilbert, John R.
    Reinhardt, Steve
    Shah, Viral B.
    APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2007, 4699 : 260 - +
  • [8] Enabling High-Performance Computing as a Service
    AbdelBaky, Moustafa
    Parashar, Manish
    Kim, Hyunjoo
    Jordan, Kirk E.
    Sachdeva, Vipin
    Sexton, James
    Jamjoom, Hani
    Shae, Zon-Yin
    Pencheva, Gergina
    Tavakoli, Reza
    Wheeler, Mary F.
    COMPUTER, 2012, 45 (10) : 72 - 80
  • [9] High-performance linear algebra algorithms using new generalized data structures for matrices
    Gustavson, FG
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2003, 47 (01) : 31 - 55
  • [10] LINEAR ALGEBRA ON HIGH-PERFORMANCE COMPUTERS
    DONGARRA, JJ
    SORENSEN, DC
    APPLIED MATHEMATICS AND COMPUTATION, 1986, 20 (1-2) : 57 - 88