AutoMine: Harmonizing High-Level Abstraction and High Performance for Graph Mining

被引:50
|
作者
Mawhirter, Daniel [1 ]
Wu, Bo [1 ]
机构
[1] Colorado Sch Mines, Golden, CO 80401 USA
关键词
Graph mining; graph pattern matching; compiler; FRAMEWORK; ANALYTICS; COMPILER;
D O I
10.1145/3341301.3359633
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Graph mining algorithms that aim at identifying structural patterns of graphs are typically more complex than graph computation algorithms such as breadth first search. Researchers have implemented several systems with high-level and flexible interfaces customized for tackling graph mining problems. However, we find that for triangle counting, one of the simplest graph mining problems, such systems can be several times slower than a single-threaded implementation of a straightforward algorithm. In this paper, we reveal the root causes of the severe inefficiencies of state-of-the-art graph mining systems and the challenges to address the performance problems. We build AutoMine, a single-machine system to provide both highlevel interfaces and high performance for large-scale graph mining applications. The novelty of AutoMine comes from 1) a new representation of subgraph patterns and 2) compilation techniques that automatically generate efficient mining code with minimized memory consumption from a highlevel abstraction. We have extensively evaluated AutoMine against 3 graph mining systems on 8 real-world graphs of different scales. Our experimental results show that AutoMine often produces several orders of magnitude better performance and can process very large graphs existing systems cannot handle.
引用
收藏
页码:509 / 523
页数:15
相关论文
共 50 条
  • [31] High-Level Web Data Abstraction Using Language Integrated Query
    Misek, Jakub
    Zavoral, Filip
    [J]. INTELLIGENT DISTRIBUTED COMPUTING IV, 2010, 315 : 13 - 22
  • [32] High-level observability for effective high-level ATPG
    Corno, Fulvio
    Sonza Reorda, Matteo
    Squillero, Giovanni
    [J]. Proceedings of the IEEE VLSI Test Symposium, 2000, : 411 - 416
  • [33] From High-Level Specification to High-Performance Code
    Franchetti, Franz
    Moura, Jose M. F.
    Padua, David A.
    Dongarra, Jack
    [J]. PROCEEDINGS OF THE IEEE, 2018, 106 (11) : 1875 - 1878
  • [34] Performance Analysis with High-Level Languages for High-Performance Reconfigurable Computing
    Curreri, John
    Koehler, Seth
    Holland, Brian
    George, Alan D.
    [J]. PROCEEDINGS OF THE SIXTEENTH IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2008, : 23 - 30
  • [35] Multiple abstraction level performance data mining
    Pierce, DB
    Rover, DT
    [J]. PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2001, : 407 - 413
  • [36] LogNIC: A High-Level Performance Model for SmartNICs
    Guo, Zerui
    Lin, Jiaxin
    Bai, Yuebin
    Kim, Daehyeok
    Swift, Michael
    Akella, Aditya
    Liu, Ming
    [J]. 56TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2023, 2023, : 916 - 929
  • [37] SynDFG: Synthetic Dataflow Graph Generator for High-level Synthesis
    Sinha, Sharad
    Zhang, Wei
    [J]. PROCEEDINGS OF THE SIXTH ASIA SYMPOSIUM ON QUALITY ELECTRONIC DESIGN ASQED 2015, 2015, : 50 - 55
  • [38] Control and data flow graph extraction for high-level synthesis
    Namballa, R
    Ranganathan, N
    Ejnioui, A
    [J]. VLSI 2004: IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS, 2004, : 187 - 192
  • [39] A graph-based framework for High-level test synthesis
    Bashari, Ali Pourghaffari
    Pourmozafari, Saadat
    [J]. WORLD CONGRESS ON ENGINEERING 2007, VOLS 1 AND 2, 2007, : 486 - +
  • [40] Extending High-Level Synthesis with High-Performance Computing Performance Visualization
    Huthmann, Jens
    Podobas, Artur
    Sommer, Lukas
    Koch, Andreas
    Sano, Kentaro
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 371 - 380