NDMiner: Accelerating Graph Pattern Mining Using Near Data Processing

被引:5
|
作者
Talati, Nishil [1 ]
Ye, Haojie [1 ]
Yang, Yichen [1 ]
Belayneh, Leul [1 ]
Chen, Kuan-Yu [1 ]
Blaauw, David [1 ]
Mudge, Trevor [1 ]
Dreslinski, Ronald [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
Graph pattern mining; near data processing; hardware-software co-design; RAM;
D O I
10.1145/3470496.3527437
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph Pattern Mining (GPM) algorithms mine structural patterns in graphs. The performance of GPM workloads is bottlenecked by control flow and memory stalls. This is because of data-dependent branches used in set intersection and difference operations that dominate the execution time. This paper first conducts a systematic GPM workload analysis and uncovers four new observations to inform the optimization effort. First, GPM workloads mostly fetch inputs of costly set operations from different memory banks. Second, to avoid redundant computation, modern GPM workloads employ symmetry breaking that discards several data reads, resulting in cache pollution and wasted DRAM bandwidth. Third, sparse pattern mining algorithms perform redundant memory reads and computations. Fourth, GPM workloads do not fully utilize the in-DRAM data parallelism. Based on these observations, this paper presents NDMiner, a Near Data Processing (NDP) architecture that improves the performance of GPM workloads. To reduce in-memory data transfer of fetching data from different memory banks, NDMiner integrates compute units to offload set operations in the buffer chip of DRAM. To alleviate the wasted memory bandwidth caused by symmetry breaking, NDMiner integrates a load elision unit in hardware that detects the satisfiability of symmetry breaking constraints and terminates unnecessary loads. To optimize the performance of sparse pattern mining, NDMiner employs compiler optimizations and maps reduced reads and composite computation to NDP hardware that improves algorithmic efficiency of sparse GPM. Finally, NDMiner proposes a new graph remapping scheme in memory and a hardware-based set operation reordering technique to best optimize bank, rank, and channel-level parallelism in DRAM. To orchestrate NDP computation, this paper presents design modifications at the host ISA, compiler, and memory controller. We compare the performance of NDMiner with state-of-the-art software and hardware baselines using a mix of dense and sparse GPM algorithms. Our evaluation shows that NDMiner significantly outperforms software and hardware baselines by 6.4x and 2.5x, on average, while incurring a negligible area overhead on CPU and DRAM.
引用
下载
收藏
页码:146 / 159
页数:14
相关论文
共 50 条
  • [1] Accelerating Unstructured Graph Data Processing on GPUs
    Pan, Xiaohui
    2ND INTERNATIONAL CONFERENCE ON SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS (SMTA 2015), 2015, : 29 - 33
  • [2] GraphR: Accelerating Graph Processing Using ReRAM
    Song, Linghao
    Zhuo, Youwei
    Qian, Xuehai
    Li, Hai
    Chen, Yiran
    2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 531 - 543
  • [3] GraphIdx: An efficient indexing technique for accelerating graph data mining
    Rasel, Mostofa Kamal
    Huq, Mohammad Rezwanul
    Arifuzzaman, Mohammad
    SOFTWARE IMPACTS, 2024, 20
  • [4] Design Pattern Mining Using Graph Matching
    LI Qing-hua 1
    2. Department of Computer Science
    Wuhan University Journal of Natural Sciences, 2004, (04) : 444 - 448
  • [5] Frequent pattern mining using bipartite graph
    Chai, Duck Jin
    Jin, Long
    Hwang, Buhyun
    Ryu, Keun Ho
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 182 - +
  • [6] Design pattern mining using graph matching
    Li, Qing-Hua
    Zhang, Zhi-Xiang
    Ben, Ke-Rong
    Wuhan University Journal of Natural Sciences, 2004, 9 (04) : 444 - 448
  • [7] GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs
    Liu, Juelin
    Polisetty, Sandeep
    Guan, Hui
    Serafini, Marco
    2023 32ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT, 2023, : 211 - 224
  • [8] Survival Prediction from Longitudinal Health Insurance Data using Graph Pattern Mining
    Ren, Yongjian
    Zhang, Kun
    Shi, Yuliang
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1104 - 1108
  • [9] ACCELERATING BIG DATA PROCESSING CHAIN IN IMAGE INFORMATION MINING USING A HYBRID HPC APPROACH
    Kurte, Kuldeep R.
    Bhangale, Ujwala M.
    Durbha, Surya S.
    King, Roger L.
    Younan, Nicolas H.
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 7597 - 7600
  • [10] Accelerating Graph Processing With Lightweight Learning-Based Data Reordering
    Zou, Mo
    Zhang, Mingzhe
    Wang, Rujia
    Sun, Xian-He
    Ye, Xiaochun
    Fan, Dongrui
    Tang, Zhimin
    IEEE COMPUTER ARCHITECTURE LETTERS, 2022, 21 (01) : 5 - 8