Understanding the SIMD Efficiency of Graph Traversal on GPU

被引:0
|
作者
Cheng, Yichao [1 ]
An, Hong [1 ]
Chen, Zhitao [1 ]
Li, Feng [1 ]
Wang, Zhaohui [1 ]
Jiang, Xia [1 ]
Peng, Yi [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
关键词
BFS; GPU; irregular computation; graph topology; SIMD underutilization; SIMD efficiency; BREADTH-1ST SEARCH; ALGORITHMS; PARALLELISM;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph is a widely used data structure and graph algorithms, such as breadth-first search (BFS), are regarded as key components in a great number of applications. Recent studies have attempted to accelerate graph algorithms on highly parallel graphics processing unit (GPU). Although many graph algorithms based on large graphs exhibit abundant parallelism, their performance on GPU still faces formidable challenges, one of which is to map the irregular computation onto GPU's vectorized execution model. In this paper, we investigate the link between graph topology and performance of BFS on GPU. We introduce a novel model to analyze the components of SIMD underutilization. We show that SIMD lanes are wasted either due to the workload imbalance between tasks, or to the heterogeneity of each task. We also develop corresponding metrics to quantify the SIMD efficiency for BFS on GPU. Finally, we demonstrate the applicability of the metrics by using them to profile the performance for different mapping strategies.
引用
收藏
页码:42 / 56
页数:15
相关论文
共 50 条
  • [1] Scalable GPU Graph Traversal
    Merrill, Duane
    Garland, Michael
    Grimshaw, Andrew
    ACM SIGPLAN NOTICES, 2012, 47 (08) : 117 - 127
  • [2] Excavating the Potential of GPU for Accelerating Graph Traversal
    Wang, Pengyu
    Zhang, Lu
    Li, Chao
    Guo, Minyi
    2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 221 - 230
  • [3] GPU-based Graph Traversal on Compressed Graphs
    Sha, Mo
    Li, Yuchen
    Tan, Kian-Lee
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 775 - 792
  • [4] Improving High-Performance GPU Graph Traversal with Compression
    Kaczmarski, Krzysztof
    Przymus, Piotr
    Rzazewski, Pawel
    NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS II, 2015, 312 : 201 - 214
  • [5] Parallelizing Big De Bruijn Graph Traversal for Genome Assembly on GPU Clusters
    Qiu, Shuang
    Feng, Zonghao
    Luo, Qiong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 466 - 470
  • [6] Understanding parallelism in graph traversal on multi-core clusters
    Lv, Huiwei
    Tan, Guangming
    Chen, Mingyu
    Sun, Ninghui
    COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2013, 28 (2-3): : 193 - 201
  • [7] Graph traversal and graph transformation
    Holdsworth, JJ
    THEORETICAL COMPUTER SCIENCE, 2004, 321 (2-3) : 215 - 231
  • [8] A GPU-parallel Algorithm for Fast Hybrid BFS-DFS Graph Traversal
    Maratea, Antonio
    Marcellino, Livia
    Duraccio, Vincenzo
    2017 13TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS (SITIS), 2017, : 450 - 457
  • [9] CPU-Style SIMD Ray Traversal on GPUs
    Lier, Alexander
    Stamminger, Marc
    Selgrad, Kai
    HIGH-PERFORMANCE GRAPHICS 2018, 2018,
  • [10] CPU-Style SIMD Ray Traversal on GPUs
    Lier, Alexander
    Stamminger, Marc
    Selgrad, Kai
    HIGH-PERFORMANCE GRAPHICS 2018, 2018,