gSampler: General and Efficient GPU-based Graph Sampling for Graph Learning

Cited by: 2
Authors
Gong, Ping [1, 3, 4]
Liu, Renjie [2, 3]
Mao, Zunyao [2, 3]
Cai, Zhenkun [3 ]
Yan, Xiao [2 ]
Li, Cheng [4 ]
Wang, Minjie [3 ]
Li, Zhuozhao [2 ]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Southern Univ Sci & Technol, Shenzhen, Peoples R China
[3] AWS Shanghai Lab, Shanghai, Peoples R China
[4] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Graph Neural Network; Graph Sampling; Graph Learning; Graphics Processing Unit
DOI
10.1145/3600006.3613168
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Graph sampling prepares training samples for graph learning and can dominate the training time. As sampling algorithms grow more diverse and complex, existing sampling frameworks fall short in both generality of expression and efficiency of execution. To close this gap, we conduct a comprehensive study of 15 popular graph sampling algorithms to motivate the design of gSampler, a general and efficient GPU-based graph sampling framework. gSampler models graph sampling using a general 4-step Extract-Compute-Select-Finalize (ECSF) programming model, provides a set of matrix-centric APIs that make complex graph sampling algorithms easy to express, and incorporates a data-flow intermediate representation (IR) that translates high-level API code for efficient GPU execution. We demonstrate that implementing graph sampling algorithms with gSampler is easy and intuitive. We also conduct extensive experiments with 7 algorithms, 4 graph datasets, and 2 hardware configurations. The results show that gSampler achieves sampling speedups of 1.14x to 32.7x (6.54x on average) over state-of-the-art GPU-based graph sampling systems such as DGL, which translates into an overall time reduction of over 40% for graph learning. gSampler is open source at https://tinyurl.com/29twthd4.
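To make the four ECSF step names concrete, the following is a minimal CPU-only sketch of one layer of fanout-limited neighbor sampling split into Extract, Compute, Select, and Finalize stages. It is not gSampler's API: the function names (`extract`, `compute`, `select`, `finalize`, `sample_layer`), the dictionary-based adjacency format, and the interpretation of what each step does are assumptions made for illustration, since the abstract only names the steps. The Select stage uses weighted sampling without replacement via random keys (the Efraimidis-Spirakis scheme), which reduces to uniform sampling when all weights are equal.

```python
import random

def extract(adj, seeds):
    # Extract (assumed meaning): pull the neighbor lists of the seed nodes,
    # i.e. the columns of the adjacency matrix indexed by the seeds.
    return {v: list(adj.get(v, [])) for v in seeds}

def compute(sub):
    # Compute (assumed meaning): attach a positive sampling weight to every
    # candidate edge. Uniform weights here; a biased sampler would differ.
    return {v: [1.0] * len(nbrs) for v, nbrs in sub.items()}

def select(sub, weights, fanout, rng):
    # Select (assumed meaning): keep at most `fanout` neighbors per seed,
    # without replacement, by ranking edges on the key u ** (1 / w).
    picked = {}
    for v, nbrs in sub.items():
        keyed = [(rng.random() ** (1.0 / w), u) for u, w in zip(nbrs, weights[v])]
        keyed.sort(reverse=True)
        picked[v] = [u for _, u in keyed[:fanout]]
    return picked

def finalize(picked):
    # Finalize (assumed meaning): emit the sampled (src, dst) edge list and
    # the deduplicated frontier that would seed the next layer.
    edges = [(u, v) for v, nbrs in picked.items() for u in nbrs]
    frontier = sorted({u for u, _ in edges})
    return edges, frontier

def sample_layer(adj, seeds, fanout, seed=0):
    # Chain the four stages for a single sampling layer.
    rng = random.Random(seed)
    sub = extract(adj, seeds)
    weights = compute(sub)
    picked = select(sub, weights, fanout, rng)
    return finalize(picked)

if __name__ == "__main__":
    # Toy graph: node -> list of in-neighbors (hypothetical data).
    toy = {0: [1, 2, 3, 4], 1: [0], 2: []}
    print(sample_layer(toy, seeds=[0, 1, 2], fanout=2))
```

Keeping each stage as a separate function mirrors the idea that a sampler can be described as a pipeline of small operators, which is the kind of structure a data-flow IR could then analyze and fuse; how gSampler actually does this is described in the paper, not in this sketch.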
Pages: 562-578
Number of pages: 17