Efficient Data Loader for Fast Sampling-Based GNN Training on Large Graphs

Cited by: 15
Authors
Bai, Youhui [1]
Li, Cheng [1]
Lin, Zhiqi [1]
Wu, Yufei [1]
Miao, Youshan [2]
Liu, Yunxin [2]
Xu, Yinlong [1, 3]
Affiliations
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Anhui, Peoples R China
[2] Microsoft Res, Beijing 100080, Peoples R China
[3] Anhui Prov Key Lab High Performance Comp, Hefei 230026, Anhui, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Training; Graphics processing units; Loading; Computational modeling; Load modeling; Partitioning algorithms; Deep learning; Graph neural network; cache; large graph; graph partition; pipeline; multi-GPU;
DOI
10.1109/TPDS.2021.3065737
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Emerging graph neural networks (GNNs) have extended the successes of deep learning techniques against datasets like images and texts to more complex graph-structured data. By leveraging GPU accelerators, existing frameworks combine mini-batch and sampling for effective and efficient model training on large graphs. However, this setup faces a scalability issue since loading rich vertex features from CPU to GPU through a limited bandwidth link usually dominates the training cycle. In this article, we propose PaGraph, a novel, efficient data loader that supports general and efficient sampling-based GNN training on single-server with multi-GPU. PaGraph significantly reduces the data loading time by exploiting available GPU resources to keep frequently-accessed graph data with a cache. It also embodies a lightweight yet effective caching policy that takes into account graph structural information and data access patterns of sampling-based GNN training simultaneously. Furthermore, to scale out on multiple GPUs, PaGraph develops a fast GNN-computation-aware partition algorithm to avoid cross-partition access during data-parallel training and achieves better cache efficiency. Finally, it overlaps data loading and GNN computation for further hiding loading costs. Evaluations on two representative GNN models, GCN and GraphSAGE, using two sampling methods, Neighbor and Layer-wise, show that PaGraph could eliminate the data loading time from the GNN training pipeline, and achieve up to 4.8x performance speedup over the state-of-the-art baselines. Together with preprocessing optimization, PaGraph further delivers up to 16.0x end-to-end speedup.
Pages: 2541-2556
Number of pages: 16