Sparsity-Aware Communication for Distributed Graph Neural Network Training

被引:0
|
作者
Mukhopadhyay, Ujjaini [1 ]
Tripathy, Alok [1 ]
Selvitopi, Oguz [2 ]
Yelick, Katherine [1 ]
Buluc, Aydin [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Lawrence Berkeley Nat Lab, Berkeley, CA USA
关键词
MATRIX MULTIPLICATION;
D O I
10.1145/3673038.3673152
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph Neural Networks (GNNs) are a computationally efficient method to learn embeddings and classifications on graph data. However, GNN training has low computational intensity, making communication costs the bottleneck for scalability. Sparse-matrix dense-matrix multiplication (SpMM) is the core computational operation in full-graph training of GNNs. Previous work parallelizing this operation focused on sparsity-oblivious algorithms, where matrix elements are communicated regardless of the sparsity pattern. This leads to a predictable communication pattern that can be overlapped with computation and enables the use of collective communication operations at the expense of wasting significant bandwidth by communicating unnecessary data. We develop sparsity-aware algorithms that tackle the communication bottlenecks in GNN training with three novel approaches. First, we communicate only the necessary matrix elements. Second, we utilize a graph partitioning model to reorder the matrix and drastically reduce the amount of communicated elements. Finally, we address the high load imbalance in communication with a tailored partitioning model, which minimizes both the total communication volume and the maximum sending volume. We further couple these sparsity-exploiting approaches with a communication-avoiding approach (1.5D parallel SpMM) in which submatrices are replicated to reduce communication. We explore the tradeoffs of these combined optimizations and show up to 14x improvement on 256 GPUs and on some instances reducing communication to almost zero resulting in a communication-free parallel training relative to a popular GNN framework based on communication-oblivious SpMM.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 50 条
  • [31] PRISM: Pre-training RF Signals in Sparsity-aware Masked Autoencoders
    Fang, Liang
    Song, Ruiyuan
    Lu, Zhi
    Zhang, Dongheng
    Hu, Yang
    Sun, Qibin
    Chen, Yan
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 2109 - 2118
  • [32] Graph Attention Neural Network Distributed Model Training
    Esmaeilzadeh, Armin
    Kambar, Mina Esmail Zadeh Nojoo
    Heidari, Maryam
    2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 447 - 452
  • [33] Sparsity-Aware In-Memory Neuromorphic Computing Unit With Configurable Topology of Hybrid Spiking and Artificial Neural Network
    Liu, Ying
    Chen, Zhiyuan
    Zhao, Wentao
    Zhao, Tianhao
    Jia, Tianyu
    Wang, Zhixuan
    Huang, Ru
    Ye, Le
    Ma, Yufei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (06) : 2660 - 2673
  • [34] An improved sparsity-aware normalized least-mean-square scheme for underwater communication
    Kumar, Anand
    Kumar, Prashant
    ETRI JOURNAL, 2023, 45 (03) : 379 - 393
  • [35] Sparsity-Aware Estimation of Nonlinear Volterra Kernels
    Kekatos, Vassilis
    Angelosante, Daniele
    Giannakis, Georgios B.
    2009 3RD IEEE INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2009, : 129 - 132
  • [36] Sparsity-Aware Estimation of CDMA System Parameters
    Daniele Angelosante
    Emanuele Grossi
    Georgios B. Giannakis
    Marco Lops
    EURASIP Journal on Advances in Signal Processing, 2010
  • [37] SPARSITY-AWARE ESTIMATION OF CDMA SYSTEM PARAMETERS
    Angelosante, D.
    Grossi, E.
    Giannakis, G. B.
    Lops, M.
    SPAWC: 2009 IEEE 10TH WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, 2009, : 697 - +
  • [38] SPARSITY-AWARE TDOA LOCALIZATION OF MULTIPLE SOURCES
    Jamali-Rad, Hadi
    Leus, Geert
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 4021 - 4025
  • [39] Automatic Sparsity-Aware Recognition for Keypoint Detection
    Xie, Yurui
    Guan, Ling
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, : 127 - 134
  • [40] Special section on sparsity-aware signal processing
    Sawada, H., 1917, Institute of Electronics, Information and Communication, Engineers, IEICE (E96-A):