Sparsity-Aware Communication for Distributed Graph Neural Network Training

被引:0
|
作者
Mukhopadhyay, Ujjaini [1 ]
Tripathy, Alok [1 ]
Selvitopi, Oguz [2 ]
Yelick, Katherine [1 ]
Buluc, Aydin [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Lawrence Berkeley Nat Lab, Berkeley, CA USA
关键词
MATRIX MULTIPLICATION;
D O I
10.1145/3673038.3673152
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph Neural Networks (GNNs) are a computationally efficient method to learn embeddings and classifications on graph data. However, GNN training has low computational intensity, making communication costs the bottleneck for scalability. Sparse-matrix dense-matrix multiplication (SpMM) is the core computational operation in full-graph training of GNNs. Previous work parallelizing this operation focused on sparsity-oblivious algorithms, where matrix elements are communicated regardless of the sparsity pattern. This leads to a predictable communication pattern that can be overlapped with computation and enables the use of collective communication operations at the expense of wasting significant bandwidth by communicating unnecessary data. We develop sparsity-aware algorithms that tackle the communication bottlenecks in GNN training with three novel approaches. First, we communicate only the necessary matrix elements. Second, we utilize a graph partitioning model to reorder the matrix and drastically reduce the amount of communicated elements. Finally, we address the high load imbalance in communication with a tailored partitioning model, which minimizes both the total communication volume and the maximum sending volume. We further couple these sparsity-exploiting approaches with a communication-avoiding approach (1.5D parallel SpMM) in which submatrices are replicated to reduce communication. We explore the tradeoffs of these combined optimizations and show up to 14x improvement on 256 GPUs and on some instances reducing communication to almost zero resulting in a communication-free parallel training relative to a popular GNN framework based on communication-oblivious SpMM.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 50 条
  • [41] Sparsity-Aware Estimation of CDMA System Parameters
    Angelosante, Daniele
    Grossi, Emanuele
    Giannakis, Georgios B.
    Lops, Marco
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [42] Sparsity-aware multitarget localisation for distributed MIMO radar against phase synchronisation mismatch
    Sun, Bin
    Chen, Haowen
    Zou, Huanxin
    IET COMMUNICATIONS, 2016, 10 (17) : 2269 - 2275
  • [43] A Simple Sparsity-aware Feature LMS Algorithm
    Chaves, Gabriel S.
    Lima, Markus V. S.
    Yazdanpanah, Hamed
    Diniz, Paulo S. R.
    Ferreira, Tadeu N.
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [44] Sparsity-Aware Distributed Conjugate Gradient Algorithms for Parameter Estimation over Sensor Networks
    Miller, Tamara Guerra
    Xu, Songcen
    de lamare, Rodrigo C.
    Nascimento, Vitor H.
    Zakharov, Yuriy
    2015 49TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2015, : 1556 - 1560
  • [45] SPRITE: Sparsity-Aware Neural Processing Unit with Constant Probability of Index-Matching
    Ryu, Sungju
    Oh, Youngtaek
    Kim, Taesu
    Ahn, Daehyun
    Kim, Jae-Joon
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 663 - 666
  • [46] Sparsity-aware distributed adaptive filtering with robustness against impulsive noise and low SNR
    do Carmo, Rafael Moura
    de R. Ferreira, Guilherme
    Campelo, Pedro Henrique
    Resende, Leonardo C.
    de Lima, Leonardo
    Henriques, Felipe da Rocha
    Haddad, Diego Barreto
    TELECOMMUNICATION SYSTEMS, 2024, 86 (03) : 451 - 461
  • [47] Sparsity-Aware Estimation of Nonlinear Volterra Kernels
    Kekatos, Vassilis
    Angelosante, Daniele
    Giannakis, Georgios B.
    2009 3RD IEEE INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2009), 2009, : 129 - 132
  • [48] Robust and sparsity-aware adaptive filters: A Review
    Kumar, Krishna
    Pandey, Rajlaxmi
    Karthik, M. L. N. S.
    Bhattacharjee, Sankha Subhra
    George, Nithin V.
    SIGNAL PROCESSING, 2021, 189 (189)
  • [49] Sparsity-Aware Reachability Computation for Massive Graphs
    Kim, Sung-Soo
    Kang, Young-Min
    Kimt, Young-Kuk
    2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022), 2022, : 157 - 160
  • [50] Sparsity-Aware Sensor Selection for Correlated Noise
    Jamali-Rad, Hadi
    Simonetto, Andrea
    Leus, Geert
    Ma, Xiaoli
    2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,