Sparsity-Aware Communication for Distributed Graph Neural Network Training

被引：0

作者：

Mukhopadhyay, Ujjaini ^{[1
]}

Tripathy, Alok ^{[1
]}

Selvitopi, Oguz ^{[2
]}

Yelick, Katherine ^{[1
]}

Buluc, Aydin ^{[2
]}

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

[2] Lawrence Berkeley Nat Lab, Berkeley, CA USA

来源：

53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024 | 2024年

关键词：

MATRIX MULTIPLICATION;

D O I：

10.1145/3673038.3673152

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Graph Neural Networks (GNNs) are a computationally efficient method to learn embeddings and classifications on graph data. However, GNN training has low computational intensity, making communication costs the bottleneck for scalability. Sparse-matrix dense-matrix multiplication (SpMM) is the core computational operation in full-graph training of GNNs. Previous work parallelizing this operation focused on sparsity-oblivious algorithms, where matrix elements are communicated regardless of the sparsity pattern. This leads to a predictable communication pattern that can be overlapped with computation and enables the use of collective communication operations at the expense of wasting significant bandwidth by communicating unnecessary data. We develop sparsity-aware algorithms that tackle the communication bottlenecks in GNN training with three novel approaches. First, we communicate only the necessary matrix elements. Second, we utilize a graph partitioning model to reorder the matrix and drastically reduce the amount of communicated elements. Finally, we address the high load imbalance in communication with a tailored partitioning model, which minimizes both the total communication volume and the maximum sending volume. We further couple these sparsity-exploiting approaches with a communication-avoiding approach (1.5D parallel SpMM) in which submatrices are replicated to reduce communication. We explore the tradeoffs of these combined optimizations and show up to 14x improvement on 256 GPUs and on some instances reducing communication to almost zero resulting in a communication-free parallel training relative to a popular GNN framework based on communication-oblivious SpMM.

引用

页码：117 / 126

页数：10

共 50 条

[41] Sparsity-Aware Estimation of CDMA System Parameters
Angelosante, Daniele
Grossi, Emanuele
Giannakis, Georgios B.
Lops, Marco
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
[42] Sparsity-aware multitarget localisation for distributed MIMO radar against phase synchronisation mismatch
Sun, Bin
Chen, Haowen
Zou, Huanxin
IET COMMUNICATIONS, 2016, 10 (17) : 2269 - 2275
[43] A Simple Sparsity-aware Feature LMS Algorithm
Chaves, Gabriel S.
Lima, Markus V. S.
Yazdanpanah, Hamed
Diniz, Paulo S. R.
Ferreira, Tadeu N.
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[44] Sparsity-Aware Distributed Conjugate Gradient Algorithms for Parameter Estimation over Sensor Networks
Miller, Tamara Guerra
Xu, Songcen
de lamare, Rodrigo C.
Nascimento, Vitor H.
Zakharov, Yuriy
2015 49TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2015, : 1556 - 1560
[45] SPRITE: Sparsity-Aware Neural Processing Unit with Constant Probability of Index-Matching
Ryu, Sungju
Oh, Youngtaek
Kim, Taesu
Ahn, Daehyun
Kim, Jae-Joon
PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 663 - 666
[46] Sparsity-aware distributed adaptive filtering with robustness against impulsive noise and low SNR
do Carmo, Rafael Moura
de R. Ferreira, Guilherme
Campelo, Pedro Henrique
Resende, Leonardo C.
de Lima, Leonardo
Henriques, Felipe da Rocha
Haddad, Diego Barreto
TELECOMMUNICATION SYSTEMS, 2024, 86 (03) : 451 - 461
[47] Sparsity-Aware Estimation of Nonlinear Volterra Kernels
Kekatos, Vassilis
Angelosante, Daniele
Giannakis, Georgios B.
2009 3RD IEEE INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2009), 2009, : 129 - 132
[48] Robust and sparsity-aware adaptive filters: A Review
Kumar, Krishna
Pandey, Rajlaxmi
Karthik, M. L. N. S.
Bhattacharjee, Sankha Subhra
George, Nithin V.
SIGNAL PROCESSING, 2021, 189 (189)
[49] Sparsity-Aware Reachability Computation for Massive Graphs
Kim, Sung-Soo
Kang, Young-Min
Kimt, Young-Kuk
2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022), 2022, : 157 - 160
[50] Sparsity-Aware Sensor Selection for Correlated Noise
Jamali-Rad, Hadi
Simonetto, Andrea
Leus, Geert
Ma, Xiaoli
2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,

← 1 2 3 4 5 →