Balancing Computation and Communication in Distributed Sparse Matrix-Vector Multiplication

被引：0

作者：

Mi, Hongli ^{[1
]}

Yu, Xiangrui ^{[1
]}

Yu, Xiaosong ^{[1
]}

Wu, Shuangyuan ^{[1
]}

Liu, Weifeng ^{[1
]}

机构：

[1] China Univ Petr, Super Sci Software Lab, Beijing, Peoples R China

来源：

2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID | 2023年

基金：

中国国家自然科学基金;

关键词：

Distributed memory system; sparse matrixvector multiplication; load balancing; PARTITIONING MODELS; ALGORITHM; FORMAT;

D O I：

10.1109/CCGRID57682.2023.00056

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Sparse Matrix-Vector Multiplication (SpMV) is a fundamental operation in a number of scientific and engineering problems. When the sparse matrices processed are large enough, distributed memory systems should be used to accelerate SpMV. At present, the optimization techniques for distributed SpMV mainly focus on reordering through graph or hypergraph partitioning. However, although the reordering could reduce the amount of communications in general, there are still load balancing challenges in computations and communications on distributed platforms that are not well addressed. In this paper, we propose two strategies to optimize SpMV on distributed clusters: (1) resizing the number of row blocks on the nodes for balancing the amount of computations, and (2) adjusting the column number of the diagonal blocks for balancing tasks and reducing communications among compute nodes. The experimental results show that compared with the classic distributed SpMV implementation and its variant reordered with graph partitioning, our algorithm achieves on average 77.20x and 5.18x (up to 460.52x and 27.50x) speedups, respectively. Also, our method bring on average 19.56x (up to 48.49x) speedup over a recently proposed hybrid distributed SpMV algorithm. In addition, our algorithm achieves obviously better scalability over these existing distributed SpMV methods.

引用

页码：535 / 544

页数：10

共 50 条

[21] Energy Evaluation of Sparse Matrix-Vector Multiplication on GPU
Benatia, Akrem
Ji, Weixing
Wang, Yizhuo
Shi, Feng
[J]. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2016,
[22] Optimization by Runtime Specialization for Sparse Matrix-Vector Multiplication
Kamin, Sam
Garzaran, Maria Jesus
Aktemur, Baris
Xu, Danqing
Yilmaz, Buse
Chen, Zhongbo
[J]. ACM SIGPLAN NOTICES, 2015, 50 (03) : 93 - 102
[23] A new approach for accelerating the sparse matrix-vector multiplication
Tvrdik, Pavel
Simecek, Ivan
[J]. SYNASC 2006: EIGHTH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, PROCEEDINGS, 2007, : 156 - +
[24] A New Method of Sparse Matrix-Vector Multiplication on GPU
Huan, Gao
Qian, Zhang
[J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 954 - 958
[25] Sparse Binary Matrix-Vector Multiplication on Neuromorphic Computers
Schuman, Catherine D.
Kay, Bill
Date, Prasanna
Kannan, Ramakrishnan
Sao, Piyush
Potok, Thomas E.
[J]. 2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 308 - 311
[26] No Zero Padded Sparse Matrix-Vector Multiplication on FPGAs
Huang, Jiasen
Ren, Junyan
Yin, Wenbo
Wang, Lingli
[J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2014, : 290 - 291
[27] Sparse Matrix-Vector Multiplication on a Reconfigurable Supercomputer with Application
Dubois, David
Dubois, Andrew
Boorman, Thomas
Connor, Carolyn
Poole, Steve
[J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2010, 3 (01)
[28] Optimization techniques for sparse matrix-vector multiplication on GPUs
Maggioni, Marco
Berger-Wolf, Tanya
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2016, 93-94 : 66 - 86
[29] Parallel Sparse Matrix-Vector Multiplication Using Accelerators
Maeda, Hiroshi
Takahashi, Daisuke
[J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II, 2016, 9787 : 3 - 18
[30] Adaptive diagonal sparse matrix-vector multiplication on GPU
Gao, Jiaquan
Xia, Yifei
Yin, Renjie
He, Guixia
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 157 : 287 - 302

← 1 2 3 4 5 →