A work-efficient parallel sparse matrix-sparse vector multiplication algorithm

被引：33

作者：

Azad, Ariful ^{[1
]}

Buluc, Aydin ^{[1
]}

机构：

[1] Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA 94720 USA

来源：

2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS) | 2017年

关键词：

D O I：

10.1109/IPDPS.2017.76

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

We design and develop a work-efficient multithreaded algorithm for sparse matrix-sparse vector multiplication (SpMSpV) where the matrix, the input vector, and the output vector are all sparse. SpMSpV is an important primitive in the emerging GraphBLAS standard and is the workhorse of many graph algorithms including breadth-first search, bipartite graph matching, and maximal independent set. As thread counts increase, existing multithreaded SpMSpV algorithms can spend more time accessing the sparse matrix data structure than doing arithmetic. Our shared-memory parallel SpMSpV algorithm is work efficient in the sense that its total work is proportional to the number of arithmetic operations required. The key insight is to avoid each thread individually scan the list of matrix columns. Our algorithm is simple to implement and operates on existing column-based sparse matrix formats. It performs well on diverse matrices and vectors with heterogeneous sparsity patterns. A high-performance implementation of the algorithm attains up to 15x speedup on a 24-core Intel Ivy Bridge processor and up to 49x speedup on a 64-core Intel KNL manycore processor. In contrast to implementations of existing algorithms, the performance of our algorithm is sustained on a variety of different input types include matrices representing scale-free and high-diameter graphs.

引用

页码：688 / 697

页数：10

共 50 条

[1] TileSpMSpV: A Tiled Algorithm for Sparse Matrix-Sparse Vector Multiplication on GPUs
Ji, Haonan
Song, Huimin
Lu, Shibo
Jin, Zhou
Tan, Guangming
Liu, Weifeng
51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,
[2] Merge-based Parallel Sparse Matrix-Sparse Vector Multiplication with a Vector Architecture
Li, Haoran
Yokoyama, Harumichi
Araki, Takuya
IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 43 - 50
[3] Parallel Computation of Sparse Matrix Vector Multiplication
Yin, Wei
He, Yu
2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 3, 2011, : 196 - 199
[4] Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU
Yang, Carl
Wang, Yangzihao
Owens, John D.
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 841 - 847
[5] An Efficient Sparse Matrix-Vector Multiplication on Distributed Memory Parallel Computers
Shahnaz, Rukhsana
Usman, Anila
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (01): : 77 - 82
[6] I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication
Gleinig, Niels
Besta, Maciej
Hoefler, Torsten
2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 36 - 46
[7] Adaptive sparse matrix representation for efficient matrix–vector multiplication
Pantea Zardoshti
Farshad Khunjush
Hamid Sarbazi-Azad
The Journal of Supercomputing, 2016, 72 : 3366 - 3386
[8] Hardware Support for Efficient Sparse Matrix Vector Multiplication
Ku, Anderson Kuei-An
Kuo, Jenny Yi-Chun
Xue, Jingling
EUC 2008: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING, VOL 1, MAIN CONFERENCE, 2008, : 37 - 43
[9] Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication Using Compressed Sparse Blocks
Buluc, Aydin
Fineman, Jeremy T.
Frigo, Matteo
Gilbert, John R.
Leiserson, Charles E.
SPAA'09: PROCEEDINGS OF THE TWENTY-FIRST ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2009, : 233 - 244
[10] Parallel Sparse Matrix-Vector Multiplication Using Accelerators
Maeda, Hiroshi
Takahashi, Daisuke
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II, 2016, 9787 : 3 - 18

← 1 2 3 4 5 →