Parallel Sparse Matrix-Vector Multiplication Using Accelerators

Cited by: 1

Authors:

Maeda, Hiroshi [1]

Takahashi, Daisuke [2]

Affiliations:

[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058573, Japan

[2] Univ Tsukuba, Ctr Computat Sci, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058577, Japan

Source:

COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II | 2016 / Vol. 9787

Keywords:

SpMV; Accelerator; GPU; MIC; Cluster;

DOI:

10.1007/978-3-319-42108-7_1

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Sparse matrix-vector multiplication (SpMV) is an essential computational kernel for many applications such as scientific computing. Recently, the number of computing systems equipped with NVIDIA's GPU and Intel's Xeon Phi coprocessor based on the MIC architecture has been increasing. Therefore, the importance of effective algorithms for SpMV in these systems is increasing. To the best of our knowledge, while previous studies have reported CPU and GPU implementations of SpMV for a cluster and MIC implementations for a single node, implementations of SpMV for the MIC cluster have not yet been reported. In this paper, we implemented and evaluated parallel SpMV on a GPU cluster and a MIC cluster. As shown by the results, the implementation for MIC achieved relatively high performance in some matrices with a single process, but it could not achieve higher performance than other implementations with 64 MPI processes. Therefore, we implemented and evaluated the single SpMV kernel to improve the performance of parallel SpMV.

Pages: 3 - 18

Page count: 16

50 items in total

[1] Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication Using Compressed Sparse Blocks
Buluc, Aydin
Fineman, Jeremy T.
Frigo, Matteo
Gilbert, John R.
Leiserson, Charles E.
[J]. SPAA'09: PROCEEDINGS OF THE TWENTY-FIRST ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2009, : 233 - 244
[2] Communication balancing in parallel sparse matrix-vector multiplication
Bisseling, RH
Meesen, W
[J]. ELECTRONIC TRANSACTIONS ON NUMERICAL ANALYSIS, 2005, 21 : 47 - 65
[3] Merge-based Parallel Sparse Matrix-Vector Multiplication
Merrill, Duane
Garland, Michael
[J]. SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 678 - 689
[4] Towards a fast parallel sparse symmetric matrix-vector multiplication
Geus, R
Röllin, S
[J]. PARALLEL COMPUTING, 2001, 27 (07) : 883 - 896
[5] Sparse Matrix-Vector Multiplication on GPGPUs
Filippone, Salvatore
Cardellini, Valeria
Barbieri, Davide
Fanfarillo, Alessandro
[J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2017, 43 (04):
[6] Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512
Zhang, Hong
Mills, Richard T.
Rupp, Karl
Smith, Barry F.
[J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
[7] A Nested Dissection Partitioning Method for Parallel Sparse Matrix-Vector Multiplication
Boman, Erik G.
Wolf, Michael M.
[J]. 2013 IEEE CONFERENCE ON HIGH PERFORMANCE EXTREME COMPUTING (HPEC), 2013,
[8] Structured sparse matrix-vector multiplication on massively parallel SIMD architectures
Dehn, T
Eiermann, M
Giebermann, K
Sperling, V
[J]. PARALLEL COMPUTING, 1995, 21 (12) : 1867 - 1894
[9] Load-balanced sparse matrix-vector multiplication on parallel computers
Nastea, SG
Frieder, O
El-Ghazawi, T
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 46 (02) : 180 - 193
[10] An Efficient Sparse Matrix-Vector Multiplication on Distributed Memory Parallel Computers
Shahnaz, Rukhsana
Usman, Anila
[J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (01) : 77 - 82