Parallel Sparse Matrix-Vector Multiplication Using Accelerators

被引:1
|
作者
Maeda, Hiroshi [1 ]
Takahashi, Daisuke [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058573, Japan
[2] Univ Tsukuba, Ctr Computat Sci, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058577, Japan
关键词
SpMV; Accelerator; GPU; MIC; Cluster;
D O I
10.1007/978-3-319-42108-7_1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sparse matrix-vector multiplication (SpMV) is an essential computational kernel for many applications such as scientific computing. Recently, the number of computing systems equipped with NVIDIA's GPU and Intel's Xeon Phi coprocessor based on the MIC architecture has been increasing. Therefore, the importance of effective algorithms for SpMV in these systems is increasing. To the best of our knowledge, while previous studies have reported CPU and GPU implementations of SpMV for a cluster and MIC implementations for a single node, implementations of SpMV for the MIC cluster have not yet been reported. In this paper, we implemented and evaluated parallel SpMV on a GPU cluster and a MIC cluster. As shown by the results, the implementation for MIC achieved relatively high performance in some matrices with a single process, but it could not achieve higher performance than other implementations with 64 MPI processes. Therefore, we implemented and evaluated the single SpMV kernel to improve the performance of parallel SpMV.
引用
收藏
页码:3 / 18
页数:16
相关论文
共 50 条
  • [1] Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication Using Compressed Sparse Blocks
    Buluc, Aydin
    Fineman, Jeremy T.
    Frigo, Matteo
    Gilbert, John R.
    Leiserson, Charles E.
    [J]. SPAA'09: PROCEEDINGS OF THE TWENTY-FIRST ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2009, : 233 - 244
  • [2] Communication balancing in parallel sparse matrix-vector multiplication
    Bisseling, RH
    Meesen, W
    [J]. ELECTRONIC TRANSACTIONS ON NUMERICAL ANALYSIS, 2005, 21 : 47 - 65
  • [3] Merge-based Parallel Sparse Matrix-Vector Multiplication
    Merrill, Duane
    Garland, Michael
    [J]. SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 678 - 689
  • [4] Towards a fast parallel sparse symmetric matrix-vector multiplication
    Geus, R
    Röllin, S
    [J]. PARALLEL COMPUTING, 2001, 27 (07) : 883 - 896
  • [5] Sparse Matrix-Vector Multiplication on GPGPUs
    Filippone, Salvatore
    Cardellini, Valeria
    Barbieri, Davide
    Fanfarillo, Alessandro
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2017, 43 (04):
  • [6] Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512
    Zhang, Hong
    Mills, Richard T.
    Rupp, Karl
    Smith, Barry F.
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [7] A Nested Dissection Partitioning Method for Parallel Sparse Matrix-Vector Multiplication
    Boman, Erik G.
    Wolf, Michael M.
    [J]. 2013 IEEE CONFERENCE ON HIGH PERFORMANCE EXTREME COMPUTING (HPEC), 2013,
  • [8] Structured sparse matrix-vector multiplication on massively parallel SIMD architectures
    Dehn, T
    Eiermann, M
    Giebermann, K
    Sperling, V
    [J]. PARALLEL COMPUTING, 1995, 21 (12) : 1867 - 1894
  • [9] Load-balanced sparse matrix-vector multiplication on parallel computers
    Nastea, SG
    Frieder, O
    El-Ghazawi, T
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 46 (02) : 180 - 193
  • [10] An Efficient Sparse Matrix-Vector Multiplication on Distributed Memory Parallel Computers
    Shahnaz, Rukhsana
    Usman, Anila
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (01): : 77 - 82