Exploring the Design Space of Distributed Parallel Sparse Matrix-Multiple Vector Multiplication

Cited by: 0
Authors
Huang, Hua [1]
Chow, Edmond [1]
Affiliations
[1] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
Keywords
Sparse matrices; Partitioning algorithms; Vectors; Costs; Three-dimensional displays; Space exploration; Optimization; SpMM; SpMV; distributed-memory matrix multiplication; communication optimization; performance; framework
DOI
10.1109/TPDS.2024.3452478
Chinese Library Classification number
TP301 [Theory, methods]
Subject classification code
081202
Abstract
We consider the distributed memory parallel multiplication of a sparse matrix by a dense matrix (SpMM). The dense matrix is often a collection of dense vectors. Standard implementations will multiply the sparse matrix by multiple dense vectors at the same time, to exploit the computational efficiencies therein. But such approaches generally utilize the same sparse matrix partitioning as if multiplying by a single vector. This article explores the design space of parallelizing SpMM and shows that a coarser grain partitioning of the matrix combined with a column-wise partitioning of the block of vectors can often require less communication volume and achieve higher SpMM performance. An algorithm is presented that chooses a process grid geometry for a given number of processes to optimize the performance of parallel SpMM. The algorithm can augment existing graph partitioners by utilizing the additional concurrency available when multiplying by multiple dense vectors to further reduce communication.
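The trade-off described in the abstract can be illustrated with a toy cost model. The sketch below is not the authors' algorithm; it assumes a `pr x pc` process grid in which the sparse matrix is split into `pr` contiguous row blocks and the `k` dense vectors into `pc` column groups, and it counts only the remote rows of the dense block that each row block must fetch. The function names (`factor_pairs`, `ghost_rows`, `comm_volume`, `best_grid`) and the cost model itself are hypothetical. The model ignores replication of the matrix, memory limits, and any redistribution of the vectors.

```python
def factor_pairs(p):
    """All grid shapes (pr, pc) with pr * pc == p."""
    return [(d, p // d) for d in range(1, p + 1) if p % d == 0]


def ghost_rows(rows, n, pr):
    """Total number of remote dense-block rows needed, summed over pr row blocks.

    rows[i] lists the column indices of the nonzeros in row i of the sparse
    matrix. Row blocks are contiguous and near-equal in size (an assumption;
    a graph partitioner would normally choose them).
    """
    bounds = [n * b // pr for b in range(pr + 1)]
    total = 0
    for b in range(pr):
        lo, hi = bounds[b], bounds[b + 1]
        # Distinct column indices that fall outside the locally owned range.
        needed = {c for i in range(lo, hi) for c in rows[i] if not lo <= c < hi}
        total += len(needed)
    return total


def comm_volume(rows, n, k, pr, pc):
    """Words received in this toy model: each ghost row is fetched for all k
    columns in total, spread over the pc column groups."""
    return ghost_rows(rows, n, pr) * k


def best_grid(rows, n, k, p):
    """Pick the (pr, pc) with the smallest modeled volume, requiring pc <= k."""
    candidates = [
        (comm_volume(rows, n, k, pr, pc), pr, pc)
        for pr, pc in factor_pairs(p)
        if pc <= k
    ]
    return min(candidates)  # (volume, pr, pc)


# Example: tridiagonal 16 x 16 matrix, 4 dense vectors, 8 processes.
n = 16
rows = [[c for c in (i - 1, i, i + 1) if 0 <= c < n] for i in range(n)]
print(best_grid(rows, n, k=4, p=8))
```

In this model a coarser row partitioning (smaller `pr`) always needs fewer remote rows, which is why splitting the vectors column-wise can lower the total volume. A realistic selection would also have to weigh load balance and message counts.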
Pages: 1977-1988
Number of pages: 12