PERFORMANCE EVALUATION OF SPARSE MATRIX-MATRIX MULTIPLICATION

Times cited: 0

Authors:

Jain-Mendon, Shweta [1]

Sass, Ron [1]

Affiliation:

[1] Univ N Carolina, Reconfigurable Comp Syst Lab, Charlotte, NC 28223 USA

Source:

2013 23RD INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL 2013) PROCEEDINGS | 2013

Keywords:

DOI:

Not available

Chinese Library Classification:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

The conventional matrix multiplication algorithms that are suitable for dense matrices do not perform well on the corresponding Sparse Matrix-Matrix Multiplication (SMMM) operation. In particular, they do not utilize the sparsity of the matrix. This paper describes a new technique for performing the SMMM operation using a novel storage format for sparse matrices. To demonstrate the feasibility of this technique, the SMMM operation is implemented on an FPGA and various parameters that affect the performance of the design are explored.
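
The abstract's point that dense algorithms do not exploit sparsity can be illustrated with a small sketch. This record does not describe the paper's storage format, so the code below is not the authors' technique: it assumes a generic row-wise layout (one dictionary of column index to value per row) and a standard row-by-row accumulation. The names `dense_matmul`, `to_rows`, and `sparse_matmul` are hypothetical, chosen only for this illustration. Both routines count scalar multiplications so the difference in work is visible.

```python
def dense_matmul(a, b):
    """Triple-loop dense product; multiplies every pair, including zeros."""
    n, k, m = len(a), len(b), len(b[0])
    c = [[0] * m for _ in range(n)]
    mults = 0
    for i in range(n):
        for j in range(m):
            for p in range(k):
                c[i][j] += a[i][p] * b[p][j]
                mults += 1
    return c, mults


def to_rows(mat):
    """Assumed sparse layout: one {column: value} dict per row, nonzeros only."""
    return [{j: v for j, v in enumerate(row) if v != 0} for row in mat]


def sparse_matmul(a_rows, b_rows):
    """Row-wise sparse product: only nonzero pairs are ever multiplied."""
    c_rows = []
    mults = 0
    for a_row in a_rows:
        acc = {}  # accumulator for one output row
        for p, a_val in a_row.items():
            # Nonzero A[i][p] scales row p of B into output row i.
            for j, b_val in b_rows[p].items():
                acc[j] = acc.get(j, 0) + a_val * b_val
                mults += 1
        c_rows.append(acc)
    return c_rows, mults


A = [[1, 0, 0], [0, 0, 2], [0, 3, 0]]
B = [[0, 4, 0], [5, 0, 0], [0, 0, 6]]

dense_c, dense_mults = dense_matmul(A, B)
sparse_c, sparse_mults = sparse_matmul(to_rows(A), to_rows(B))
print(dense_mults, sparse_mults)
```

For these 3x3 inputs with three nonzeros each, the dense loop performs every multiplication regardless of value, while the row-wise version touches only the nonzero pairs. How a hardware design such as an FPGA implementation would store and stream the nonzeros is a separate question that this sketch does not address.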

Pages: 4
