PERFORMANCE EVALUATION OF SPARSE MATRIX-MATRIX MULTIPLICATION

被引:0
|
作者
Jain-Mendon, Shweta [1 ]
Sass, Ron [1 ]
机构
[1] Univ N Carolina, Reconfigurable Comp Syst Lab, Charlotte, NC 28223 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The conventional matrix multiplication algorithms that are suitable for dense matrices do not perform well on the corresponding Sparse Matrix-Matrix Multiplication (SMMM) operation. In particular, they do not utilize the sparsity of the matrix. This paper describes a new technique for performing the SMMM operation using a novel storage format for sparse matrices. To demonstrate the feasibility of this technique, the SMMM operation is implemented on an FPGA and various parameters that affect the performance of the design are explored.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Reducing inter-process communication overhead in parallel sparse matrix-matrix multiplication
    Ahmed M.S.
    Houser J.
    Hoque M.A.
    Raju R.
    Pfeiffer P.
    [J]. Int. J. Grid High Perform. Comput., 3 (46-59): : 46 - 59
  • [42] SPMSD: An Partitioning-Strategy for Parallel General Sparse Matrix-Matrix Multiplication on GPU
    Cui, Huanyu
    Wang, Nianbin
    Han, Qilong
    Wang, Ye
    [J]. PARALLEL PROCESSING LETTERS, 2024, 34 (02)
  • [43] Accelerating Sparse General Matrix-Matrix Multiplication for NVIDIA Volta GPU and Hygon DCU
    Tian, Zhuo
    Yang, Shuai
    Zhang, Changyou
    [J]. PROCEEDINGS OF THE 32ND INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, HPDC 2023, 2023, : 329 - 330
  • [44] High-performance and Memory-saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU
    Nagasaka, Yusuke
    Nukada, Akira
    Matsuoka, Satoshi
    [J]. 2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 101 - 110
  • [45] SIMULTANEOUS INPUT AND OUTPUT MATRIX PARTITIONING FOR OUTER-PRODUCT-PARALLEL SPARSE MATRIX-MATRIX MULTIPLICATION
    Akbudak, Kadir
    Aykanat, Cevdet
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2014, 36 (05): : C568 - C590
  • [46] Parallel sparse matrix-matrix multiplication: a scalable solution with 1D algorithm
    Hoque, Mohammad Asadul
    Raju, Md Rezaul Karim
    Tymczak, Christopher John
    Vrinceanu, Daniel
    Chilakamarri, Kiran
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2015, 11 (04) : 391 - 401
  • [47] Bandwidth Optimized Parallel Algorithms for Sparse Matrix-Matrix Multiplication using Propagation Blocking
    Gu, Zhixiang
    Moreira, Jose
    Edelsohn, David
    Azad, Ariful
    [J]. PROCEEDINGS OF THE 32ND ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA '20), 2020, : 293 - 303
  • [48] Learning from Optimizing Matrix-Matrix Multiplication
    Parikh, Devangi N.
    Huang, Jianyu
    Myers, Margaret E.
    van de Geijn, Robert A.
    [J]. 2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 332 - 339
  • [49] Fast Kronecker Matrix-Matrix Multiplication on GPUs
    Jangda, Abhinav
    Yadav, Mohit
    [J]. PROCEEDINGS OF THE 29TH ACM SIGPLAN ANNUAL SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, PPOPP 2024, 2024, : 390 - 403
  • [50] Communication-Avoiding and Memory-Constrained Sparse Matrix-Matrix Multiplication at Extreme Scale
    Hussain, Md Taufique
    Selvitopi, Oguz
    Buluc, Aydin
    Azad, Ariful
    [J]. 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 90 - 100