Acceleration of Approximate Matrix Multiplications on GPUs

被引:0
|
作者
Okuyama, Takuya [1 ]
Rohm, Andre [1 ]
Mihana, Takatomo [1 ]
Naruse, Makoto [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat Phys & Comp, Tokyo 1138656, Japan
关键词
approximate calculation; approximate matrix multiplication; GPU computing; ALGORITHMS; ALIGNMENT;
D O I
10.3390/e25081130
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Matrix multiplication is important in various information-processing applications, including the computation of eigenvalues and eigenvectors, and in combinatorial optimization algorithms. Therefore, reducing the computation time of matrix products is essential to speed up scientific and practical calculations. Several approaches have been proposed to speed up this process, including GPUs, fast matrix multiplication libraries, custom hardware, and efficient approximate matrix multiplication (AMM) algorithms. However, research to date has yet to focus on accelerating AMMs for general matrices on GPUs, despite the potential of GPUs to perform fast and accurate matrix product calculations. In this paper, we propose a method for improving Monte Carlo AMMs. We also give an analytical solution for the optimal values of the hyperparameters in the proposed method. The proposed method improves the approximation of the matrix product without increasing the computation time compared to the conventional AMMs. It is also designed to work well with parallel operations on GPUs and can be incorporated into various algorithms. Finally, the proposed method is applied to a power method used for eigenvalue computation. We demonstrate that, on an NVIDIA A100 GPU, the computation time can be halved compared to the conventional power method using cuBLAS.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Improving accuracy for matrix multiplications on GPUs
    Badin, Matthew
    Bic, Lubomir
    Dillencourt, Michael
    Nicolau, Alexandru
    [J]. SCIENTIFIC PROGRAMMING, 2011, 19 (01) : 3 - 11
  • [2] Batched Small Tensor-Matrix Multiplications on GPUs
    Zhai, Keke
    Banerjee, Tania
    Wijayasiri, Adeesha
    Ranka, Sanjay
    [J]. 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 305 - 314
  • [3] A Memory Transaction Model for Sparse Matrix-Vector Multiplications on GPUs
    Keklikian, Thalie
    Langlois, J. M. Pierre
    Savaria, Yvon
    [J]. 2014 IEEE 12TH INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2014, : 309 - 312
  • [4] Performance Portability of Sparse Block Diagonal Matrix Multiple Vector Multiplications on GPUs
    Ibrahim, Khaled Z.
    Yang, Chao
    Maris, Pieter
    [J]. 2022 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC), 2022, : 58 - 67
  • [5] In-DRAM Near-Data Approximate Acceleration for GPUs
    Yazdanbakhsh, Amir
    Song, Choungki
    Sacks, Jacob
    Lotfi-Kamran, Pejman
    Esmaeilzadeh, Hadi
    Kim, Nam Sung
    [J]. 27TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2018), 2018,
  • [6] Matrix Factorization on GPUs with Memory Optimization and Approximate Computing
    Tan, Wei
    Chang, Shiyu
    Fong, Liana
    Li, Cheng
    Wang, Zijun
    Cao, LiangLiang
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [7] Performance of parallel sparse matrix-vector multiplications in linear solves on multiple GPUs
    Jamroz, Ben
    Mullowney, Paul
    [J]. 2012 SYMPOSIUM ON APPLICATION ACCELERATORS IN HIGH PERFORMANCE COMPUTING (SAAHPC), 2012, : 149 - 152
  • [8] Accelerating approximate matrix multiplication for near-sparse matrices on GPUs
    Liu, Xiaoyan
    Liu, Yi
    Yang, Hailong
    Dun, Ming
    Yin, Bohong
    Luan, Zhongzhi
    Qian, Depei
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (09): : 11464 - 11491
  • [9] Threaded Accurate Matrix-Matrix Multiplications with Sparse Matrix-Vector Multiplications
    Ichimura, Shuntaro
    Ogita, Takeshi
    Katagiri, Takahiro
    Nagai, Toru
    Ozaki, Katsuhisa
    [J]. 2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 1093 - 1102
  • [10] Accelerating approximate matrix multiplication for near-sparse matrices on GPUs
    Xiaoyan Liu
    Yi Liu
    Hailong Yang
    Ming Dun
    Bohong Yin
    Zhongzhi Luan
    Depei Qian
    [J]. The Journal of Supercomputing, 2022, 78 : 11464 - 11491