Acceleration of Approximate Matrix Multiplications on GPUs

被引：0

作者：

Okuyama, Takuya ^{[1
]}

Rohm, Andre ^{[1
]}

Mihana, Takatomo ^{[1
]}

Naruse, Makoto ^{[1
]}

机构：

[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat Phys & Comp, Tokyo 1138656, Japan

来源：

ENTROPY | 2023年 / 25卷 / 08期

关键词：

approximate calculation; approximate matrix multiplication; GPU computing; ALGORITHMS; ALIGNMENT;

D O I：

10.3390/e25081130

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Matrix multiplication is important in various information-processing applications, including the computation of eigenvalues and eigenvectors, and in combinatorial optimization algorithms. Therefore, reducing the computation time of matrix products is essential to speed up scientific and practical calculations. Several approaches have been proposed to speed up this process, including GPUs, fast matrix multiplication libraries, custom hardware, and efficient approximate matrix multiplication (AMM) algorithms. However, research to date has yet to focus on accelerating AMMs for general matrices on GPUs, despite the potential of GPUs to perform fast and accurate matrix product calculations. In this paper, we propose a method for improving Monte Carlo AMMs. We also give an analytical solution for the optimal values of the hyperparameters in the proposed method. The proposed method improves the approximation of the matrix product without increasing the computation time compared to the conventional AMMs. It is also designed to work well with parallel operations on GPUs and can be incorporated into various algorithms. Finally, the proposed method is applied to a power method used for eigenvalue computation. We demonstrate that, on an NVIDIA A100 GPU, the computation time can be halved compared to the conventional power method using cuBLAS.

引用

页数：19

共 50 条

[1] Improving accuracy for matrix multiplications on GPUs
Badin, Matthew
Bic, Lubomir
Dillencourt, Michael
Nicolau, Alexandru
[J]. SCIENTIFIC PROGRAMMING, 2011, 19 (01) : 3 - 11
[2] Batched Small Tensor-Matrix Multiplications on GPUs
Zhai, Keke
Banerjee, Tania
Wijayasiri, Adeesha
Ranka, Sanjay
[J]. 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 305 - 314
[3] A Memory Transaction Model for Sparse Matrix-Vector Multiplications on GPUs
Keklikian, Thalie
Langlois, J. M. Pierre
Savaria, Yvon
[J]. 2014 IEEE 12TH INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2014, : 309 - 312
[4] Performance Portability of Sparse Block Diagonal Matrix Multiple Vector Multiplications on GPUs
Ibrahim, Khaled Z.
Yang, Chao
Maris, Pieter
[J]. 2022 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC), 2022, : 58 - 67
[5] In-DRAM Near-Data Approximate Acceleration for GPUs
Yazdanbakhsh, Amir
Song, Choungki
Sacks, Jacob
Lotfi-Kamran, Pejman
Esmaeilzadeh, Hadi
Kim, Nam Sung
[J]. 27TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2018), 2018,
[6] Matrix Factorization on GPUs with Memory Optimization and Approximate Computing
Tan, Wei
Chang, Shiyu
Fong, Liana
Li, Cheng
Wang, Zijun
Cao, LiangLiang
[J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
[7] Performance of parallel sparse matrix-vector multiplications in linear solves on multiple GPUs
Jamroz, Ben
Mullowney, Paul
[J]. 2012 SYMPOSIUM ON APPLICATION ACCELERATORS IN HIGH PERFORMANCE COMPUTING (SAAHPC), 2012, : 149 - 152
[8] Accelerating approximate matrix multiplication for near-sparse matrices on GPUs
Liu, Xiaoyan
Liu, Yi
Yang, Hailong
Dun, Ming
Yin, Bohong
Luan, Zhongzhi
Qian, Depei
[J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (09): : 11464 - 11491
[9] Threaded Accurate Matrix-Matrix Multiplications with Sparse Matrix-Vector Multiplications
Ichimura, Shuntaro
Ogita, Takeshi
Katagiri, Takahiro
Nagai, Toru
Ozaki, Katsuhisa
[J]. 2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 1093 - 1102
[10] Accelerating approximate matrix multiplication for near-sparse matrices on GPUs
Xiaoyan Liu
Yi Liu
Hailong Yang
Ming Dun
Bohong Yin
Zhongzhi Luan
Depei Qian
[J]. The Journal of Supercomputing, 2022, 78 : 11464 - 11491

← 1 2 3 4 5 →