A Fast Sparse Block Circulant Matrix Vector Product

被引：0

作者：

Romero, Eloy ^{[1
]}

Tomas, Andres ^{[1
]}

Soriano, Antonio ^{[1
]}

Blanquer, Ignacio ^{[1
]}

机构：

[1] Univ Politecn Valencia, CSIC, CIEMAT, Inst Instrumentac Imagen Mol I3M,Ctr Mixto, Camino Vera S-N, E-46022 Valencia, Spain

来源：

EURO-PAR 2014 PARALLEL PROCESSING | 2014年 / 8632卷

关键词：

Circulant matrix; sparse matrix; matrix vector product; GPU; multi-core; computed tomography; IMAGE-RECONSTRUCTION; COMPUTED-TOMOGRAPHY;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the context of computed tomography (CT), iterative image reconstruction techniques are gaining attention because high-quality images are becoming computationally feasible. They involve the solution of large systems of equations, whose cost is dominated by the sparse matrix vector product (SpMV). Our work considers the case of the sparse matrices being block circulant, which arises when taking advantage of the rotational symmetry in the tomographic system. Besides the straightforward storage saving, we exploit the circulant structure to rewrite the poor-performance SpMVs into a high-performance product between sparse and dense matrices. This paper describes the implementations developed for multi-core CPUs and GPUs, and presents experimental results with typical CT matrices. The presented approach is up to ten times faster than without exploiting the circulant structure.

引用

页码：548 / 559

页数：12

共 50 条

[21] Semiautomatic Acceleration of Sparse Matrix-Vector Product Using OpenACC
Stpiczynski, Przemyslaw
PARALLEL PROCESSING AND APPLIED MATHEMATICS, PPAM 2015, PT II, 2016, 9574 : 143 - 152
[22] Efficient Sparse-Matrix Multi-Vector Product on GPUs
Hong, Changwan
Sukumaran-Rajam, Aravind
Bandyopadhyay, Bortik
Kim, Jinsung
Kurt, Sureyya Emre
Nisa, Israt
Sabhlok, Shivani
Catalyurek, Umit V.
Parthasarathy, Srinivasan
Sadayappan, P.
HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2018, : 66 - 79
[23] Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications
Ashari, Arash
Sedaghati, Naser
Eisenlohr, John
Parthasarathy, Srinivasan
Sadayappan, P.
SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 781 - 792
[24] Towards a fast parallel sparse symmetric matrix-vector multiplication
Geus, R
Röllin, S
PARALLEL COMPUTING, 2001, 27 (07) : 883 - 896
[25] Fast sparse matrix-vector multiplication for TeraFlop/s computers
Wellein, G
Hager, G
Basermann, A
Fehske, H
HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2002, 2003, 2565 : 287 - 301
[26] A fast GPU algorithm for the inverse of a circulant matrix
Zheng, Zuoyong
Zhang, Ruixia
FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE II, PTS 1-6, 2012, 121-126 : 3755 - 3759
[27] Performance Portability of Sparse Block Diagonal Matrix Multiple Vector Multiplications on GPUs
Ibrahim, Khaled Z.
Yang, Chao
Maris, Pieter
2022 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC), 2022, : 58 - 67
[28] Performance of Portable Sparse Matrix-Vector Product Implemented Using OpenACC
Stec, Kinga
Stpiczynski, Przemyslaw
Proceedings of the 18th Conference on Computer Science and Intelligence Systems, FedCSIS 2023, 2023, : 1155 - 1160
[29] Modeling and improving locality for the sparse-matrix-vector product on cache memories
Heras, DB
Blanco, V
Cabaleiro, JC
Rivera, FF
FUTURE GENERATION COMPUTER SYSTEMS, 2001, 18 (01) : 55 - 67
[30] A NEW STORAGE SCHEME FOR AN EFFICIENT IMPLEMENTATION OF THE SPARSE MATRIX-VECTOR PRODUCT
FERNANDES, P
GIRDINIO, P
PARALLEL COMPUTING, 1989, 12 (03) : 327 - 333

← 1 2 3 4 5 →