A Fast Sparse Block Circulant Matrix Vector Product

被引:0
|
作者
Romero, Eloy [1 ]
Tomas, Andres [1 ]
Soriano, Antonio [1 ]
Blanquer, Ignacio [1 ]
机构
[1] Univ Politecn Valencia, CSIC, CIEMAT, Inst Instrumentac Imagen Mol I3M,Ctr Mixto, Camino Vera S-N, E-46022 Valencia, Spain
来源
关键词
Circulant matrix; sparse matrix; matrix vector product; GPU; multi-core; computed tomography; IMAGE-RECONSTRUCTION; COMPUTED-TOMOGRAPHY;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the context of computed tomography (CT), iterative image reconstruction techniques are gaining attention because high-quality images are becoming computationally feasible. They involve the solution of large systems of equations, whose cost is dominated by the sparse matrix vector product (SpMV). Our work considers the case of the sparse matrices being block circulant, which arises when taking advantage of the rotational symmetry in the tomographic system. Besides the straightforward storage saving, we exploit the circulant structure to rewrite the poor-performance SpMVs into a high-performance product between sparse and dense matrices. This paper describes the implementations developed for multi-core CPUs and GPUs, and presents experimental results with typical CT matrices. The presented approach is up to ten times faster than without exploiting the circulant structure.
引用
收藏
页码:548 / 559
页数:12
相关论文
共 50 条
  • [21] Semiautomatic Acceleration of Sparse Matrix-Vector Product Using OpenACC
    Stpiczynski, Przemyslaw
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PPAM 2015, PT II, 2016, 9574 : 143 - 152
  • [22] Efficient Sparse-Matrix Multi-Vector Product on GPUs
    Hong, Changwan
    Sukumaran-Rajam, Aravind
    Bandyopadhyay, Bortik
    Kim, Jinsung
    Kurt, Sureyya Emre
    Nisa, Israt
    Sabhlok, Shivani
    Catalyurek, Umit V.
    Parthasarathy, Srinivasan
    Sadayappan, P.
    HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2018, : 66 - 79
  • [23] Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications
    Ashari, Arash
    Sedaghati, Naser
    Eisenlohr, John
    Parthasarathy, Srinivasan
    Sadayappan, P.
    SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 781 - 792
  • [24] Towards a fast parallel sparse symmetric matrix-vector multiplication
    Geus, R
    Röllin, S
    PARALLEL COMPUTING, 2001, 27 (07) : 883 - 896
  • [25] Fast sparse matrix-vector multiplication for TeraFlop/s computers
    Wellein, G
    Hager, G
    Basermann, A
    Fehske, H
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2002, 2003, 2565 : 287 - 301
  • [26] A fast GPU algorithm for the inverse of a circulant matrix
    Zheng, Zuoyong
    Zhang, Ruixia
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE II, PTS 1-6, 2012, 121-126 : 3755 - 3759
  • [27] Performance Portability of Sparse Block Diagonal Matrix Multiple Vector Multiplications on GPUs
    Ibrahim, Khaled Z.
    Yang, Chao
    Maris, Pieter
    2022 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC), 2022, : 58 - 67
  • [28] Performance of Portable Sparse Matrix-Vector Product Implemented Using OpenACC
    Stec, Kinga
    Stpiczynski, Przemyslaw
    Proceedings of the 18th Conference on Computer Science and Intelligence Systems, FedCSIS 2023, 2023, : 1155 - 1160
  • [29] Modeling and improving locality for the sparse-matrix-vector product on cache memories
    Heras, DB
    Blanco, V
    Cabaleiro, JC
    Rivera, FF
    FUTURE GENERATION COMPUTER SYSTEMS, 2001, 18 (01) : 55 - 67
  • [30] A NEW STORAGE SCHEME FOR AN EFFICIENT IMPLEMENTATION OF THE SPARSE MATRIX-VECTOR PRODUCT
    FERNANDES, P
    GIRDINIO, P
    PARALLEL COMPUTING, 1989, 12 (03) : 327 - 333