Performance evaluation of the sparse matrix-vector multiplication on modern architectures

被引：58

作者：

Goumas, Georgios ^{[1
]}

Kourtis, Kornilios ^{[1
]}

Anastopoulos, Nikos ^{[1
]}

Karakasis, Vasileios ^{[1
]}

Koziris, Nectarios ^{[1
]}

机构：

[1] Natl Tech Univ Athens, Comp Syst Lab, Sch Elect & Comp Engn, Zografos 15780, Greece

来源：

JOURNAL OF SUPERCOMPUTING | 2009年 / 50卷 / 01期

关键词：

Sparse matrix-vector multiplication; Multicore architectures; Scientific applications; Performance evaluation;

D O I：

10.1007/s11227-008-0251-8

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we revisit the performance issues of the widely used sparse matrix-vector multiplication (SpMxV) kernel on modern microarchitectures. Previous scientific work reports a number of different factors that may significantly reduce performance. However, the interaction of these factors with the underlying architectural characteristics is not clearly understood, a fact that may lead to misguided, and thus unsuccessful attempts for optimization. In order to gain an insight into the details of SpMxV performance, we conduct a suite of experiments on a rich set of matrices for three different commodity hardware platforms. In addition, we investigate the parallel version of the kernel and report on the corresponding performance results and their relation to each architecture's specific multithreaded configuration. Based on our experiments, we extract useful conclusions that can serve as guidelines for the optimization process of both single and multithreaded versions of the kernel.

引用

页码：36 / 77

页数：42

共 50 条

[31] Load-balancing in sparse matrix-vector multiplication
Nastea, SG
Frieder, O
ElGhazawi, T
EIGHTH IEEE SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1996, : 218 - 225
[32] Autotuning Runtime Specialization for Sparse Matrix-Vector Multiplication
Yilmaz, Buse
Aktemur, Baris
Garzaran, Maria J.
Kamin, Sam
Kirac, Furkan
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 13 (01)
[33] Sparse Matrix-Vector Multiplication Based on Online Arithmetic
Cherati, Sahar Moradi
Jaberipur, Ghassem
Sousa, Leonel
IEEE ACCESS, 2024, 12 : 87653 - 87664
[34] Implementing Sparse Matrix-Vector Multiplication with QCSR on GPU
Zhang, Jilin
Liu, Enyi
Wan, Jian
Ren, Yongjian
Yue, Miao
Wang, Jue
APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 473 - 482
[35] Communication balancing in parallel sparse matrix-vector multiplication
Bisseling, RH
Meesen, W
ELECTRONIC TRANSACTIONS ON NUMERICAL ANALYSIS, 2005, 21 : 47 - 65
[36] Sparse matrix-vector multiplication on network-on-chip
Sun, C-C
Goetze, J.
Jheng, H-Y
Ruan, S-J
ADVANCES IN RADIO SCIENCE, 2010, 8 : 289 - 294
[37] Optimization by Runtime Specialization for Sparse Matrix-Vector Multiplication
Kamin, Sam
Garzaran, Maria Jesus
Aktemur, Baris
Xu, Danqing
Yilmaz, Buse
Chen, Zhongbo
ACM SIGPLAN NOTICES, 2015, 50 (03) : 93 - 102
[38] A New Method of Sparse Matrix-Vector Multiplication on GPU
Huan, Gao
Qian, Zhang
PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 954 - 958
[39] A new approach for accelerating the sparse matrix-vector multiplication
Tvrdik, Pavel
Simecek, Ivan
SYNASC 2006: EIGHTH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, PROCEEDINGS, 2007, : 156 - +
[40] No Zero Padded Sparse Matrix-Vector Multiplication on FPGAs
Huang, Jiasen
Ren, Junyan
Yin, Wenbo
Wang, Lingli
PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2014, : 290 - 291

← 1 2 3 4 5 →