Sparse Matrix-Vector Multiplication Cache Performance Evaluation and Design Exploration

被引：0

作者：

Cui, Jianfeng ^{[1
]}

Lu, Kai ^{[1
]}

Liu, Sheng ^{[2
]}

机构：

[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China

[2] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China

来源：

29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021) | 2021年

关键词：

SpMV; cache; sparse; matrix; PIN; simulation;

D O I：

10.1109/MASCOTS53633.2021.9614301

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we conducted a group of evaluations on the SpMV kernel with sequential implementation to investigate cache performance on single-core platforms. We verified a similar pattern inside a suite of sparse matrices covering various domains, which makes cache hit rate extraordinary inspiring in a sequential environment. This implicit regularity drove us to propose a cache space splitting approach, aiming at a better locality in dense vector accessing and utilization of large cache capacity in modern processors. Finally, we explored the design space of cache on Matrix 3000 GPDSP and proposed a group of cache parameters, based on our experimental results.

引用

页码：97 / 103

页数：7

共 50 条

[21] STRUCTURED SPARSE MATRIX-VECTOR MULTIPLICATION ON A MASPAR
DEHN, T
EIERMANN, M
GIEBERMANN, K
SPERLING, V
ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1994, 74 (06): : T534 - T538
[22] Sparse matrix-vector multiplication -: Final solution?
Simecek, Ivan
Tvrdik, Pavel
PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 156 - 165
[23] A hybrid format for better performance of sparse matrix-vector multiplication on a GPU
Guo, Dahai
Gropp, William
Olson, Luke N.
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2016, 30 (01): : 103 - 120
[24] Breaking the performance bottleneck of sparse matrix-vector multiplication on SIMD processors
Zhang, Kai
Chen, Shuming
Wang, Yaohua
Wan, Jianghua
IEICE ELECTRONICS EXPRESS, 2013, 10 (09):
[25] Evaluating the Performance Impact of Communication Imbalance in Sparse Matrix-Vector Multiplication
Utrera, Gladys
Gil, Marisa
Martorell, Xavier
23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 321 - 328
[26] Adaptive sparse matrix representation for efficient matrix-vector multiplication
Zardoshti, Pantea
Khunjush, Farshad
Sarbazi-Azad, Hamid
JOURNAL OF SUPERCOMPUTING, 2016, 72 (09): : 3366 - 3386
[27] Efficient sparse matrix-vector multiplication using cache oblivious extension quadtree storage format
Zhang, Jilin
Wan, Jian
Li, Fangfang
Mao, Jie
Zhuang, Li
Yuan, Junfeng
Liu, Enyi
Yu, Zhuoer
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 54 : 490 - 500
[28] Load-balancing in sparse matrix-vector multiplication
Nastea, SG
Frieder, O
ElGhazawi, T
EIGHTH IEEE SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1996, : 218 - 225
[29] Autotuning Runtime Specialization for Sparse Matrix-Vector Multiplication
Yilmaz, Buse
Aktemur, Baris
Garzaran, Maria J.
Kamin, Sam
Kirac, Furkan
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 13 (01)
[30] Sparse Matrix-Vector Multiplication Based on Online Arithmetic
Cherati, Sahar Moradi
Jaberipur, Ghassem
Sousa, Leonel
IEEE ACCESS, 2024, 12 : 87653 - 87664

← 1 2 3 4 5 →