Spatiotemporal Graph and Hypergraph Partitioning Models for Sparse Matrix-Vector Multiplication on Many-Core Architectures

被引：7

作者：

Abubaker, Nabil ^{[1
]}

Akbudak, Kadir ^{[2
]}

Aykanat, Cevdet ^{[1
]}

机构：

[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey

[2] King Abdullah Univ Sci & Technol, KSA, Extreme Comp Res Ctr, Dept Appl Math & Computat, Thuwal 23955, Saudi Arabia

来源：

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS | 2019年 / 30卷 / 02期

关键词：

Sparse matrix; sparse matrix-vector multiplication; data locality; spatial locality; temporal locality; hypergraph model; bipartite graph model; graph model; hypergraph partitioning; graph partitioning; Intel many integrated core architecture; Intel Xeon Phi; EXPLOITING LOCALITY; PERFORMANCE;

D O I：

10.1109/TPDS.2018.2864729

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

There exist graph/hypergraph partitioning-based row/column reordering methods for encoding either spatial or temporal locality for sparse matrix-vector multiplication (SpMV) operations. Spatial and temporal hypergraph models in these methods are extended to encapsulate both spatial and temporal localities based on cut/uncut net categorization obtained from vertex partitioning. These extensions of spatial and temporal hypergraph models encode the spatial locality primarily and the temporal locality secondarily, and vice-versa, respectively. However, the literature lacks models that simultaneously encode both spatial and temporal localities utilizing only vertex partitioning for further improving the performance of SpMV on shared-memory architectures. In order to fill this gap, we propose a novel spatiotemporal hypergraph model that leads to a one-phase spatiotemporal reordering method which encodes both types of locality simultaneously. We also propose a framework for spatiotemporal methods which encodes both types of locality in two dependent phases and two separate phases. The validity of the proposed spatiotemporal models and methods are tested on a wide range of sparse matrices and the experiments are performed on both a 60-core Intel Xeon Phi processor and a Xeon processor. Results show the validity of the methods via almost doubling the Gflop/s performance through enhancing data locality in parallel SpMV operations.

引用

页码：445 / 458

页数：14

共 50 条

[41] Optimization of Block Sparse Matrix-Vector Multiplication on Shared-Memory Parallel Architectures
Eberhardt, Ryan
Hoemmen, Mark
2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 663 - 672
[42] Optimizing Sparse Tensor Times Matrix on Multi-core and Many-core Architectures
Li, Jiajia
Ma, Yuchen
Yan, Chenggang
Vuduc, Richard
PROCEEDINGS OF 2016 6TH WORKSHOP ON IRREGULAR APPLICATIONS: ARCHITECTURE AND ALGORITHMS (IA3), 2016, : 26 - 33
[43] Graph Reachability on Parallel Many-Core Architectures
Quer, Stefano
Calabrese, Andrea
COMPUTATION, 2020, 8 (04) : 1 - 26
[44] Adaptive sparse matrix representation for efficient matrix-vector multiplication
Zardoshti, Pantea
Khunjush, Farshad
Sarbazi-Azad, Hamid
JOURNAL OF SUPERCOMPUTING, 2016, 72 (09): : 3366 - 3386
[45] Load-balancing in sparse matrix-vector multiplication
Nastea, SG
Frieder, O
ElGhazawi, T
EIGHTH IEEE SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1996, : 218 - 225
[46] Autotuning Runtime Specialization for Sparse Matrix-Vector Multiplication
Yilmaz, Buse
Aktemur, Baris
Garzaran, Maria J.
Kamin, Sam
Kirac, Furkan
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 13 (01)
[47] Sparse Matrix-Vector Multiplication Based on Online Arithmetic
Cherati, Sahar Moradi
Jaberipur, Ghassem
Sousa, Leonel
IEEE ACCESS, 2024, 12 : 87653 - 87664
[48] Energy Evaluation of Sparse Matrix-Vector Multiplication on GPU
Benatia, Akrem
Ji, Weixing
Wang, Yizhuo
Shi, Feng
2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2016,
[49] Implementing Sparse Matrix-Vector Multiplication with QCSR on GPU
Zhang, Jilin
Liu, Enyi
Wan, Jian
Ren, Yongjian
Yue, Miao
Wang, Jue
APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 473 - 482
[50] Communication balancing in parallel sparse matrix-vector multiplication
Bisseling, RH
Meesen, W
ELECTRONIC TRANSACTIONS ON NUMERICAL ANALYSIS, 2005, 21 : 47 - 65

← 1 2 3 4 5 →