Iterative Sparse Matrix-Vector Multiplication on In-Memory Cluster Computing Accelerated by GPUs for Big Data

Cited by: 0

Authors:

Peng, Jiwu [1,2]

Xiao, Zheng [1,2]

Chen, Cen [1,2]

Yang, Wangdong [1,2]

Affiliations:

[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China

[2] Natl Supercomp Ctr Changsha, Changsha 410082, Hunan, Peoples R China

Source:

2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD) | 2016

Keywords:

Iterative SpMV; Flink; GPU; In-memory Computing; BigData;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Iterative SpMV (ISpMV) is a key operation in many graph-based data mining and machine learning algorithms. With the growth of big data, matrices can become so large, possibly billion-scale, that SpMV cannot be performed on a single computer. Implementing and optimizing SpMV for large-scale data sets is therefore challenging. In this paper, we use in-memory heterogeneous CPU-GPU cluster computing platforms (IMHCPs) to solve billion-scale SpMV problems efficiently. We propose a dedicated and efficient hierarchical partitioning strategy for the sparse matrices and the vector. The strategy partitions sparse matrices among the workers in the cluster and among the GPUs within each worker. Moreover, the performance of the IMHCPs-based SpMV is evaluated in terms of computational efficiency and scalability.
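
The two-level partitioning named in the abstract can be illustrated with a minimal sketch. The abstract does not say how the matrix is split, so the contiguous row-block scheme here is an assumption, as are all function names; workers and GPUs are simulated sequentially in plain Python, with no Flink or CUDA calls.

```python
from typing import Dict, List, Tuple

Triple = Tuple[int, int, float]  # (row, col, value) in coordinate (COO) form


def split_range(start: int, stop: int, parts: int) -> List[Tuple[int, int]]:
    """Split [start, stop) into `parts` contiguous, near-equal half-open ranges."""
    size, extra = divmod(stop - start, parts)
    ranges, lo = [], start
    for p in range(parts):
        hi = lo + size + (1 if p < extra else 0)
        ranges.append((lo, hi))
        lo = hi
    return ranges


def hierarchical_partition(
    entries: List[Triple], n_rows: int, n_workers: int, gpus_per_worker: int
) -> Dict[Tuple[int, int], List[Triple]]:
    """Assumed scheme: level 1 gives each worker a row block,
    level 2 splits that block among the worker's GPUs."""
    plan: Dict[Tuple[int, int], List[Triple]] = {}
    for w, (w_lo, w_hi) in enumerate(split_range(0, n_rows, n_workers)):
        for g, (g_lo, g_hi) in enumerate(split_range(w_lo, w_hi, gpus_per_worker)):
            plan[(w, g)] = [e for e in entries if g_lo <= e[0] < g_hi]
    return plan


def spmv(plan: Dict[Tuple[int, int], List[Triple]], x: List[float], n_rows: int) -> List[float]:
    """Compute y = A x; each (worker, gpu) block writes only its own rows of y."""
    y = [0.0] * n_rows
    for block in plan.values():  # on a real cluster these blocks would run in parallel
        for r, c, v in block:
            y[r] += v * x[c]
    return y


def iterative_spmv(plan, x: List[float], n_rows: int, iterations: int) -> List[float]:
    """Apply x <- A x repeatedly (A must be square); the partition plan is reused."""
    for _ in range(iterations):
        x = spmv(plan, x, n_rows)
    return x


# Hypothetical 4x4 example: 2 workers with 2 GPUs each.
entries = [(0, 0, 2.0), (1, 1, 3.0), (2, 0, 1.0), (2, 3, 1.0), (3, 2, 4.0)]
plan = hierarchical_partition(entries, n_rows=4, n_workers=2, gpus_per_worker=2)
result = iterative_spmv(plan, [1.0] * 4, n_rows=4, iterations=2)
```

Because the partition plan is built once and reused across iterations, only the vector changes between rounds, which is the property that makes keeping the matrix blocks in memory attractive for iterative workloads.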


Pages: 1454 - 1460

Page count: 7

50 records in total

[1] Iterative Sparse Matrix-Vector Multiplication for Integer Factorization on GPUs
Schmidt, Bertil
Aribowo, Hans
Dang, Hoang-Vu
EURO-PAR 2011 PARALLEL PROCESSING, PT 2, 2011, 6853 : 413 - 424
[2] Scaleable Sparse Matrix-Vector Multiplication with Functional Memory and GPUs
Tanabe, Noboru
Ogawa, Yuuka
Takata, Masami
Joe, Kazuki
PROCEEDINGS OF THE 19TH INTERNATIONAL EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2011 : 101 - 108
[3] Optimization techniques for sparse matrix-vector multiplication on GPUs
Maggioni, Marco
Berger-Wolf, Tanya
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2016, 93-94 : 66 - 86
[4] Leveraging Memory Copy Overlap for Efficient Sparse Matrix-Vector Multiplication on GPUs
Zeng, Guangsen
Zou, Yi
ELECTRONICS, 2023, 12 (17)
[5] Dual in-memory computing of matrix-vector multiplication for accelerating neural networks
Wang, Shiqing
Sun, Zhong
DEVICE, 2024, 2 (12)
[6] Time Complexity of In-Memory Matrix-Vector Multiplication
Sun, Zhong
Huang, Ru
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (08) : 2785 - 2789
[7] GPU accelerated sparse matrix-vector multiplication and sparse matrix-transpose vector multiplication
Tao, Yuan
Deng, Yangdong
Mu, Shuai
Zhang, Zhenzhong
Zhu, Mingfa
Xiao, Limin
Ruan, Li
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (14) : 3771 - 3789
[8] Implementing Blocked Sparse Matrix-Vector Multiplication on NVIDIA GPUs
Monakov, Alexander
Avetisyan, Arutyun
EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, PROCEEDINGS, 2009, 5657 : 289 - 297
[9] Optimization of Sparse Matrix-Vector Multiplication with Variant CSR on GPUs
Feng, Xiaowen
Jin, Hai
Zheng, Ran
Hu, Kan
Zeng, Jingxiang
Shao, Zhiyuan
2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011 : 165 - 172
[10] Multiple-precision sparse matrix-vector multiplication on GPUs
Isupov, Konstantin
JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 61
