A Communication Optimization Scheme for Basis Computation of Krylov Subspace Methods on Multi-GPUs

Cited by: 0
Authors
Chen, Langshi [1 ]
Petiton, Serge G. [1 ,2 ]
Drummond, Leroy A. [3 ]
Hugues, Maxime [4 ]
Affiliations
[1] Digiteo Labs Bat 565 PC 190, Maison Simulat, USR3441, F-91191 Gif Sur Yvette, France
[2] Univ Sci & Technol Lille, Lab Informat Fondamentale Lille, F-59650 Villeneuve Dascq, France
[3] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[4] INRIA Saclay, F-91120 Palaiseau, France
Source
HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2014 | 2015 / Vol. 8969
Keywords
Krylov subspace; Auto-tuning; Arnoldi orthogonalization;
DOI
10.1007/978-3-319-17353-5_1
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Krylov Subspace Methods (KSMs) are widely used for solving large-scale linear systems and eigenproblems. However, computing a Krylov subspace basis suffers from the overhead of the global reduction operations needed for the vector inner products in the orthogonalization steps. In this paper, a hypergraph-based communication optimization scheme is applied to the Arnoldi and incomplete Arnoldi methods for forming a Krylov subspace basis from a sparse matrix, and the features of these methods are compared analytically. Finally, experiments on a CPU-GPU heterogeneous cluster show that the optimization improves the Arnoldi implementations for a generic matrix and yields up to a 10x speedup for certain matrices with a special diagonal structure. The performance advantage also varies with subspace size and matrix format, which calls for further integration of an auto-tuning strategy.
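The reduction overhead mentioned in the abstract can be made concrete with a small sketch. The code below is not the authors' implementation and does not include the hypergraph scheme; it is a plain-Python Arnoldi process with modified Gram-Schmidt that counts the inner products and norms which would each become a global reduction on a distributed machine. The names `arnoldi`, `window`, and `reductions` are hypothetical, and the sketch assumes that "incomplete Arnoldi" means truncated orthogonalization against only the last few basis vectors.

```python
import math

def dot(x, y):
    # Inner product; across many processes or GPUs this is a global reduction.
    return sum(a * b for a, b in zip(x, y))

def matvec(A, x):
    # Dense product for brevity; a real code would use a sparse format.
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def arnoldi(A, v0, m, window=None):
    """Run m Arnoldi steps with modified Gram-Schmidt.

    window=None orthogonalizes against all previous vectors (full Arnoldi).
    window=q uses only the last q vectors (truncated variant, an assumption).
    Returns (V, H, reductions); reductions counts dot and norm evaluations.
    """
    reductions = 1
    beta = math.sqrt(dot(v0, v0))
    V = [[x / beta for x in v0]]
    H = [[0.0] * m for _ in range(m + 1)]
    for j in range(m):
        w = matvec(A, V[j])
        start = 0 if window is None else max(0, j - window + 1)
        for i in range(start, j + 1):
            H[i][j] = dot(w, V[i])
            reductions += 1
            w = [wk - H[i][j] * vk for wk, vk in zip(w, V[i])]
        H[j + 1][j] = math.sqrt(dot(w, w))
        reductions += 1
        if H[j + 1][j] < 1e-12:
            break  # invariant subspace reached, no further basis vector
        V.append([wk / H[j + 1][j] for wk in w])
    return V, H, reductions

# Small tridiagonal example matrix (illustrative data only).
A = [[2.0, 1.0, 0.0, 0.0],
     [1.0, 3.0, 1.0, 0.0],
     [0.0, 1.0, 4.0, 1.0],
     [0.0, 0.0, 1.0, 5.0]]
v0 = [1.0, 0.0, 0.0, 0.0]
_, _, full_count = arnoldi(A, v0, 3)
_, _, truncated_count = arnoldi(A, v0, 3, window=1)
print(full_count, truncated_count)
```

In this sketch the full variant needs 1 + m(m+3)/2 reductions for m steps, so the count grows quadratically with the subspace size, while a fixed window keeps it linear in m at the cost of losing full orthogonality of the basis.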
Pages: 3-16
Page count: 14