A Communication Optimization Scheme for Basis Computation of Krylov Subspace Methods on Multi-GPUs

Authors
Chen, Langshi [1]
Petiton, Serge G. [1, 2]
Drummond, Leroy A. [3]
Hugues, Maxime [4]
Affiliations
[1] Digiteo Labs Bat 565 PC 190, Maison Simulat, USR3441, F-91191 Gif Sur Yvette, France
[2] Univ Sci & Technol Lille, Lab Informat Fondamentale Lille, F-59650 Villeneuve Dascq, France
[3] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[4] INRIA Saclay, F-91120 Palaiseau, France
Source
HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2014 | 2015 / Vol. 8969
Keywords
Krylov subspace; Auto-tuning; Arnoldi orthogonalization
DOI
10.1007/978-3-319-17353-5_1
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Krylov Subspace Methods (KSMs) are widely used for solving large-scale linear systems and eigenproblems. However, computing a Krylov subspace basis suffers from the overhead of the global reduction operations needed to compute the inner products in the orthogonalization steps. In this paper, a hypergraph-based communication optimization scheme is applied to the Arnoldi and incomplete Arnoldi methods of forming a Krylov subspace basis from a sparse matrix, and the features of these methods are compared analytically. Finally, experiments on a CPU-GPU heterogeneous cluster show that the optimization improves the Arnoldi implementations for a generic matrix and yields up to a 10x speedup for some matrices with a special diagonal structure. The performance advantage also varies with subspace size and matrix format, which calls for further integration of an auto-tuning strategy.
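To illustrate where the inner-product reductions mentioned in the abstract arise, the following is a minimal pure-Python sketch of Arnoldi orthogonalization with modified Gram-Schmidt. It is not the paper's hypergraph-based scheme or its GPU implementation. The names `dot`, `matvec`, `arnoldi`, and the parameter `q` are hypothetical, and "incomplete Arnoldi" is assumed here to mean orthogonalizing each new vector against only the last `q` basis vectors. The counter simply tallies dot products, each of which would become a global reduction if the vectors were distributed across devices.

```python
import math

def dot(u, v):
    # With distributed vectors, every dot product is a global reduction.
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    # Dense row-by-row product; a real code would use a sparse format.
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def arnoldi(A, v0, m, q=None):
    """Return (basis vectors, number of reductions) after up to m steps.

    q=None orthogonalizes against all previous vectors (full Arnoldi);
    q=k orthogonalizes against the last k vectors only (assumed meaning
    of incomplete Arnoldi in this sketch).
    """
    reductions = 1                      # norm of the starting vector
    beta = math.sqrt(dot(v0, v0))
    V = [[x / beta for x in v0]]
    for j in range(m):
        w = matvec(A, V[j])
        start = 0 if q is None else max(0, j + 1 - q)
        for i in range(start, j + 1):   # modified Gram-Schmidt sweep
            h = dot(w, V[i])
            reductions += 1
            w = [wk - h * vk for wk, vk in zip(w, V[i])]
        norm = math.sqrt(dot(w, w))
        reductions += 1
        if norm < 1e-12:                # breakdown: subspace is invariant
            break
        V.append([x / norm for x in w])
    return V, reductions

A = [[2.0, 1.0, 0.0, 0.0],
     [1.0, 3.0, 1.0, 0.0],
     [0.0, 1.0, 4.0, 1.0],
     [0.0, 0.0, 1.0, 5.0]]
v0 = [1.0, 0.0, 0.0, 0.0]

V_full, red_full = arnoldi(A, v0, 3)        # full orthogonalization
V_inc, red_inc = arnoldi(A, v0, 3, q=1)     # truncated orthogonalization
print(red_full, red_inc)
```

In the full variant the number of reductions per step grows with the subspace size, while the truncated variant keeps it constant at the cost of losing orthogonality against older vectors.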
Pages: 3 - 16
Page count: 14
Related papers
50 in total
  • [31] Performance Analysis of Multi-GPU Implementations of Krylov-Subspace Methods Applied to FEA of Electromagnetic Phenomena
    Peixoto de Camargos, Ana Flavia
    Silva, Viviane Cristine
    IEEE TRANSACTIONS ON MAGNETICS, 2015, 51 (03)
  • [32] Efficient Computation of Electromagnetic Wave Fields on Unbounded Domains Using Stability-Corrected Wave Functions and Krylov Subspace Projection Methods
    Druskin, V.
    Remis, R. F.
    Zaslavsky, M.
    Zimmerling, J. T.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTROMAGNETICS IN ADVANCED APPLICATIONS (ICEAA), 2015 : 19 - 22
  • [33] Acceleration of phase-field lattice Boltzmann simulation of dendrite growth with thermosolutal convection by the multi-GPUs parallel computation with multiple mesh and time step method
    Sakane, Shinji
    Takaki, Tomohiro
    Ohno, Munekazu
    Shibuta, Yasushi
    Aoki, Takayuki
    MODELLING AND SIMULATION IN MATERIALS SCIENCE AND ENGINEERING, 2019, 27 (05)
  • [34] Comparison of preconditioned Krylov subspace iteration methods for PDE-constrained optimization problems: Poisson and convection-diffusion control
    Owe Axelsson
    Shiraz Farouq
    Maya Neytcheva
    Numerical Algorithms, 2016, 73 : 631 - 663
  • [35] Metric and Bregman projections onto affine subspaces and their computation via sequential subspace optimization methods
    Schoepfer, F.
    Schuster, T.
    Louis, A. K.
    JOURNAL OF INVERSE AND ILL-POSED PROBLEMS, 2008, 16 (05) : 479 - 506
  • [36] Optimization of the parallel semi-Lagrangian scheme based on overlapping communication with computation in the YHGSM
    Jiang, Tao
    Wu, Jianping
    Liu, Zhaoyang
    Zhao, Wenpeng
    Zhang, Yongshun
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2021, 147 (737) : 2293 - 2302
  • [37] Joint Optimization of Computation and Communication Power in Multi-User Massive MIMO Systems
    Ge, Xiaohu
    Sun, Yang
    Gharavi, Hamid
    Thompson, John
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (06) : 4051 - 4063
  • [38] Optimization of the parallel semi-Lagrangian scheme to overlap computation with communication based on grouping levels in YHGSM
    Liu, Dazheng
    Liu, Wenjuan
    Pan, Liangrui
    Dou, Yutao
    Wu, Jianping
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2024, 6 (01) : 68 - 77
  • [39] Optimization of the parallel semi-Lagrangian scheme to overlap computation with communication based on grouping levels in YHGSM
    Dazheng Liu
    Wenjuan Liu
    Liangrui Pan
    Yutao Dou
    Jianping Wu
    CCF Transactions on High Performance Computing, 2024, 6 (1) : 68 - 77
  • [40] An multi-objective optimization scheme and its application based on sequential radial basis function
    Chen, Guodong
    Bu, Jiling
    Qiche Gongcheng/Automotive Engineering, 2015, 37 (09) : 1077 - 1083