A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers

被引:1
|
作者
Bernaschi, Massimo [1 ]
Celestini, Alessandro [1 ]
Vella, Flavio [2 ]
D'Ambra, Pasqua [1 ]
机构
[1] Inst Appl Comp IAC CNR, I-00185 Rome, Italy
[2] Univ Trento, I-38122 Trento, Italy
关键词
GPU accelerators; heterogeneous computing; iterative sparse linear solvers; parallel numerical algorithms; scalability; PARALLEL;
D O I
10.1109/TPDS.2023.3287238
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present and release in open source format a sparse linear solver which efficiently exploits heterogeneous parallel computers. The solver can be easily integrated into scientific applications that need to solve large and sparse linear systems on modern parallel computers made of hybrid nodes hosting Nvidia Graphics Processing Unit (GPU) accelerators. The work extends previous efforts of some of the authors in the exploitation of a single GPU accelerator and proposes an implementation, based on the hybrid MPI-CUDA software environment, of a Krylov-type linear solver relying on an efficient Algebraic MultiGrid (AMG) preconditioner already available in the BootCMatchG library. Our design for the hybrid implementation has been driven by the best practices for minimizing data communication overhead when multiple GPUs are employed, yet preserving the efficiency of the GPU kernels. Strong and weak scalability results of the new version of the library on well-known benchmark test cases are discussed. Comparisons with the Nvidia AmgX solution show a speedup, in the solve phase, up to 2.0x.
引用
收藏
页码:2365 / 2376
页数:12
相关论文
共 50 条
  • [21] A Jacobi_PCG solver for sparse linear systems on multi-GPU cluster
    Lin, Shaozhong
    Xie, Zhiqiang
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (01): : 433 - 454
  • [22] AgrUNet: A Multi-GPU UNet Based Model for Crops Classification
    Miola, Andrea
    Calore, Enrico
    Schifano, Sebastiano Fabio
    IEEE ACCESS, 2024, 12 : 158065 - 158078
  • [23] A Jacobi_PCG solver for sparse linear systems on multi-GPU cluster
    Shaozhong Lin
    Zhiqiang Xie
    The Journal of Supercomputing, 2017, 73 : 433 - 454
  • [24] Multi-GPU based on multicriteria optimization for motion estimation system
    Garcia, Carlos
    Botella, Guillermo
    Ayuso, Fermin
    Prieto, Manuel
    Tirado, Francisco
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [25] Strength Check of Aircraft Parts Based on Multi-GPU Clusters for Fast Calculation of Sparse Linear Equations
    Zhang, Yuhua
    Hu, Binxing
    IEEE ACCESS, 2020, 8 : 77188 - 77203
  • [26] An improved direct linear equation solver using multi-GPU in multi-body dynamics
    Jung, Ji-Hyun
    Bae, Dae-Sung
    ADVANCES IN ENGINEERING SOFTWARE, 2018, 115 : 87 - 102
  • [27] MG-FIM: A MULTI-GPU FAST ITERATIVE METHOD USING ADAPTIVE DOMAIN DECOMPOSITION
    Hong, Sumin
    Jang, Ganghee
    Jeong, Won-Ki
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2022, 44 (01): : C54 - C76
  • [28] Cluster-Based Approach to a Multi-GPU CT Reconstruction Algorithm
    Orr, Laurel J.
    Jimenez, Edward S.
    Thompson, Kyle R.
    2014 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2014,
  • [29] Neon: A Multi-GPU Programming Model for Grid-based Computations
    Meneghin, Massimiliano
    Mahmoud, Ahmed H.
    Jayaraman, Pradeep Kumar
    Morris, Nigel J. W.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 817 - 827
  • [30] Multi-GPU based Fast Electromagnetic Simulation Method for Analysing PCB
    Inoue, Yuta
    Asai, Hideki
    2017 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (APEMC), 2017, : 297 - 299