A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers

被引:1
|
作者
Bernaschi, Massimo [1 ]
Celestini, Alessandro [1 ]
Vella, Flavio [2 ]
D'Ambra, Pasqua [1 ]
机构
[1] Inst Appl Comp IAC CNR, I-00185 Rome, Italy
[2] Univ Trento, I-38122 Trento, Italy
关键词
GPU accelerators; heterogeneous computing; iterative sparse linear solvers; parallel numerical algorithms; scalability; PARALLEL;
D O I
10.1109/TPDS.2023.3287238
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present and release in open source format a sparse linear solver which efficiently exploits heterogeneous parallel computers. The solver can be easily integrated into scientific applications that need to solve large and sparse linear systems on modern parallel computers made of hybrid nodes hosting Nvidia Graphics Processing Unit (GPU) accelerators. The work extends previous efforts of some of the authors in the exploitation of a single GPU accelerator and proposes an implementation, based on the hybrid MPI-CUDA software environment, of a Krylov-type linear solver relying on an efficient Algebraic MultiGrid (AMG) preconditioner already available in the BootCMatchG library. Our design for the hybrid implementation has been driven by the best practices for minimizing data communication overhead when multiple GPUs are employed, yet preserving the efficiency of the GPU kernels. Strong and weak scalability results of the new version of the library on well-known benchmark test cases are discussed. Comparisons with the Nvidia AmgX solution show a speedup, in the solve phase, up to 2.0x.
引用
收藏
页码:2365 / 2376
页数:12
相关论文
共 50 条
  • [41] High Performance Multi-GPU SpMV for Multi-component PDE-Based Applications
    Abdelfattah, Ahmad
    Ltaief, Hatem
    Keyes, David
    EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 601 - 612
  • [42] MULTI-GPU BASED TWO-LEVEL ACCELERATION OF FULL WAVEFORM INVERSION
    Luo, Jingrui
    Gao, Jinghuai
    Wang, Baoli
    JOURNAL OF SEISMIC EXPLORATION, 2012, 21 (04): : 377 - 394
  • [43] Multi-GPU work sharing in a task-based dataflow programming model
    John, Joseph
    Milthorpe, Josh
    Herault, Thomas
    Bosilca, George
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 156 : 313 - 324
  • [44] Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures
    Xie, Chenhao
    Chen, Jieyang
    Firoz, Jesun
    Li, Jiajia
    Song, Shuaiwen Leon
    Barker, Kevin
    Raugas, Mark
    Li, Ang
    50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2021,
  • [45] Employing a Novel Consensus Optimization Strategy to Achieve Iterative Cone Beam CT Reconstruction On a Multi-GPU Platform
    Li, B.
    Tian, Z.
    Zhou, L.
    Jiang, S.
    Jia, X.
    MEDICAL PHYSICS, 2016, 43 (06) : 3344 - 3345
  • [46] Multi-GPU Island-Based Genetic Algorithm for Solving the Knapsack Problem
    Jaros, Jiri
    2012 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2012,
  • [47] Performance Analysis of Benchmarks for GPU-based Linear Programming Problem Solvers
    Shah, Usman Ali
    Yousaf, Suhail
    2019 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING AND DIGITAL SYSTEMS (C-CODE), 2019, : 132 - 136
  • [48] A Portable Multi-CPU/Multi-GPU Based Vertebra Localization in Sagittal MR Images
    Larhmam, Mohamed Amine
    Mahmoudi, Sidi Ahmed
    Benjelloun, Mohammed
    Mahmoudi, Said
    Manneback, Pierre
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT II, 2014, 8815 : 209 - 218
  • [49] Multi-Level Feature Aggregation-Based Joint Keypoint Detection and Description
    Li, Jun
    Li, Xiang
    Wei, Yifei
    Song, Mei
    Wang, Xiaojun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 2529 - 2540
  • [50] Solution of Large Sparse System of Linear Equations over GF(2) on a Multi-Node Multi-GPU Platform
    Rawal, Shruti
    Gupta, Indivar
    DEFENCE SCIENCE JOURNAL, 2022, 72 (06) : 836 - 845