A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers

被引:1
|
作者
Bernaschi, Massimo [1 ]
Celestini, Alessandro [1 ]
Vella, Flavio [2 ]
D'Ambra, Pasqua [1 ]
机构
[1] Inst Appl Comp IAC CNR, I-00185 Rome, Italy
[2] Univ Trento, I-38122 Trento, Italy
关键词
GPU accelerators; heterogeneous computing; iterative sparse linear solvers; parallel numerical algorithms; scalability; PARALLEL;
D O I
10.1109/TPDS.2023.3287238
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present and release in open source format a sparse linear solver which efficiently exploits heterogeneous parallel computers. The solver can be easily integrated into scientific applications that need to solve large and sparse linear systems on modern parallel computers made of hybrid nodes hosting Nvidia Graphics Processing Unit (GPU) accelerators. The work extends previous efforts of some of the authors in the exploitation of a single GPU accelerator and proposes an implementation, based on the hybrid MPI-CUDA software environment, of a Krylov-type linear solver relying on an efficient Algebraic MultiGrid (AMG) preconditioner already available in the BootCMatchG library. Our design for the hybrid implementation has been driven by the best practices for minimizing data communication overhead when multiple GPUs are employed, yet preserving the efficiency of the GPU kernels. Strong and weak scalability results of the new version of the library on well-known benchmark test cases are discussed. Comparisons with the Nvidia AmgX solution show a speedup, in the solve phase, up to 2.0x.
引用
收藏
页码:2365 / 2376
页数:12
相关论文
共 50 条
  • [1] Two-stage Asynchronous Iterative Solvers for multi-GPU Clusters
    Nayak, Pratik
    Cojean, Terry
    Anzt, Hartwig
    PROCEEDINGS OF SCALA 2020: 11TH WORKSHOP ON LATEST ADVANCES IN SCALABLE ALGORITHMS FOR LARGE-SCALE SYSTEMS, 2020, : 9 - 18
  • [2] Multi-GPU Communication Schemes for Iterative Solvers: When CPUs are Not in Charge
    Ismayilov, Ismayil
    Baydamirli, Javid
    Sagbili, Dogan
    Wahib, Mohamed
    Unat, Didem
    Proceedings of the International Conference on Supercomputing, 2023, : 192 - 202
  • [3] Multi-GPU Communication Schemes for Iterative Solvers: When CPUs are Not in Charge
    Ismayilov, Ismayil
    Baydamirli, Javid
    Sagbili, Dogan
    Wahib, Mohamed
    Unat, Didem
    PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2023, 2023, : 192 - 202
  • [4] Multi-GPU Kinetic Solvers using MPI and CUDA
    Zabelok, Sergey
    Arslanbekov, Robert
    Kolobov, Vladimir
    PROCEEDINGS OF THE 29TH INTERNATIONAL SYMPOSIUM ON RAREFIED GAS DYNAMICS, 2014, 1628 : 539 - 546
  • [5] An aggregation-based domain decomposition preconditioner for groundwater flow
    Jenkins, EW
    Kees, CE
    Kelley, CT
    Miller, CT
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2001, 23 (02): : 430 - 441
  • [6] Improving solve time of aggregation-based adaptive AMG
    D'Ambra, Pasqua
    Vassilevski, Panayot S.
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2019, 26 (06)
  • [7] Multi-GPU based Cluster System for CT Iterative Reconstruction Algorithm
    Lu, Wan-li
    Yan, Bin
    Chen, Jian-lin
    Cai, Ai-long
    Li, Lei
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MECHATRONICS AND INDUSTRIAL INFORMATICS, 2015, 31 : 881 - 886
  • [8] GPU-accelerated preconditioned iterative linear solvers
    Ruipeng Li
    Yousef Saad
    The Journal of Supercomputing, 2013, 63 : 443 - 466
  • [9] GPU-accelerated preconditioned iterative linear solvers
    Li, Ruipeng
    Saad, Yousef
    JOURNAL OF SUPERCOMPUTING, 2013, 63 (02): : 443 - 466
  • [10] Parallel Iterative Linear Solvers on GPU: A Financial Engineering Case
    Gaikwad, Abhijeet
    Toke, Ioane Muni
    PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, : 607 - 614