A hybrid CPU/GPU approach for the parallel algebraic recursive multilevel solver pARMS

被引:0
|
作者
Jamal, Aygul [1 ]
Baboulin, Marc [1 ]
Khabou, Amal [1 ]
Sosonkina, Masha [2 ]
机构
[1] Univ Paris Sud, F-91405 Orsay, France
[2] Old Dominion Univ, Norfolk, VA 23529 USA
基金
美国国家科学基金会;
关键词
SCHEMES; GPU;
D O I
10.1109/SYNASC.2016.63
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We illustrate how the distributed parallel Algebraic Recursive Multilevel Solver based on MPI can be adapted for heterogeneous CPU/GPU architectures. The tasks performed on the GPU are related to the preconditioning of each part of the distributed matrix (local preconditioning) which is handled in the distributed version by each MPI process. The solving step remains on the CPU. In our implementation, the local preconditioning can be based either on the randomization of the last Schur complement system in the multilevel recursive process, or on an Incomplete LU factorization from the MAGMA library. Numerical experiments show that a promising performance improvement can be obtained using either randomized multilevel recursive preconditioning or Incomplete LU preconditioning for large enough matrices. Each preconditioning method ensures a good performance for a given set of matrices.
引用
收藏
页码:411 / 416
页数:6
相关论文
共 50 条
  • [1] pARMS: a parallel version of the algebraic recursive multilevel solver
    Li, ZZ
    Saad, Y
    Sosonkina, M
    [J]. NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2003, 10 (5-6) : 485 - 509
  • [2] Using the parallel algebraic recursive multilevel solver in modern physical applications
    Sosonkina, M
    Saad, Y
    Cai, X
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2004, 20 (03): : 489 - 500
  • [3] PARALLEL SOLVER FOR SHIFTED SYSTEMS IN A HYBRID CPU-GPU FRAMEWORK
    Bosnery, Nela
    Bujanovic, Zvonimir
    Drmac, Zlatko
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04): : C605 - C633
  • [4] CPU/GPU Heterogeneous Parallel CFD Solver and Optimizations
    Lai, Jianqi
    Li, Hua
    Tian, Zhengyu
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON SERVICE ROBOTICS TECHNOLOGIES (ICSRT 2018), 2018, : 88 - 92
  • [5] A CFD Heterogeneous Parallel Solver Based on Collaborating CPU and GPU
    Lai, Jianqi
    Tian, Zhengyu
    Li, Hua
    Pan, Sha
    [J]. 3RD INTERNATIONAL CONFERENCE ON MECHANICAL AND AERONAUTICAL ENGINEERING (ICMAE 2017), 2018, 326
  • [6] ARMS: an algebraic recursive multilevel solver for general sparse linear systems
    Saad, Y
    Suchomel, B
    [J]. NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2002, 9 (05) : 359 - 378
  • [7] An incompressible flow solver on a GPU/CPU heterogeneous architecture parallel computing platform
    Qianqian Li
    Rong Li
    Zixuan Yang
    [J]. Theoretical and Applied Mechanics Letters, 2023, 13 (05) : 387 - 393
  • [8] An incompressible flow solver on a GPU/CPU heterogeneous architecture parallel computing platform
    Li, Qianqian
    Li, Rong
    Yang, Zixuan
    [J]. THEORETICAL AND APPLIED MECHANICS LETTERS, 2023, 13 (05)
  • [9] Developing a CPU-GPU LES Parallel Solver for Canonical Turbulent Flows
    Nafiseh ZendehAli
    Homayoun Emdad
    Omid Abouali
    [J]. Iranian Journal of Science and Technology, Transactions of Mechanical Engineering, 2023, 47 : 1535 - 1551
  • [10] Developing a CPU-GPU LES Parallel Solver for Canonical Turbulent Flows
    ZendehAli, Nafiseh
    Emdad, Homayoun
    Abouali, Omid
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF MECHANICAL ENGINEERING, 2023, 47 (04) : 1535 - 1551