Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster

被引:18
|
作者
Yamazaki, Ichitaro [1 ]
Rajamanickam, Sivasankaran [2 ]
Boman, Erik G. [2 ]
Hoemmen, Mark [2 ]
Heroux, Michael A. [2 ]
Tomov, Stanimire [1 ]
机构
[1] Univ Tennessee, Knoxville, TN 37996 USA
[2] Sandia Natl Labs, POB 5800, Albuquerque, NM 87185 USA
关键词
LINEAR-SYSTEMS; IMPLEMENTATION; GMRES;
D O I
10.1109/SC.2014.81
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Krylov subspace projection methods are widely used iterative methods for solving large-scale linear systems of equations. Researchers have demonstrated that communication-avoiding (CA) techniques can improve Krylov methods' performance on modern computers, where communication is becoming increasingly expensive compared to arithmetic operations. In this paper, we extend these studies by two major contributions. First, we present our implementation of a CA variant of the Generalized Minimum Residual (GMRES) method, called CA-GMRES, for solving nonsymmetric linear systems of equations on a hybrid CPU/GPU cluster. Our performance results on up to 120 GPUs show that CA-GMRES gives a speedup of up to 2.5x in total solution time over standard GMRES on a hybrid cluster with twelve Intel Xeon CPUs and three Nvidia Fermi GPUs on each node. We then outline a domain decomposition framework to introduce a family of preconditioners that are suitable for CA Krylov methods. Our preconditioners do not incur any additional communication and allow the easy reuse of existing algorithms and software for the subdomain solves. Experimental results on the hybrid CPU/GPU cluster demonstrate that CA-GMRES with preconditioning achieve a speedup of up to 7.4x over CA-GMRES without preconditioning, and speedup of up to 1.7x over GMRES with preconditioning in total solution time. These results confirm the potential of our framework to develop a practical and effective preconditioned CA Krylov method.
引用
收藏
页码:933 / 944
页数:12
相关论文
共 50 条
  • [31] Schwarz domain decomposition preconditioners for plane wave discontinuous Galerkin methods
    Antonietti, Paola F.
    Perugia, Ilaria
    Davide, Zaliani
    Lecture Notes in Computational Science and Engineering, 2015, 103 : 557 - 572
  • [32] Domain decomposition methods and multilevel preconditioners for nonconforming and mixed methods for partial differential problems
    Chen, ZX
    Ewing, RE
    DOMAIN DECOMPOSITION METHODS IN SCIENCES AND ENGINEERING, 1997, : 213 - 220
  • [33] Hybrid CPU-GPU Solver for Gradient Domain Processing of Massive Images
    Philip, Sujin
    Summa, Brian
    Pascucci, Valerio
    Bremer, Peer-Timo
    2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011, : 244 - 251
  • [34] Left-Preconditioned Communication-Avoiding Conjugate Gradient Methods for Multiphase CFD Simulations on the K Computer
    Mayumi, Akie
    Idomura, Yasuhiro
    Ina, Takuya
    Yamada, Susumu
    Imamura, Toshiyuki
    PROCEEDINGS OF SCALA 2016: 7TH WORKSHOP ON LATEST ADVANCES IN SCALABLE ALGORITHMS FOR LARGE-SCALE SYSTEMS, 2016, : 17 - 24
  • [35] A Hybrid Parallel Algorithm for Computer Simulation of Electrocardiogram Based on a CPU-GPU Cluster
    Shen, Wenfeng
    Sun, Lianqiang
    Wei, Daming
    Xu, Weimin
    Wang, Hui
    Zhu, Xin
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 167 - 171
  • [36] Domain Decomposition Preconditioners for Discontinuous Galerkin Methods for Elliptic Problems on Complicated Domains
    Paola F. Antonietti
    Stefano Giani
    Paul Houston
    Journal of Scientific Computing, 2014, 60 : 203 - 227
  • [37] Domain Decomposition Preconditioners for Discontinuous Galerkin Methods for Elliptic Problems on Complicated Domains
    Antonietti, Paola F.
    Giani, Stefano
    Houston, Paul
    JOURNAL OF SCIENTIFIC COMPUTING, 2014, 60 (01) : 203 - 227
  • [38] Prediction Model for Scheduling an Irregular Graph Algorithms on CPU-GPU Hybrid Cluster Framework
    Chandrashekhar, B. N.
    Sanjay, H. A.
    Lakshmi, H.
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 584 - 589
  • [39] A Class of Domain Decomposition Preconditioners for hp-Discontinuous Galerkin Finite Element Methods
    Antonietti, Paola F.
    Houston, Paul
    JOURNAL OF SCIENTIFIC COMPUTING, 2011, 46 (01) : 124 - 149
  • [40] A Class of Domain Decomposition Preconditioners for hp-Discontinuous Galerkin Finite Element Methods
    Paola F. Antonietti
    Paul Houston
    Journal of Scientific Computing, 2011, 46 : 124 - 149