PARALLEL SOLVER FOR SHIFTED SYSTEMS IN A HYBRID CPU-GPU FRAMEWORK

Cited by: 2
Authors
Bosner, Nela [1]
Bujanovic, Zvonimir [1]
Drmac, Zlatko [1]
Affiliations
[1] Univ Zagreb, Dept Math, Fac Sci, Zagreb 10000, Croatia
Source
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2018, Vol. 40, No. 4
Keywords
GPU; Hessenberg matrix; interpolatory model reduction; parallel solver; pseudospectrum; shifted linear systems; transfer function; MODEL-REDUCTION; LINEAR-SYSTEMS; PSEUDOSPECTRA; CONTINUATION; COMPUTATION; HESSENBERG; FORMS; BICG;
DOI
10.1137/17M1144465
Chinese Library Classification
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
This paper proposes a combination of a hybrid CPU-GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems (A - sigma I) X = B with a large number of complex shifts sigma and multiple right-hand sides. Such problems often appear, e.g., in control theory when evaluating the transfer function, as part of an algorithm performing interpolatory model reduction, when computing pseudospectra and structured pseudospectra, or when solving large linear systems of ordinary differential equations. The proposed algorithm first jointly reduces the general full n x n matrix A and the full n x m right-hand side matrix B to the controller Hessenberg canonical form, which facilitates efficient solution: A is transformed to a so-called m-Hessenberg form, and B is made upper triangular. This is implemented as a blocked, highly parallel CPU-GPU hybrid algorithm; individual blocks are reduced by the CPU, and the necessary updates of the rest of the matrix are split among the cores of the CPU and the GPU. To enhance parallelization, the reduction and the updates are overlapped. In the next phase, the reduced m-Hessenberg-triangular systems are solved entirely on the GPU, with shifts divided into batches. The benefits of such load distribution are demonstrated by numerical experiments. In particular, we show that the proposed implementation provides an excellent basis for efficient implementations of computational methods in systems and control theory, from evaluating transfer functions to interpolatory model reduction.
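The reduce-once, solve-for-many-shifts idea in the abstract can be illustrated with a small CPU-only sketch. This is not the paper's algorithm: it assumes an ordinary Householder Hessenberg reduction A = Q H Q^* in place of the m-Hessenberg-triangular controller form, and it calls a generic dense solver where a structure-exploiting Hessenberg solver would be used in practice. The function names `hessenberg_reduce` and `solve_shifted` are hypothetical. The point it shows is that (A - sigma I) = Q (H - sigma I) Q^*, so a shift never destroys the reduced structure and the reduction cost is paid only once.

```python
import numpy as np

def hessenberg_reduce(A):
    """Return (H, Q) with A = Q @ H @ Q.conj().T and H upper Hessenberg."""
    n = A.shape[0]
    H = np.array(A, dtype=complex)
    Q = np.eye(n, dtype=complex)
    for k in range(n - 2):
        v = H[k + 1:, k].copy()
        norm_x = np.linalg.norm(v)
        if norm_x == 0.0:
            continue  # column is already reduced
        # Householder vector; the phase choice avoids cancellation.
        phase = v[0] / abs(v[0]) if v[0] != 0 else 1.0
        v[0] += phase * norm_x
        v /= np.linalg.norm(v)
        # Apply P = I - 2 v v^* from the left and from the right.
        H[k + 1:, :] -= 2.0 * np.outer(v, v.conj() @ H[k + 1:, :])
        H[k + 2:, k] = 0.0  # entries annihilated by the reflector
        H[:, k + 1:] -= 2.0 * np.outer(H[:, k + 1:] @ v, v.conj())
        Q[:, k + 1:] -= 2.0 * np.outer(Q[:, k + 1:] @ v, v.conj())
    return H, Q

def solve_shifted(A, B, shifts):
    """Solve (A - s I) X = B for every shift s, reducing A only once."""
    H, Q = hessenberg_reduce(A)
    C = Q.conj().T @ B            # transformed right-hand sides
    I = np.eye(A.shape[0])
    # Generic dense solve, used here only for brevity.
    return [Q @ np.linalg.solve(H - s * I, C) for s in shifts]

# Small demonstration with made-up data.
rng = np.random.default_rng(0)
A = rng.standard_normal((6, 6))
B = rng.standard_normal((6, 2))
shifts = [0.5 + 1.0j, -2.0j, 3.0 + 0.25j]
H, Q = hessenberg_reduce(A)
X_list = solve_shifted(A, B, shifts)
residuals = [np.linalg.norm((A - s * np.eye(6)) @ X - B)
             for s, X in zip(shifts, X_list)]
```

Each entry of `residuals` measures how well the corresponding X satisfies the original shifted system. In a setting like the one the abstract describes, the loop over shifts is the part that would be batched and moved to the GPU.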
Pages: C605 - C633
Page count: 29
Related papers
50 in total
  • [1] CPU-GPU hybrid parallel strategy for cosmological simulations
    Wang, Yueqing
    Dou, Yong
    Guo, Song
    Lei, Yuanwu
    Zou, Dan
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (03): 748 - 765
  • [2] Developing a CPU-GPU LES Parallel Solver for Canonical Turbulent Flows
    Nafiseh ZendehAli
    Homayoun Emdad
    Omid Abouali
    [J]. Iranian Journal of Science and Technology, Transactions of Mechanical Engineering, 2023, 47 : 1535 - 1551
  • [3] Developing a CPU-GPU LES Parallel Solver for Canonical Turbulent Flows
    ZendehAli, Nafiseh
    Emdad, Homayoun
    Abouali, Omid
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF MECHANICAL ENGINEERING, 2023, 47 (04) : 1535 - 1551
  • [4] Parallel Preconditioning and Modular Finite Element Solvers on Hybrid CPU-GPU Systems
    Heuveline, V.
    Lukarski, D.
    Subramanian, C.
    Weiss, J. -P.
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, GRID AND CLOUD COMPUTING FOR ENGINEERING, 2011, 95
  • [5] Improving Dense Linear Equation Solver on Hybrid CPU-GPU System
    Cao, Zhichao
    Xu, Shiming
    Xue, Wei
    Chen, Wenguang
    [J]. 2009 10TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (ISPAN 2009), 2009, : 556 - +
  • [6] Hybrid CPU-GPU Solver for Gradient Domain Processing of Massive Images
    Philip, Sujin
    Summa, Brian
    Pascucci, Valerio
    Bremer, Peer-Timo
    [J]. 2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011: 244 - 251
  • [7] A Distributed CPU-GPU Sparse Direct Solver
    Sao, Piyush
    Vuduc, Richard
    Li, Xiaoye Sherry
    [J]. EURO-PAR 2014 PARALLEL PROCESSING, 2014, 8632 : 487 - 498
  • [8] CPU-GPU Hybrid Parallel Binomial American Option Pricing
    Zhang, Nan
    Lim, Eng Gee
    Man, Ka Lok
    Lei, Chi-Un
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTIST, IMECS 2012, VOL II, 2012: 1157 - 1162
  • [9] PSkel: A stencil programming framework for CPU-GPU systems
    Pereira, Alyson D.
    Ramos, Luiz
    Goes, Luis F. W.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): 4938 - 4953
  • [10] A CPU-GPU Parallel Ant Colony Optimization Solver for the Vehicle Routing Problem
    Rey, Anton
    Prieto, Manuel
    Gomez, J. I.
    Tenllado, Christian
    Ignacio Hidalgo, J.
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 653 - 667