REORDERING STRATEGY FOR BLOCKING OPTIMIZATION IN SPARSE LINEAR SOLVERS

Cited by: 6
Authors
Pichon, Gregoire [1]
Faverge, Mathieu [1, 2]
Ramet, Pierre [1]
Roman, Jean [1]
Affiliations
[1] Univ Bordeaux, Inria, Bordeaux INP, CNRS, LaBRI UMR 5800, Talence, France
[2] Univ Tennessee, ICL, Knoxville, TN 37996 USA
Keywords
sparse block linear solver; nested dissection; sparse matrix ordering; heterogeneous architectures; elimination
DOI
10.1137/16M1062454
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Solving sparse linear systems is a problem that arises in many scientific applications, and sparse direct solvers are a key, time-consuming kernel for those applications and for more advanced solvers such as hybrid direct-iterative solvers. For this reason, optimizing their performance on modern architectures is critical. The two preprocessing steps of sparse direct solvers, ordering and block-symbolic factorization, reduce the amount of computation and memory and yield the task granularity needed to reach good performance with BLAS kernels. With the advent of GPUs, the granularity of the block computation has become more important than ever. In this paper, we present a reordering strategy that increases this block granularity. This strategy relies on block-symbolic factorization to refine the ordering produced by tools such as Metis or Scotch, but it does not change the number of operations required to solve the problem. We integrate this algorithm in the PaStiX solver and show a significant reduction in the number of off-diagonal blocks across a wide range of matrices. This improvement leads to an increase in efficiency of up to 20% on GPUs.
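The idea of reducing the number of off-diagonal blocks without changing the operation count can be illustrated with a toy example. The sketch below is not the algorithm from the paper or any PaStiX API. It assumes a single supernode with four unknowns and three hypothetical contributing column blocks, each described by the set of supernode rows it touches (`row_sets`). An off-diagonal block is counted as a maximal run of consecutive rows, and an exhaustive search over permutations (feasible only for tiny cases, and used here purely for illustration) looks for the ordering with the fewest blocks. The helper names `count_blocks` and `best_order` are invented for this example.

```python
from itertools import permutations


def count_blocks(row_sets, order):
    """Count off-diagonal blocks for a given ordering of the supernode rows.

    Each contributing column block touches a set of rows; every maximal run
    of consecutive positions in the ordering forms one off-diagonal block.
    """
    position = {row: i for i, row in enumerate(order)}
    total = 0
    for rows in row_sets:
        pos = sorted(position[r] for r in rows)
        # A block starts at the first position and after every gap.
        total += sum(1 for i, p in enumerate(pos) if i == 0 or p != pos[i - 1] + 1)
    return total


def best_order(row_sets, n):
    """Exhaustive search for an ordering that minimizes the block count.

    Illustration only: cost grows as n!, so this is not a practical method.
    """
    return min(permutations(range(n)), key=lambda order: count_blocks(row_sets, order))


# Hypothetical contributions: rows of the supernode touched by each column block.
row_sets = [{0, 2}, {1, 3}, {0, 1, 2}]
initial = tuple(range(4))
improved = best_order(row_sets, 4)

# The number of touched rows (hence the nonzero count) is the same in both cases;
# only the number of contiguous blocks changes.
print(count_blocks(row_sets, initial), count_blocks(row_sets, improved))  # prints "5 3"
```

Because the permutation only moves unknowns inside one supernode, the set of rows each column block touches keeps the same size, which mirrors the abstract's point that the operation count is unaffected while the blocks become fewer and larger.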
Pages: 226-248
Page count: 23