Vectorization and Minimization of Memory Footprint for Linear High-Order Discontinuous Galerkin Schemes

被引:0
|
作者
Gallard, Jean-Matthieu [1 ]
Rannabauer, Leonhard [1 ]
Reinarz, Anne [1 ]
Bader, Michael [1 ]
机构
[1] Tech Univ Munich, Dept Informat, Munich, Germany
关键词
ExaHyPE; Code Generation; High-Order Discontinuous Galerkin; ADER; Hyperbolic PDE Systems; Vectorization; Array-of-Struct-of-Array;
D O I
10.1109/IPDPSW50202.2020.00126
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present a sequence of optimizations to the performance-critical compute kernels of the high-order discontinuous Galerkin solver of the hyperbolic PDE engine ExaHyPE - successively tackling bottlenecks due to SIMD operations, cache hierarchies and restrictions in the software design. Starting from a generic scalar implementation of the numerical scheme, our first optimized variant applies state-of-the-art optimization techniques by vectorizing loops, improving the data layout and using Loop-over-GEMM to perform tensor contractions via highly optimized matrix multiplication functions provided by the LIBXSMM library. We show that memory stalls due to a memory footprint exceeding our L2 cache size hindered the vectorization gains. We therefore introduce a new kernel that applies a sum factorization approach to reduce the kernel's memory footprint and improve its cache locality. With the L2 cache bottleneck removed, we were able to exploit additional vectorization opportunities, by introducing a hybrid Array-of-Structure-of-Array data layout that solves the data layout conflict between matrix multiplications kernels and the point-wise functions to implement PDE-specific terms. With this last kernel, evaluated in a benchmark simulation at high polynomial order, only 2% of the floating point operations are still performed using scalar instructions and 22.5% of the available performance is achieved.
引用
收藏
页码:711 / 720
页数:10
相关论文
共 50 条
  • [31] BRIDGING THE HYBRID HIGH-ORDER AND HYBRIDIZABLE DISCONTINUOUS GALERKIN METHODS
    Cockburn, Bernardo
    Di Pietro, Daniele A.
    Ern, Alexandre
    ESAIM-MATHEMATICAL MODELLING AND NUMERICAL ANALYSIS-MODELISATION MATHEMATIQUE ET ANALYSE NUMERIQUE, 2016, 50 (03): : 635 - 650
  • [32] A High-Order Discontinuous Galerkin Solver for Helically Symmetric Flows
    Dierkes, Dominik
    Kummer, Florian
    Pluemacher, Dominik
    COMMUNICATIONS IN COMPUTATIONAL PHYSICS, 2021, 30 (01) : 288 - 320
  • [33] FOURIER ANALYSES OF HIGH-ORDER CONTINUOUS AND DISCONTINUOUS GALERKIN METHODS
    Le Roux, Daniel Y.
    Eldred, Christopher
    Taylor, Mark A.
    SIAM JOURNAL ON NUMERICAL ANALYSIS, 2020, 58 (03) : 1845 - 1866
  • [34] Hybrid multigrid methods for high-order discontinuous Galerkin discretizations
    Fehn, Niklas
    Munch, Peter
    Wall, Wolfgang A.
    Kronbichler, Martin
    JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 415
  • [35] High-order continuous and discontinuous Galerkin methods for wave problems
    Giorgiani, Giorgio
    Modesto, David
    Fernandez-Mendez, Sonia
    Huerta, Antonio
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2013, 73 (10) : 883 - 903
  • [36] A high-order discontinuous Galerkin method for nonlinear sound waves
    Antonietti, Paola F.
    Mazzieri, Ilario
    Muhr, Markus
    Nikolic, Vanja
    Wohlmuth, Barbara
    JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 415
  • [37] High-Order Discontinuous Galerkin Method for Computation of Turbulent Flows
    Wang, Li
    Anderson, W. Kyle
    Erwin, Taylor
    Kapadia, Sagar
    AIAA JOURNAL, 2015, 53 (05) : 1159 - 1171
  • [38] Preconditioning High-Order Discontinuous Galerkin Discretizations of Elliptic Problems
    Antonietti, Paola F.
    Houston, Paul
    Lecture Notes in Computational Science and Engineering, 2013, 91 : 231 - 238
  • [39] A HIGH-ORDER ACCURATE DISCONTINUOUS GALERKIN METHOD FOR TURBOMACHINERY APPLICATIONS
    Bassi, F.
    Botti, L.
    Colombo, A.
    Franchina, N.
    Ghidoni, A.
    Rebay, S.
    9TH EUROPEAN CONFERENCE ON TURBOMACHINERY: FLUID DYNAMICS AND THERMODYNAMICS, VOLS I AND II, 2011, : 1603 - 1613
  • [40] Development of high-order Taylor-Galerkin schemes for LES
    Colin, O
    Rudgyard, M
    JOURNAL OF COMPUTATIONAL PHYSICS, 2000, 162 (02) : 338 - 371