Asynchronous distributed-memory task-parallel algorithm for compressible flows on unstructured 3D Eulerian grids

被引:8
|
作者
Bakosi, J. [1 ]
Bird, R. [1 ]
Gonzalez, F. [3 ]
Junghans, C. [1 ]
Li, W. [2 ]
Luo, H. [2 ]
Pandare, A. [1 ]
Waltz, J. [1 ]
机构
[1] Los Alamos Natl Lab, Los Alamos, NM 87544 USA
[2] North Carolina State Univ, Raleigh, NC USA
[3] Strong Analyt, Chicago, IL USA
关键词
Shock hydrodynamics; Finite element method; Flux-corrected transport; Charm plus; Automatic load balancing; ELEMENT; SYSTEMS;
D O I
10.1016/j.advengsoft.2020.102962
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We discuss the implementation of a finite element method, used to numerically solve the Euler equations of compressible flows, using an asynchronous runtime system (RTS). The algorithm is implemented for distributed-memory machines, using stationary unstructured 3D meshes, combining data-, and task-parallelism on top of the Charm++ RTS. Charm++'s execution model is asynchronous by default, allowing arbitrary overlap of computation and communication. Task-parallelism allows scheduling parts of an algorithm independently of, or dependent on, each other. Built-in automatic load balancing enables continuous redistribution of computational load by migration of work units based on real-time CPU load measurement. The RTS also features automatic checkpointing, fault tolerance, resilience against hardware failure, and supports power-, and energy-aware computation. We demonstrate scalability up to 25 x 10(9) cells at O(10(4)) compute cores and the benefits of automatic load balancing for irregular workloads. The full source code with documentation is available at https://quinoacomputing.org.
引用
下载
收藏
页数:21
相关论文
共 50 条
  • [1] Parallel multigrid acceleration of 3D compressible flows on unstructured meshes
    Carré, G
    Fournier, L
    Lanteri, S
    PROCEEDINGS OF THE HIGH PERFORMANCE COMPUTING SYMPOSIUM - HPC '99, 1999, : 3 - 8
  • [2] A new reconstruction scheme for the computation of inviscid compressible flows on 3D unstructured grids
    Catalano, LA
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2002, 40 (1-2) : 273 - 279
  • [3] Parallel Asynchronous Distributed-Memory Maximal Independent Set Algorithm with Work Ordering
    Kanewala, Thejaka
    Zalewski, Marcin
    Lumsdaine, Andrew
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2017, : 52 - 61
  • [4] Parallel 3d shape optimization for cellular composites on large distributed-memory clusters
    Pinzon, Jose
    Siebenborn, Martin
    Vogel, Andreas
    JOURNAL OF ADVANCED SIMULATION IN SCIENCE AND ENGINEERING, 2020, 7 (01): : 117 - 135
  • [5] A PARALLEL AGGLOMERATION MULTIGRID METHOD FOR THE ACCELERATION OF COMPRESSIBLE FLOW COMPUTATIONS ON 3D HYBRID UNSTRUCTURED GRIDS
    Lygidakis, Georgios N.
    Nikolos, Ioannis K.
    11TH WORLD CONGRESS ON COMPUTATIONAL MECHANICS; 5TH EUROPEAN CONFERENCE ON COMPUTATIONAL MECHANICS; 6TH EUROPEAN CONFERENCE ON COMPUTATIONAL FLUID DYNAMICS, VOLS V - VI, 2014, : 4852 - 4863
  • [6] A discrete unified gas kinetic scheme on unstructured grids for viscid compressible flows and its parallel algorithm
    Xu, Lei
    Yan, Zhengzheng
    Chen, Rongliang
    AIMS MATHEMATICS, 2023, 8 (04): : 8829 - 8846
  • [7] 2D gas dynamics problem computation parallelization on unstructured grids on distributed-memory computer
    Barabanov, RA
    Butnev, OI
    Pronin, VA
    Sofronov, ID
    Volkov, SG
    Voronin, BL
    Zhogov, BM
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2899 - 2906
  • [8] An arbitrary Lagrangian-Eulerian discretization of MHD on 3D unstructured grids
    Rieben, R. N.
    White, D. A.
    Wallin, B. K.
    Solberg, J. M.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2007, 226 (01) : 534 - 570
  • [9] Parallel implementation of an adaptive scheme for 3D unstructured grids on a shared-memory multiprocessor
    Biswas, R
    Dagum, L
    PARALLEL COMPUTATIONAL FLUID DYNAMICS: IMPLEMENTATIONS AND RESULTS USING PARALLEL COMPUTERS, 1996, : 489 - 496
  • [10] A 2D Parallel Triangle Counting Algorithm for Distributed-Memory Architectures
    Tom, Ancy Sarah
    Karypis, George
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,