Two-level parallelization of a fluid mechanics algorithm exploiting hardware heterogeneity

被引:7
|
作者
Huismann, Immo [1 ]
Stiller, Joerg [1 ]
Froehlich, Jochen [1 ]
机构
[1] Tech Univ Dresden, Lehrstuhl Stromungsmech, D-01062 Dresden, Germany
关键词
Parallelization; Heterogeneous computing; MPI; GPGPU; OpenMP; OpenACC; GPU; PERFORMANCE;
D O I
10.1016/j.compfluid.2015.05.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The prospect of wildly heterogeneous computer systems has led to a renewed discussion of programming approaches in high-performance computing, of which computational fluid dynamics is a major field. The challenge consists in harvesting the performance of all available hardware components while retaining good programmability. In particular the use of graphic cards is an important trend. This is addressed in the present paper by devising a hybrid programming model to create a heterogeneous data-parallel computation with a single source code. The concept is demonstrated for a one-dimensional spectral-element discretization of a fluid dynamics problem. To exploit the additional hardware available when coupling GPGPU-accelerated processes with excess CPU cores, a straight-forward load balancing model for such heterogeneous environments is developed. The paper presents a large number of run time measurements and demonstrates that the achieved performance gains are close to optimal. This provides valuable information for the implementation of fluid dynamics codes on modern heterogeneous hardware. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:114 / 124
页数:11
相关论文
共 50 条
  • [31] An efficient two-level partitioning algorithm for VLSI circuits
    Cherng, JS
    Chen, SJ
    Tsai, CC
    Ho, JM
    PROCEEDINGS OF ASP-DAC '99: ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE 1999, 1999, : 69 - 72
  • [32] Two-level Decomposition Algorithm for Shift Scheduling Problems
    Doi, Tsubasa
    Nishi, Tatsushi
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3773 - 3778
  • [33] A two-level pipelined implementation of the IDEA cryptographic algorithm
    Salomao, SLC
    Alves, VC
    Filho, EMC
    XI BRAZILIAN SYMPOSIUM ON INTEGRATED CIRCUIT DESIGN, PROCEEDINGS, 1998, : 158 - 161
  • [34] Two-level learning algorithm for multilayer neural networks
    Liu, CS
    Tseng, CH
    TENTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 97 - 102
  • [35] Two-level quantizer design using genetic algorithm
    Chen, WJ
    Tai, SC
    Cheng, PJ
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1999, E82A (02) : 403 - 406
  • [36] Two-Level Intellectual Classifier Based on the SVM Algorithm
    Demidova, Liliya
    Sokolova, Yulia
    2017 6TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2017, : 463 - 466
  • [37] A two-level detection algorithm for optical fiber vibration
    Fukun Bi
    Xuecong Ren
    Hongquan Qu
    Ruiqing Jiang
    Photonic Sensors, 2015, 5 : 284 - 288
  • [38] A Two-Level Transfer Learning Algorithm for Evolutionary Multitasking
    Ma, Xiaoliang
    Chen, Qunjian
    Yu, Yanan
    Sun, Yiwen
    Ma, Lijia
    Zhu, Zexuan
    FRONTIERS IN NEUROSCIENCE, 2020, 13
  • [39] A two-level distributed algorithm for nonconvex constrained optimization
    Kaizhao Sun
    X. Andy Sun
    Computational Optimization and Applications, 2023, 84 : 609 - 649
  • [40] A Two-Level Dynamic Chrono-Scheduling Algorithm
    Diaz-Del-Rio, F.
    Sevillano, J. L.
    Vicente, S.
    Cagigas, D.
    Lopez-Torres, M. R.
    2009 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2009, : 109 - 116