Balancing CPU-GPU Collaborative High-order CFD Simulations on the Tianhe-1A Supercomputer

Cited by: 7
Authors
Xu, Chuanfu [1]
Zhang, Lilun [1]
Deng, Xiaogang [1]
Fang, Jianbin [2]
Wang, Guangxue [3]
Cao, Wei [1]
Che, Yonggang [1]
Wang, Yongxian [1]
Liu, Wei [1]
Affiliations
[1] Natl Univ Def Technol, Coll Comp Sci, Changsha 410073, Hunan, Peoples R China
[2] Delft Univ Technol, Parallel & Distributed Syst Grp, NL-2628 CD Delft, Netherlands
[3] State Key Lab Aerodynam, Mianyang 621000, Peoples R China
Funding
National Science Foundation (USA);
Keywords
COMPACT NONLINEAR SCHEMES; FINITE-DIFFERENCE SCHEME; ACCURATE; GRIDS;
DOI
10.1109/IPDPS.2014.80
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
HOSTA is an in-house high-order CFD software package that can simulate complex flows over complex geometries. Large-scale high-order CFD simulations with HOSTA require massive HPC resources, which motivated us to port it to modern GPU-accelerated supercomputers such as Tianhe-1A. To achieve a greater speedup and fully tap the potential of Tianhe-1A, we use the CPU and GPU collaboratively for HOSTA instead of a naive GPU-only approach. We present multiple novel techniques to balance the load between the store-poor GPU and the store-rich CPU, and to overlap the collaborative computation and communication as far as possible. Taking CPU and GPU load balance into account, we increase the maximum simulation problem size per Tianhe-1A node for HOSTA by 2.3X, and the collaborative approach improves performance by around 45% compared to the GPU-only approach. Scalability tests show that HOSTA achieves a parallel efficiency above 60% on 1024 Tianhe-1A nodes. With our method, we have successfully simulated China's large civil airplane configuration C919 with 150M grid cells. To the best of our knowledge, this is the first paper to report a CPU-GPU collaborative high-order accurate aerodynamic simulation result with such a complex grid geometry.
Pages: 10
Related papers
31 in total
  • [21] Improved grid partitioning algorithms for load-balancing high-order structured aerodynamics simulations
    Xiong, Min
    Xu, Chuanfu
    Gao, Xiang
    Li, Dali
    Qu, Dandan
    Wang, Zhenghua
    Deng, Xiaogang
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 67 : 70 - 84
  • [22] Efficient Application of Hanging-Node Constraints for Matrix-Free High-Order FEM Computations on CPU and GPU
    Munch, Peter
    Ljungkvist, Karl
    Kronbichler, Martin
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2022, 2022, 13289 : 133 - 152
  • [23] CPU/GPU computing for a multi-block structured grid based high-order flow solver on a large heterogeneous system
    Cao, Wei
    Xu, Chuan-fu
    Wang, Zheng-hua
    Yao, Lu
    Liu, Hua-yong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2014, 17 (02) : 255 - 270
  • [24] CPU/GPU computing for a multi-block structured grid based high-order flow solver on a large heterogeneous system
    Wei Cao
    Chuan-fu Xu
    Zheng-hua Wang
    Lu Yao
    Hua-yong Liu
    Cluster Computing, 2014, 17 : 255 - 270
  • [25] On the utility of GPU accelerated high-order methods for unsteady flow simulations: A comparison with industry-standard tools
    Vermeire, B. C.
    Witherden, F. D.
    Vincent, P. E.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2017, 334 : 497 - 521
  • [26] Implicit Large Eddy Simulations of a rectangular 5:1 cylinder with a high-order discontinuous Galerkin method
    Crivellini, Andrea
    Nigro, Alessandra
    Colombo, Alessandro
    Ghidoni, Antonio
    Noventa, Gianmaria
    Cimarelli, Andrea
    Corsini, Roberto
    WIND AND STRUCTURES, 2022, 34 (01) : 59 - 72
  • [27] Performance modeling and optimization of parallel LU-SGS on many-core processors for 3D high-order CFD simulations
    Li, Dali
    Xu, Chuanfu
    Cheng, Bin
    Xiong, Min
    Gao, Xiang
    Deng, Xiaogang
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (06) : 2506 - 2524
  • [28] Performance modeling and optimization of parallel LU-SGS on many-core processors for 3D high-order CFD simulations
    Dali Li
    Chuanfu Xu
    Bin Cheng
    Min Xiong
    Xiang Gao
    Xiaogang Deng
    The Journal of Supercomputing, 2017, 73 : 2506 - 2524
  • [29] Development of Unstructured Curved Meshes with G1 Surface Continuity for High-Order Finite Element Simulations
    Lu, Qiukai
    Shephard, Mark S.
    SPECTRAL AND HIGH ORDER METHODS FOR PARTIAL DIFFERENTIAL EQUATIONS ICOSAHOM 2014, 2015, 106 : 335 - 343
  • [30] A comparative study of explicit and implicit Large Eddy simulations using a high-order discontinuous Galerkin solver: Application to a Formula 1 front wing
    Ntoukas, Gerasimos
    Rubio, Gonzalo
    Marino, Oscar
    Liosi, Alexandra
    Bottone, Francesco
    Hoessler, Julien
    Ferrer, Esteban
    RESULTS IN ENGINEERING, 2025, 25