Balancing CPU-GPU Collaborative High-order CFD Simulations on the Tianhe-1A Supercomputer

被引:7
|
作者
Xu, Chuanfu [1 ]
Zhang, Lilun [1 ]
Deng, Xiaogang [1 ]
Fang, Jianbin [2 ]
Wang, Guangxue [3 ]
Cao, Wei [1 ]
Che, Yonggang [1 ]
Wang, Yongxian [1 ]
Liu, Wei [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp Sci, Changsha 410073, Hunan, Peoples R China
[2] Delft Univ Technol, Parallel & Distributed Syst Grp, NL-2628 CD Delft, Netherlands
[3] State Key Lab Aerodynam, Mianyang 621000, Peoples R China
基金
美国国家科学基金会;
关键词
COMPACT NONLINEAR SCHEMES; FINITE-DIFFERENCE SCHEME; ACCURATE; GRIDS;
D O I
10.1109/IPDPS.2014.80
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
HOSTA is an in-house high-order CFD software that can simulate complex flows with complex geometries. Large-scale high-order CFD simulations using HOSTA require massive HPC resources, thus motivating us to port it onto modern GPU-accelerated supercomputers like Tianhe-1A. To achieve a greater speedup and fully tap the potential of Tianhe-1A, we collaborate CPU and GPU for HOSTA instead of using a naive GPU-only approach. We present multiple novel techniques to balance the loads between the store-poor GPU and the store-rich CPU, and overlap the collaborative computation and communication as far as possible. Taking CPU and GPU load balance into account, we improve the maximum simulation problem size per Tianhe-1A node for HOSTA by 2.3X, meanwhile the collaborative approach can improve the performance by around 45% compared to the GPU-only approach. Scalability tests show that HOSTA can achieve a parallel efficiency of above 60% on 1024 Tianhe-1A nodes. With our method, we have successfully simulated China's large civil airplane configuration C919 containing 150M grid cells. To our best knowledge, this is the first paper that reports a CPU-GPU collaborative high-order accurate aerodynamic simulation result with such a complex grid geometry.
引用
收藏
页数:10
相关论文
共 31 条
  • [1] Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer
    Xu, Chuanfu
    Deng, Xiaogang
    Zhang, Lilun
    Fang, Jianbin
    Wang, Guangxue
    Jiang, Yi
    Cao, Wei
    Che, Yonggang
    Wang, Yongxian
    Wang, Zhenghua
    Liu, Wei
    Cheng, Xinghua
    JOURNAL OF COMPUTATIONAL PHYSICS, 2014, 278 : 275 - 297
  • [2] Parallelizing a High-Order CFD Software for 3D, Multi-block, Structural Grids on the TianHe-1A Supercomputer
    Xu, Chuanfu
    Deng, Xiaogang
    Zhang, Lilun
    Jiang, Yi
    Cao, Wei
    Fang, Jianbin
    Che, Yonggang
    Wang, Yongxian
    Liu, Wei
    SUPERCOMPUTING (ISC 2013), 2013, 7905 : 26 - 39
  • [3] Evaluating and Optimizing Parallel LU-SGS for High-order CFD simulations on the Tianhe-2 supercomputer
    Li, Dali
    Xu, Chuanfu
    Guo, Ningbo
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1182 - 1189
  • [4] A hybrid CPU-GPU paradigm to accelerate reactive CFD simulations
    Ghioldi, Federico
    Piscaglia, Federico
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2024, 96 (08) : 1461 - 1488
  • [5] Large Scale Satellite Imagery Simulations with Physically Based Ray Tracing on Tianhe-1A Supercomputer
    Wu, Changmao
    Zhang, Yunquan
    Yang, Congli
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 549 - 556
  • [6] Heterogeneous computing and optimization on tianhe-2 supercomputer system for high-order accurate CFD applications
    Wang, Yongxian
    Zhang, Lilun
    Che, Yonggang
    Xu, Chuanfu
    Liu, Wei
    Cheng, Xinghua
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (04): : 833 - 842
  • [7] Efficient parallel implementation of large scale 3D structured grid CFD applications on the Tianhe-1A supercomputer
    Wang Yong-Xian
    Zhang Li-Lun
    Liu Wei
    Che Yong-Gang
    Xu Chuan-Fu
    Wang Zheng-Hua
    Zhuang Yu
    COMPUTERS & FLUIDS, 2013, 80 : 244 - 250
  • [8] Acceleration of supersonic/hypersonic reactive CFD simulations via heterogeneous CPU-GPU supercomputing
    Ghioldi, F.
    Piscaglia, F.
    COMPUTERS & FLUIDS, 2023, 266
  • [9] A dynamic load balancing algorithm for CFD-DEM simulation with CPU-GPU heterogeneous computing
    Zhu, Aiqi
    Chang, Qi
    Xu, Ji
    Ge, Wei
    POWDER TECHNOLOGY, 2023, 428
  • [10] High efficient sedimentary basin simulations on hybrid CPU-GPU clusters
    Mei Wen
    Huayou Su
    Wenjie Wei
    Nan Wu
    Xing Cai
    Chunyuan Zhang
    Cluster Computing, 2014, 17 : 359 - 369