Evaluating and Optimizing Parallel LU-SGS for High-order CFD simulations on the Tianhe-2 supercomputer

被引:0
|
作者
Li, Dali [1 ]
Xu, Chuanfu [1 ]
Guo, Ningbo [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp Sci, Changsha 410073, Hunan, Peoples R China
基金
美国国家科学基金会;
关键词
ALGORITHM; EULER;
D O I
10.1109/TrustCom.2016.190
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The inherent strong data dependency of LU-SGS poses tough challenges for shared-memory parallelization. The popular pipeline solution for parallel LU-SGS in CFD, achieves impressive parallel scalability on early multi-core processors. However, recent experiences show that the scalability of pipeline LU-SGS drops dramatically on emerging many-core processors such as Xeon Phi due to high startup and emptying overheads and severe load imbalance. We discover that increasingly large pipeline depth tremendously hinder the applicability of pipeline LU-SGS in realistic parallel CFD simulations on many-core processors. Aiming at alleviating these performance issues, we propose a novel improved pipeline LU-SGS algorithm, which organizes threads hierarchically using nested OpenMP to construct a sub-pipeline in each original pipeline stage to further exploit LU-SGS's parallelism. We implement and evaluate it in our in-house high-order CFD software HOSTA on Xeon and Xeon Phi. For a given 256 x 256 x 256 workload, improved method achieves over 20% performance gains on Xeon Phi than traditional pipeline approach, a further 38% performance boost are observed on Xeon Phi when varies the dimension sizes. Related problems in realistic CFD simulations such as domain decomposition and algorithmic parameter tuning are also discussed. Generally, our work is applicable to all Gauss-Seidel like methods with intrinsic strong data dependency.
引用
收藏
页码:1182 / 1189
页数:8
相关论文
共 11 条
  • [1] Heterogeneous computing and optimization on tianhe-2 supercomputer system for high-order accurate CFD applications
    Wang, Yongxian
    Zhang, Lilun
    Che, Yonggang
    Xu, Chuanfu
    Liu, Wei
    Cheng, Xinghua
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (04): : 833 - 842
  • [2] Performance modeling and optimization of parallel LU-SGS on many-core processors for 3D high-order CFD simulations
    Li, Dali
    Xu, Chuanfu
    Cheng, Bin
    Xiong, Min
    Gao, Xiang
    Deng, Xiaogang
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (06): : 2506 - 2524
  • [3] Performance modeling and optimization of parallel LU-SGS on many-core processors for 3D high-order CFD simulations
    Dali Li
    Chuanfu Xu
    Bin Cheng
    Min Xiong
    Xiang Gao
    Xiaogang Deng
    The Journal of Supercomputing, 2017, 73 : 2506 - 2524
  • [4] Balancing CPU-GPU Collaborative High-order CFD Simulations on the Tianhe-1A Supercomputer
    Xu, Chuanfu
    Zhang, Lilun
    Deng, Xiaogang
    Fang, Jianbin
    Wang, Guangxue
    Cao, Wei
    Che, Yonggang
    Wang, Yongxian
    Liu, Wei
    2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [5] Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer
    Xu, Chuanfu
    Deng, Xiaogang
    Zhang, Lilun
    Fang, Jianbin
    Wang, Guangxue
    Jiang, Yi
    Cao, Wei
    Che, Yonggang
    Wang, Yongxian
    Wang, Zhenghua
    Liu, Wei
    Cheng, Xinghua
    JOURNAL OF COMPUTATIONAL PHYSICS, 2014, 278 : 275 - 297
  • [6] Parallelizing and optimizing large-scale 3D multi-phase flow simulations on the Tianhe-2 supercomputer
    Li, Dali
    Xu, Chuanfu
    Wang, Yongxian
    Song, Zhifang
    Xiong, Min
    Gao, Xiang
    Deng, Xiaogang
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (05): : 1678 - 1692
  • [7] High-Scalable Collaborated Parallel Framework for Large-Scale Molecular Dynamic Simulation on Tianhe-2 Supercomputer
    Peng, Shaoliang
    Zhang, Xiaoyu
    Su, Wenhe
    Dong, Dong
    Lu, Yutong
    Liao, Xiangke
    Lu, Kai
    Yang, Canqun
    Liu, Jie
    Zhu, Weiliang
    Wei, Dongqing
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (03) : 804 - 816
  • [8] Parallelizing a High-Order CFD Software for 3D, Multi-block, Structural Grids on the TianHe-1A Supercomputer
    Xu, Chuanfu
    Deng, Xiaogang
    Zhang, Lilun
    Jiang, Yi
    Cao, Wei
    Fang, Jianbin
    Che, Yonggang
    Wang, Yongxian
    Liu, Wei
    SUPERCOMPUTING (ISC 2013), 2013, 7905 : 26 - 39
  • [9] Efficient Implicit Non-linear LU-SGS Approach for Compressible Flow Computation Using High-Order Spectral Difference Method
    Sun, Yuzhi
    Wang, Z. J.
    Liu, Yen
    COMMUNICATIONS IN COMPUTATIONAL PHYSICS, 2009, 5 (2-4) : 760 - 778
  • [10] Parallel adaptive high-order CFD simulations characterising SOFIA cavity acoustics
    Barad, Michael F.
    Brehm, Christoph
    Kiris, Cetin C.
    Biswas, Rupak
    INTERNATIONAL JOURNAL OF COMPUTATIONAL FLUID DYNAMICS, 2016, 30 (06) : 437 - 443