Parallel Performance Optimizations on Unstructured Mesh-Based Simulations

被引:7
|
作者
Sarje, Abhinav [1 ]
Song, Sukhyun [2 ]
Jacobsen, Douglas [3 ]
Huck, Kevin [4 ]
Hollingsworth, Jeffrey [2 ]
Malony, Allen [4 ]
Williams, Samuel [1 ]
Oliker, Leonid [1 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA USA
[2] Univ Maryland, College Pk, MD USA
[3] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
[4] Univ Oregon, Eugene, OR 97403 USA
关键词
Unstructured Mesh; Ocean Modeling; Graph Partitioning; Performance Optimization;
D O I
10.1016/j.procs.2015.05.466
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper addresses two key parallelization challenges the unstructured mesh-based ocean modeling code, MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns, that inhibit intra-and inter-node performance. Our work analyzes the load imbalance due to naive partitioning of the mesh, and develops methods to generate mesh partitioning with better load balance and reduced communication. Furthermore, we present methods that minimize both inter-and intra-node data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data when running on thousands of cores using the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2x. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations.
引用
收藏
页码:2016 / 2025
页数:10
相关论文
共 50 条
  • [1] Generic components for petascale adaptive unstructured mesh-based simulations
    Xie, Ting
    Seol, Seegyoung
    Shephard, Mark S.
    [J]. ENGINEERING WITH COMPUTERS, 2014, 30 (01) : 79 - 95
  • [2] Generic components for petascale adaptive unstructured mesh-based simulations
    Ting Xie
    Seegyoung Seol
    Mark S. Shephard
    [J]. Engineering with Computers, 2014, 30 : 79 - 95
  • [3] Implementation and Performance Analysis of SkelGIS for Network Mesh-Based Simulations
    Coullon, Helene
    Limet, Sebastien
    [J]. EURO-PAR 2014 PARALLEL PROCESSING, 2014, 8632 : 439 - 450
  • [4] CONTROLLING UNSTRUCTURED MESH PARTITIONS FOR MASSIVELY PARALLEL SIMULATIONS
    Zhou, Min
    Sahni, Onkar
    Devine, Karen D.
    Shephard, Mark S.
    Jansen, Kenneth E.
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2010, 32 (06): : 3201 - 3227
  • [5] Performance model for Krylov subspace methods on mesh-based parallel computers
    Swiss Federal Inst of Technology, Zurich, Switzerland
    [J]. Parallel Comput, 1 (57-74):
  • [6] A performance model for Krylov subspace methods on mesh-based parallel computers
    deSturler, E
    [J]. PARALLEL COMPUTING, 1996, 22 (01) : 57 - 74
  • [7] PUMIPic: A mesh-based approach to unstructured mesh Particle-In-Cell on GPUs
    Diamond, Gerrett
    Smith, Cameron W.
    Zhang, Chonglin
    Yoon, Eisung
    Shephard, Mark S.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 157 : 1 - 12
  • [8] Hiroshima and Nagasaki Verification of an Unstructured Mesh-Based Transmutation Toolkit
    McClanahan, Tucker C.
    Goorley, Tim
    Auxier, John, II
    [J]. NUCLEAR TECHNOLOGY, 2021, 207 (01) : 19 - 36
  • [9] A parallel unstructured mesh adaptation for unsteady compressible flow simulations
    Kinoshita, T
    Inoue, O
    [J]. PARALLEL COMPUTATIONAL FLUID DYNAMICS: RECENT DEVELOPMENTS AND ADVANCES USING PARALLEL COMPUTERS, 1998, : 65 - 72
  • [10] A Parallelization and Performance Optimization Framework for Mesh-Based Simulations Using Target Execution Models
    Zhang, Zhi-guo
    He, Qing-yin
    Liu, Jin-yu
    Shao, Jing-yi
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELING AND SIMULATION (AMMS 2018), 2018, 305 : 228 - 232