A Heterogeneous MPI plus PPL Task Scheduling Approach for Asynchronous Many-Task Runtime Systems

被引:1
|
作者
Holmen, John K. [1 ]
Sahasrabudhe, Damodar [1 ]
Berzins, Martin [1 ]
机构
[1] Univ Utah, SCI Inst, Salt Lake City, UT 84112 USA
关键词
Asynchronous Many-Task Runtime System; Performance Portability; Parallelism and Concurrency; Portability; Software Engineering; PERFORMANCE;
D O I
10.1145/3437359.3465581
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Asynchronous many-task runtime systems and MPI+X hybrid parallelism approaches have shown promise for helping manage the increasing complexity of nodes in current and emerging high performance computing (HPC) systems, including those for exascale. The increasing architectural diversity of these systems, however, poses challenges for runtimes supporting more homogeneous HPC systems. Performance portability layers (PPL) have shown promise for helping manage this diversity. This paper describes a heterogeneous MPI+PPL task scheduling approach for combining these promising solutions with additional consideration for parallel third party libraries facing similar challenges to help prepare such a runtime for the diverse heterogeneous systems accompanying exascale computing. This approach is demonstrated using a heterogeneous MPI+Kokkos task scheduler and the accompanying portable abstractions [16] implemented in the Uintah Computational Framework, an asynchronous many-task runtime system, with additional consideration for hypre, a parallel third party library. Results are shown for two challenging problems executing workloads representative of typical Uintah applications. These results show performance improvements up to 4.4x when using this scheduler and the accompanying portable abstractions [16] to port a previously MPI-Only problem to Kokkos::OpenMP and Kokkos::CUDA to improve complex heterogeneous node use. Good strong-scaling to 1,024 NVIDIA V100 GPUs and 512 IBM POWER9 processor are also shown using MPI+Kokkos::OpenMP+Kokkos::CUDA at scale.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Latency-based Vector Scheduling of Many-task Applications for a Hybrid Cloud
    Mithila, Shifat P.
    Baumgartner, Gerald
    2022 IEEE 15TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2022), 2022, : 257 - 262
  • [32] An Introduction to hpxMP - A Modern OpenMP Implementation Leveraging HPX, An Asynchronous Many-Task System
    Zhang, Tianyi
    Shirzad, Shahrzad
    Diehl, Patrick
    Tohid, R.
    Wei, Weile
    Kaiser, Hartmut
    PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL (IWOCL'19), 2019,
  • [33] Minimizing Energy of Heterogeneous Computing Systems by Task Scheduling Approach
    Li, Junke
    Li, Junwei
    Li, Mingjiang
    Wang, Guanyu
    Zhou, Jincheng
    Lu, Yu
    Li, Deguang
    Huang, Yanhui
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2020, 29 (12)
  • [34] Task Scheduling Approach to Save Energy of Heterogeneous Computing Systems
    Li, Junke
    Li, Mingjiang
    Wang, Guanyu
    Zhou, Jincheng
    Li, Deguang
    Huang, Yanhui
    2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 353 - 360
  • [35] Task scheduling for heterogeneous computing systems
    Shaikhah AlEbrahim
    Imtiaz Ahmad
    The Journal of Supercomputing, 2017, 73 : 2313 - 2338
  • [36] A novel task scheduling for heterogeneous systems
    Ren, XP
    Wan, J
    Hu, GH
    EMBEDDED SOFTWARE AND SYSTEMS, 2005, 3605 : 400 - 405
  • [37] Task scheduling for heterogeneous computing systems
    AlEbrahim, Shaikhah
    Ahmad, Imtiaz
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (06): : 2313 - 2338
  • [38] Towards effective scheduling policies for many-task applications: Practice and experience based on HTCaaS
    Kim, Jik-Soo
    Quang, Bui
    Rho, Seungwoo
    Kim, Seoyoung
    Kim, Sangwan
    Breton, Vincent
    Hwang, Soonwook
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (21):
  • [39] Dynamic DAG scheduling for many-task computing of distributed eco-hydrological model
    Yue, Shasha
    Ma, Yan
    Chen, Lajiao
    Wang, Yuzhu
    Song, Weijing
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (02): : 510 - 532
  • [40] Dynamic DAG scheduling for many-task computing of distributed eco-hydrological model
    Shasha Yue
    Yan Ma
    Lajiao Chen
    Yuzhu Wang
    Weijing Song
    The Journal of Supercomputing, 2019, 75 : 510 - 532