Straightforward Heterogeneous Computing with the oneAPI Coexecutor Runtime

被引:4
|
作者
Nozal, Raul [1 ]
Bosque, Jose Luis [1 ]
机构
[1] Univ Cantabria, Dept Comp Sci & Elect, Santander 39005, Spain
关键词
heterogeneous systems; parallel computing; co-execution; load balancing; SYCL; oneAPI; Data Parallel C++; scheduling; HPC; CPU-GPU; MULTI-CPU; PERFORMANCE;
D O I
10.3390/electronics10192386
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous systems are the core architecture of most computing systems, from high-performance computing nodes to embedded devices, due to their excellent performance and energy efficiency. Efficiently programming these systems has become a major challenge due to the complexity of their architectures and the efforts required to provide them with co-execution capabilities that can fully exploit the applications. There are many proposals to simplify the programming and management of acceleration devices and multi-core CPUs. However, in many cases, portability and ease of use compromise the efficiency of different devices-even more so when co-executing. Intel oneAPI, a new and powerful standards-based unified programming model, built on top of SYCL, addresses these issues. In this paper, oneAPI is provided with co-execution strategies to run the same kernel between different devices, enabling the exploitation of static and dynamic policies. This work evaluates the performance and energy efficiency for a well-known set of regular and irregular HPC benchmarks, using two heterogeneous systems composed of an integrated GPU and CPU. Static and dynamic load balancers are integrated and evaluated, highlighting single and co-execution strategies and the most significant key points of this promising technology. Experimental results show that co-execution is worthwhile when using dynamic algorithms and improves the efficiency even further when using unified shared memory.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] A Compiler and Runtime for Heterogeneous Computing
    Auerbach, Joshua
    Bacon, David F.
    Burcea, Ioana
    Cheng, Perry
    Fink, Stephen J.
    Rabbah, Rodric
    Shukla, Sunil
    [J]. 2012 49TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2012, : 271 - 276
  • [2] Runtime Adaptation for Autonomic Heterogeneous Computing
    Scogland, Thomas R. W.
    Feng, Wu-chun
    [J]. 2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, : 562 - 565
  • [3] Assessing Intel OneAPI capabilities and cloud-performance for heterogeneous computing
    Alcaraz, Silvia R.
    Laso, Ruben
    Lorenzo, Oscar G.
    Vilarino, David L.
    Pena, Tomas F.
    Rivera, Francisco F.
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (09): : 13295 - 13316
  • [4] A Heterogeneous Runtime Environment for Scientific Desktop Computing
    Oliveira, Nuno
    Medeiros, Pedro D.
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 256 - 269
  • [5] RCHC: a Holistic Runtime System for Concurrent Heterogeneous Computing
    Park, Jinsu
    Baek, Woongki
    [J]. PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016, 2016, : 211 - 216
  • [6] A Programming Model and Runtime System for Approximation-Aware Heterogeneous Computing
    Parnassos, Ioannis
    Bellas, Nikolaos
    Katsaros, Nikolaos
    Patsiatzis, Nikolaos
    Gkaras, Athanasios
    Kanellis, Konstantinos
    Antonopoulos, Christos D.
    Spyrou, Michalis
    Maroudas, Manolis
    [J]. 2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [7] An investigation on runtime task scheduling for parallel raytracing on a heterogeneous distributed computing system
    Qureshi, KU
    Hatanaka, M
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1066 - 1073
  • [8] Integrating FPGA-based Processing Elements into a Runtime for Parallel Heterogeneous Computing
    de la Chevallerie, David
    Korinth, Jens
    Koch, Andreas
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2014, : 314 - 317
  • [9] CEML: a Coordinated Runtime System for Efficient Machine Learning on Heterogeneous Computing Systems
    Hyun, Jihoon
    Park, Jinsu
    Kim, Kyu Yeun
    Yu, Seongdae
    Baek, Woongki
    [J]. EURO-PAR 2018: PARALLEL PROCESSING, 2018, 11014 : 781 - 795
  • [10] Making Profit with ALBATROSS: A Runtime System for Heterogeneous High-Performance-Computing Clusters
    Hoenig, Timo
    Eibel, Christopher
    Wagenhaeuser, Adam
    Wagner, Maximilian
    Schroeder-Preikschat, Wolfgang
    [J]. HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING: POSTERS/DOCTORAL CONSORTIUM, 2018, : 11 - 12