Optimizing Offload Performance in Heterogeneous MPSoCs

被引:0
|
作者
Colagrande, Luca [1 ]
Benini, Luca [1 ]
机构
[1] Swiss Fed Inst Technol, Integrated Syst Lab, IIS, Zurich, Switzerland
关键词
heterogeneous systems; fine-grain parallelism; job offloading; manycore accelerators;
D O I
10.23919/DATE58400.2024.10546670
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous multi-core architectures combine a few "host" cores, optimized for single-thread performance, with many small energy-efficient "accelerator" cores for data-parallel processing, on a single chip. Offloading a computation to the many-core acceleration fabric introduces a communication and synchronization cost which reduces the speedup attainable on the accelerator, particularly for small and fine-grained parallel tasks. We demonstrate that by co-designing the hardware and offload routines, we can increase the speedup of an offloaded DAXPY kernel by as much as 47.9%. Furthermore, we show that it is possible to accurately model the runtime of an offloaded application, accounting for the offload overheads, with as low as 1% MAPE error, enabling optimal offload decisions under offload execution time constraints.
引用
收藏
页数:2
相关论文
共 50 条
  • [31] Offload Annotations: Bringing Heterogeneous Computing to Existing Libraries and Workloads
    Yuan, Gina
    Palkar, Shoumik
    Narayanan, Deepak
    Zaharia, Matei
    PROCEEDINGS OF THE 2020 USENIX ANNUAL TECHNICAL CONFERENCE, 2020, : 293 - 306
  • [32] Automatic Parallelization and Accelerator Offloading for Embedded Applications on Heterogeneous MPSoCs
    Aguilar, Miguel Angel
    Leupers, Rainer
    Ascheid, Gerd
    Murillo, Luis Gabriel
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [33] Optimizing energy and throughput for MPSoCs: an integer particle swarm optimization approach
    Shahid Ali Murtza
    Ayaz Ahmad
    Muhammad Yasir Qadri
    Nadia N. Qadri
    Jameel Ahmed
    Computing, 2018, 100 : 227 - 244
  • [34] A Unified Approach to Optimizing Performance in Networks serving Heterogeneous Flows
    Li, Ruogu
    Ying, Lei
    Eryilmaz, Atilla
    Shroff, Ness B.
    IEEE INFOCOM 2009 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-5, 2009, : 253 - +
  • [35] A Unified Approach to Optimizing Performance in Networks Serving Heterogeneous Flows
    Li, Ruogu
    Eryilmaz, Atilla
    Ying, Lei
    Shroff, Ness B.
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2011, 19 (01) : 223 - 236
  • [36] Dynamic road pricing for optimizing network performance with heterogeneous users
    Joksimovic, D
    Bliemer, MCJ
    Bovy, PHL
    Verwater-Lukszo, Z
    2005 IEEE Networking, Sensing and Control Proceedings, 2005, : 407 - 412
  • [37] Optimizing legacy molecular dynamics software with directive-based offload
    Brown, W. Michael
    Carrillo, Jan-Michael Y.
    Gavhane, Nitin
    Thakkar, Foram M.
    Plimpton, Steven J.
    COMPUTER PHYSICS COMMUNICATIONS, 2015, 195 : 95 - 101
  • [38] Automatic Extraction of Task-Level Parallelism for Heterogeneous MPSoCs
    Cordes, Daniel
    Neugebauer, Olaf
    Engel, Michael
    Marwedel, Peter
    2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 950 - 959
  • [39] Heterogeneous MPSoCs for Mixed-Criticality Systems: Challenges and Opportunities
    Hassan, Mohamed
    IEEE DESIGN & TEST, 2018, 35 (04) : 47 - 55
  • [40] Optimizing energy and throughput for MPSoCs: an integer particle swarm optimization approach
    Murtza, Shahid Ali
    Ahmad, Ayaz
    Qadri, Muhammad Yasir
    Qadri, Nadia N.
    Ahmed, Jameel
    COMPUTING, 2018, 100 (03) : 227 - 244