Optimizing Offload Performance in Heterogeneous MPSoCs

被引:0
|
作者
Colagrande, Luca [1 ]
Benini, Luca [1 ]
机构
[1] Swiss Fed Inst Technol, Integrated Syst Lab, IIS, Zurich, Switzerland
关键词
heterogeneous systems; fine-grain parallelism; job offloading; manycore accelerators;
D O I
10.23919/DATE58400.2024.10546670
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous multi-core architectures combine a few "host" cores, optimized for single-thread performance, with many small energy-efficient "accelerator" cores for data-parallel processing, on a single chip. Offloading a computation to the many-core acceleration fabric introduces a communication and synchronization cost which reduces the speedup attainable on the accelerator, particularly for small and fine-grained parallel tasks. We demonstrate that by co-designing the hardware and offload routines, we can increase the speedup of an offloaded DAXPY kernel by as much as 47.9%. Furthermore, we show that it is possible to accurately model the runtime of an offloaded application, accounting for the offload overheads, with as low as 1% MAPE error, enabling optimal offload decisions under offload execution time constraints.
引用
收藏
页数:2
相关论文
共 50 条
  • [1] Modeling the Impact on Performance of Memory Pooling in Heterogeneous MPSoCs
    Pauls, Friedrich
    Fettweis, Gerhard
    2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017,
  • [2] Collective Offload for Heterogeneous Clusters
    Sainz, Florentino
    Bellon, Jorge
    Beltran, Vicenc
    Labarta, Jesus
    2015 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2015, : 376 - 385
  • [3] Power-Performance Modelling of Mobile Gaming Workloads on Heterogeneous MPSoCs
    Pathania, Anuj
    Irimiea, Alexandru Eugen
    Prakash, Alok
    Mitra, Tulika
    2015 52ND ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2015,
  • [4] Reverse Offload Programming on Heterogeneous Systems
    Chen, Cheng
    Yang, Wenxiang
    Wang, Fang
    Zhao, Dan
    Liu, Yang
    Deng, Liang
    Yang, Canqun
    IEEE ACCESS, 2019, 7 : 10787 - 10797
  • [5] IPSec offload performance
    Raissi, J
    PROCEEDINGS OF THE IEEE SOUTHEASTCON 2004: ENGINEERING CONNECTS, 2004, : 222 - 228
  • [6] A compiler infrastructure for embedded heterogeneous MPSoCs
    Sheng, Weihua
    Schuermans, Stefan
    Odendahl, Maximilian
    Bertsch, Mark
    Volevach, Vitaliy
    Leupers, Rainer
    Ascheid, Gerd
    PARALLEL COMPUTING, 2014, 40 (02) : 51 - 68
  • [7] Pre-Mapping Algorithm for Heterogeneous MPSoCs
    Marcon, Cesar
    Webber, Thais
    Poehls, Leticia B.
    Pinotti, Igor K.
    2014 27TH INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2014 13TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID 2014), 2014, : 252 - 257
  • [8] Tuning the ISA for increased heterogeneous computation in MPSoCs
    Becker, Pedro H. E.
    Souza, Jeckson D.
    Beck, Antonio C. S.
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 1722 - 1727
  • [9] A Hybrid Task Mapping Algorithm for Heterogeneous MPSoCs
    Quan, Wei
    Pimentel, Andy D.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2015, 14 (01)
  • [10] Shared Hardware Accelerator Architectures for Heterogeneous MPSoCs
    Bouthaina, Damak
    Baklouti, Mouna
    Niar, Smail
    Abid, Mohamed
    2013 8TH INTERNATIONAL WORKSHOP ON RECONFIGURABLE AND COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2013,