Heterogeneous Accelerator Design for Multi-DNN Workloads via Heuristic Optimization

被引:0
|
作者
Balaskas, Konstantinos [1 ,2 ]
Khdr, Heba [1 ]
Bakr Sikal, Mohammed [1 ]
Kreß, Fabian [1 ]
Siozios, Kostas [2 ]
Becker, Jurgen [1 ]
Henkel, Jorg [1 ]
机构
[1] Karlsruhe Institute of Technology, Embedded Systems, Karlsruhe,76131, Germany
[2] Aristotle University of Thessaloniki, Department of Physics, Thessaloniki,54124, Greece
关键词
Simulated annealing;
D O I
10.1109/LES.2024.3443628
中图分类号
学科分类号
摘要
The significant advancements of deep neural networks (DNNs) in a wide range of application domains have spawned the need for more specialized, sophisticated solutions in the form of multi-DNN workloads. Heterogeneous DNN accelerators have emerged as an elegant solution to tackle the workloads' inherent diversity, achieving significant improvements compared to homogeneous solutions. However, utilizing off-the-shelf architectures provides suboptimal adaptability to given workloads, whereas custom design approaches offer limited heterogeneity, and thus reduced gains. In this letter, we combat these shortcomings and propose an exploration-based framework to holistically design heterogeneous accelerators, tailored for multi-DNN workloads. Our framework is workload-agnostic and leverages architectural heterogeneity to its full potential, by integrating low-precision arithmetic and custom structural parameters. We explore the formed design space, targeting to minimize the system's energy-delay product (EDP) via heuristic techniques. Our proposed accelerators achieve, on average, a significant 5.5 × reduction in EDP compared to the state of the art across various multi-DNN workloads. © 2009-2012 IEEE.
引用
收藏
页码:317 / 320
相关论文
共 50 条
  • [1] Heterogeneous Dataflow Accelerators for Multi-DNN Workloads
    Kwon, Hyoukjun
    Lai, Liangzhen
    Pellauer, Michael
    Krishna, Tushar
    Chen, Yu-Hsin
    Chandra, Vikas
    2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 71 - 83
  • [2] A Silicon Photonic Multi-DNN Accelerator
    Li, Yuan
    Louri, Ahmed
    Karanth, Avinash
    2023 32ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT, 2023, : 238 - 249
  • [3] CARIn: Constraint-Aware and Responsive Inference on Heterogeneous Devices for Single- and Multi-DNN Workloads
    Panopoulos, Ioannis
    Venieris, Stylianos
    Venieris, Iakovos
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2024, 23 (04)
  • [4] Serving Multi-DNN Workloads on FPGAs: A Coordinated Architecture, Scheduling, and Mapping Perspective
    Zeng, Shulin
    Dai, Guohao
    Zhang, Niansong
    Yang, Xinhao
    Zhang, Haoyu
    Zhu, Zhenhua
    Yang, Huazhong
    Wang, Yu
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (05) : 1314 - 1328
  • [5] Temperature-Aware Sizing of Multi-Chip Module Accelerators for Multi-DNN Workloads
    Shukla, Prachi
    Aguren, Derrick
    Burd, Tom
    Coskun, Ayse K.
    Kalamatianos, John
    2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [6] OmniBoost: Boosting Throughput of Heterogeneous Embedded Devices under Multi-DNN Workload
    Karatzas, Andreas
    Anagnostopoulos, Iraklis
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [7] Multi-Objective Hardware-Mapping Co-Optimisation for Multi-DNN Workloads on Chiplet-Based Accelerators
    Das, Abhijit
    Russo, Enrico
    Palesi, Maurizio
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (08) : 1883 - 1898
  • [8] Polyform: A Versatile Architecture for Multi-DNN Execution via Spatial and Temporal Acceleration
    Yin, Lingxiang
    Ghazizadeh, Amir
    Tian, Shilin
    Louri, Ahmed
    Zheng, Hao
    2023 IEEE 41ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD, 2023, : 166 - 169
  • [9] MARS: Exploiting Multi-Level Parallelism for DNN Workloads on Adaptive Multi-Accelerator Systems
    Shen, Guan
    Zhao, Jieru
    Wang, Zeke
    Lin, Zhe
    Ding, Wenchao
    Wu, Chentao
    Chen, Quan
    Guo, Minyi
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [10] Bayesian Optimization for Efficient Heterogeneous MPSoC based DNN Accelerator Runtime Tuning
    Zhu, Xuqi
    Gao, Cong
    Saha, Sangeet
    Zhai, Xiaojun
    McDonald-Maier, Klaus D.
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 355 - 356