Design Space Exploration for Layer-parallel Execution of Convolutional Neural Networks on CGRAs

被引:7
|
作者
Heidorn, Christian [1 ]
Hannig, Frank [1 ]
Teich, Jurgen [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Dept Comp Sci, Hardware Software Codesign, Erlangen, Germany
关键词
CNN Accelerators; Design Space Exploration; CGRA;
D O I
10.1145/3378678.3391878
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we systematically explore the design space of throughput, energy, and hardware costs for layer-parallel mappings of Convolutional Neural Networks (CNNs) onto coarse-grained reconfigurable arrays (CGRAs). We derive an analytical model that computes the required resources (processing elements) and buffer memory and thus hardware cost C to sustain a given throughput T as well as the resulting overall energy consumption E for inference. Further, we propose an efficient design space exploration (DSE) to determine the fronts of Pareto-optimal (T,E,C) solutions. This exploration helps to determine the limits of scalability of the presented tiled CGRA accelerator architectures in terms of throughput, the number of parallel layers that can be simultaneously processed, and memory requirements. Finally, we provide an evaluation of energy savings achievable on our architecture in comparison to implementations that execute sequentially a CNN layer-by-layer. In experiments, it is shown that layer-parallel processing is able to reduce energy consumption E by 3.6x, hardware cost C by 1.2x, and increase the achievable throughput T by 6.2x for MobileNet.
引用
收藏
页码:26 / 31
页数:6
相关论文
共 50 条
  • [1] Layer-Parallel Training of Deep Residual Neural Networks
    Guenther, Stefanie
    Ruthotto, Lars
    Schroder, Jacob B.
    Cyr, Eric C.
    Gauger, Nicolas R.
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2020, 2 (01): : 1 - 23
  • [2] Design Space Exploration of FPGA Accelerators for Convolutional Neural Networks
    Rahman, Atul
    Oh, Sangyun
    Lee, Jongeun
    Choi, Kiyoung
    [J]. PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1147 - 1152
  • [3] Layer-parallel training of residual networks with auxiliary variable networks
    Sun, Qi
    Dong, Hexin
    Chen, Zewei
    Sun, Jiacheng
    Li, Zhenguo
    Dong, Bin
    [J]. NUMERICAL METHODS FOR PARTIAL DIFFERENTIAL EQUATIONS, 2024, 40 (06)
  • [4] Design Space Exploration of FPGA-Based Deep Convolutional Neural Networks
    Motamedi, Mohammad
    Gysel, Philipp
    Akella, Venkatesh
    Ghiasi, Soheil
    [J]. 2016 21ST ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2016, : 575 - 580
  • [5] A Design Space Exploration Framework for Convolutional Neural Networks Implemented on Edge Devices
    Tsimpourlas, Foivos
    Papadopoulos, Lazaros
    Bartsokas, Anastasios
    Soudris, Dimitrios
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (11) : 2212 - 2221
  • [6] Specializing CGRAs for Light-Weight Convolutional Neural Networks
    Lee, Jungi
    Lee, Jongeun
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (10) : 3387 - 3399
  • [7] Layer-Parallel Training with GPU Concurrency of Deep Residual Neural Networks via Nonlinear Multigrid
    Kirby, Andrew
    Samsi, Siddharth
    Jones, Michael
    Reuther, Albert
    Kepner, Jeremy
    Gadepally, Vijay
    [J]. 2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [8] Towards Design Space Exploration and Optimization of Fast Algorithms for Convolutional Neural Networks (CNNs) on FPGAs
    Ahmad, Afzal
    Pasha, Muhammad Adeel
    [J]. 2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 1106 - 1111
  • [9] AUGER: A Multi-Objective Design Space Exploration Framework for CGRAs
    Li, Jingyuan
    Hu, Yihan
    Dai, Yuan
    Kuang, Huizhen
    Wang, Lingli
    [J]. 2023 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, ICFPT, 2023, : 88 - 95
  • [10] Convolutional Neural Networks Inference Accelerator Design using Selective Convolutional Layer
    Huang, Tzu-Huan
    Goh, Emil
    Wey, I-Chyn
    Teo, T. Hui
    [J]. 2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 166 - 170