HW/SW Codesign for Approximation-Aware Binary Neural Networks

被引:1
|
作者
Dave, Abhilasha [1 ]
Frustaci, Fabio [2 ]
Spagnolo, Fanny [2 ]
Yayla, Mikail [3 ]
Chen, Jian-Jia [3 ]
Amrouch, Hussam [1 ]
机构
[1] Univ Stuttgart, Chair Semicond Test & Reliabil STAR, D-70174 Stuttgart, Germany
[2] Univ Calabria, DIMES Dept, I-87036 Arcavacata Di Rende, Italy
[3] TU Dortmund Univ, Design Automat Embedded Syst Grp, D-44227 Dortmund, Germany
关键词
Approximate computing; neural network; low-power design; FPGA;
D O I
10.1109/JETCAS.2023.3243267
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Binary Neural Networks (BNNs) are rapidly gaining remarkable attention due to their superiority in shrinking the model size, which outstandingly mitigates the fundamental "memory wall" bottleneck that is attributed to the existing vonNeumann architectures. This work investigates how principles from approximate computing can be effectively employed to further optimize BNNs. It demonstrates that HW/SW codesign, in which BNNs are either proactively trained in the presence of approximation-induced errors (i.e. design-time optimization) and/or augmented with an appropriate error-mitigation scheme (i.e., run-time optimization), is a key to realize energy-efficient yet robust BNNs. We unveil, for the first time, that although the underlying HW of BNNs can be implemented using simple XNOR gates, the complexity of the required "Popcount" circuit superlinearly grows with the filter kernel size. This largely impacts the area footprint, inference time, energy, and hence it severely constricts the prospective efficiency gains from BNNs. To overcome this challenge, we replace the accurate full adders constructing the Popcount with Majority gates that approximately perform the required additions. Then, our carefully-crafted error-mitigation scheme along with activations tuning considerably minimizes the induced errors. Afterward, abstracted error probabilities are derived and employed during BNN training to obtain approximation-aware BNNs, that are inherently robust against the underlying HW approximation. Differently from the typical approaches, the proposed HW/SW codesign methodology has the merit of allowing a training of the approximate BNN without the need to modify the existing software frameworks (i.e., PyTorch). This is of great importance since existing tools rely on efficient built-in functions that can be difficult and/or inefficient to be modified. An FPGA-based SoC realizing both accurate and approximation-aware BNNs is developed for validating our proposed methodology. With merely a 4.7% loss in the inference accuracy, our HW/SW codesign leads to 64% and 80.2% savings in the area and energy, respectively, at the parity of the latency. Our results are obtained using commercial EDA tool flows employing a commercial 28nm FDSOI technology node.
引用
收藏
页码:33 / 47
页数:15
相关论文
共 50 条
  • [1] Industrial HW/SW codesign
    Buchenrieder, K
    Sedlmeier, A
    Veith, C
    [J]. HARDWARE/SOFTWARE CO-DESIGN, 1996, 310 : 453 - 466
  • [2] AppGNN: Approximation-Aware Functional Reverse Engineering using Graph Neural Networks
    Buecher, Tim
    Alrahis, Lilas
    Paim, Guilherme
    Bampi, Sergio
    Sinanoglu, Ozgur
    Amrouch, Hussam
    [J]. 2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [3] A rise architecture to explore HW/SW parallelism in HW/SW codesign
    Carro, L
    Suzim, A
    [J]. IEEE SYMPOSIUM AND WORKSHOP ON ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 1996, : 382 - 388
  • [4] A Comprehensive Approach to HW/SW Codesign
    Kuck, David
    [J]. 2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 1 - 1
  • [5] Hw/Sw codesign of embedded systems
    Fornaciari, W
    Sciuto, D
    [J]. RELIABLE SOFTWARE TECHNOLOGIES - ADA-EUROPE' 99, 1999, 1622 : 344 - 355
  • [6] An approach to the synthesis of HW and SW in codesign
    Carchiolo, V
    Malgeri, M
    Mangioni, G
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL WORKSHOP ON HARDWARE/SOFTWARE CODESIGN (CODES/CASHE '97), 1997, : 173 - 177
  • [7] Signing hw/sw codesign pact
    不详
    [J]. COMPUTER DESIGN, 1997, 36 (08): : 66 - 66
  • [8] ApNet: Approximation-aware Real-Time Neural Network
    Bateni, Soroush
    Liu, Cong
    [J]. 2018 39TH IEEE REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2018), 2018, : 67 - 79
  • [9] Rapid prototyping with HW/SW codesign tool
    Araki, D
    Ishii, T
    Gajski, DD
    [J]. ECBS '99, IEEE CONFERENCE AND WORKSHOP ON ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 1999, : 114 - 121
  • [10] Design for testability with HW-SW codesign
    Technical Univ of Budapest, Budapest, Hungary
    [J]. Period Polytech Electr Eng, 1 (25-37):