Full Approximation of Deep Neural Networks through Efficient Optimization

被引:3
|
作者
De la Parra, Cecilia [1 ]
Guntoro, Andre [1 ]
Kumar, Akash [2 ]
机构
[1] Robert Bosch GmbH, Renningen, Germany
[2] Tech Univ Dresden, Dresden, Germany
关键词
D O I
10.1109/iscas45731.2020.9181236
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Approximate Computing is a promising paradigm for mitigating computational requirements of Deep Neural Networks (DNN), by taking advantage of their inherent error resilience. Specifically, the use of approximate multipliers in DNN inference can lead to significant improvements in power consumption of embedded DNN applications. This paper presents a methodology for efficient approximate multiplier selection and for full and uniform approximation of large DNNs, through retraining and minimization of the approximation error. We evaluate our methodology using 422 approximate multipliers from the EvoApprox library, with three different Residual architectures trained with Cifar10, and achieve energy savings of up to 18% surpassing the original floating-point accuracy, and of up to 58% with an accuracy loss of 0.73%.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] An Efficient Optimization Technique for Training Deep Neural Networks
    Mehmood, Faisal
    Ahmad, Shabir
    Whangbo, Taeg Keun
    [J]. MATHEMATICS, 2023, 11 (06)
  • [2] Linear Approximation of Deep Neural Networks for Efficient Inference on Video Data
    Rueckauer, Bodo
    Liu, Shih-Chii
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [3] DELAFO: An Efficient Portfolio Optimization Using Deep Neural Networks
    Hieu K Cao
    Han K Cao
    Binh T Nguyen
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 623 - 635
  • [4] Empowering deep neural quantum states through efficient optimization
    Chen, Ao
    Heyl, Markus
    [J]. NATURE PHYSICS, 2024,
  • [5] Approximation Spaces of Deep Neural Networks
    Gribonval, Remi
    Kutyniok, Gitta
    Nielsen, Morten
    Voigtlaender, Felix
    [J]. CONSTRUCTIVE APPROXIMATION, 2022, 55 (01) : 259 - 367
  • [6] Approximation Spaces of Deep Neural Networks
    Rémi Gribonval
    Gitta Kutyniok
    Morten Nielsen
    Felix Voigtlaender
    [J]. Constructive Approximation, 2022, 55 : 259 - 367
  • [7] Constrained Optimization Based Low-Rank Approximation of Deep Neural Networks
    Li, Chong
    Shi, C. J. Richard
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 746 - 761
  • [8] Automatic Configuration of Deep Neural Networks with Parallel Efficient Global Optimization
    van Stein, Bas
    Wang, Hao
    Back, Thomas
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Efficient Hardware Optimization Strategies for Deep Neural Networks Acceleration Chip
    Zhang Meng
    Zhang Jingwei
    Li Guoqing
    Wu Ruixia
    Zeng Xiaoyang
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1510 - 1517
  • [10] Asynchronous Optimization Methods for Efficient Training of Deep Neural Networks with Guarantees
    Kungurtsev, Vyacheslav
    Egan, Malcolm
    Chatterjee, Bapi
    Alistarh, Dan
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8209 - 8216