Quality-driven design of deep neural network hardware accelerators for low power CPS and IoT applications

被引:0
|
作者
Jan, Yahya [1 ]
Jozwiak, Lech [1 ]
机构
[1] Eindhoven Univ Technol, Fac Elect Engn, Eindhoven, Netherlands
关键词
Deep Neural Networks (DNN); Cyber-Physical System (CPS); Internet of Things (IoT); Highly-parallel DNN architectures; Design Space Exploration (DSE); Low power design techniques; GENERATION;
D O I
10.1016/j.micpro.2024.105119
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the results of our analysis of the main problems that have to be solved in the design of highly parallel high-performance accelerators for Deep Neural Networks (DNNs) used in low power Cyber- Physical System (CPS) and Internet of Things (IoT) devices, in application areas such as smart automotive, health and smart services in social networks (Facebook, Instagram, X/Twitter, etc.). Our analysis demonstrates that to arrive a to high-quality DNN accelerator architecture, complex mutual trade-offs have to be resolved among the accelerator micro- and macro-architecture, and the corresponding memory and communication architectures, as well as among the performance, power consumption and area. Therefore, we developed a multi-processor accelerator design methodology involving an automatic design-space exploration (DSE) framework that enables a very efficient construction and analysis of DNN accelerator architectures, as well as an adequate trade-off exploitation. To satisfy the low power demands of IoT devices, we extend our quality- driven model-based multi-processor accelerator design methodology with some novel power optimization techniques at the Processor's and memory exploration stages. Our proposed power optimization techniques at the processor's exploration stage achieve up to 66.5% reduction in power consumption, while our proposed data reuse techniques avoid up to 85.92% of redundant memory accesses thereby reducing the power consumption of accelerator necessary for low-power IoT applications. Currently, we are beginning to apply this methodology with the proposed power optimization techniques to the design of low-power DNN accelerators for IoT applications.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Surrogate Model based Co-Optimization of Deep Neural Network Hardware Accelerators
    Woehrle, Hendrik
    Alvarez, Mariela De Lucas
    Schlenke, Fabian
    Walsemann, Alexander
    Karagounis, Michael
    Kirchner, Frank
    2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 40 - 45
  • [22] Special Session: Effective In-field Testing of Deep Neural Network Hardware Accelerators
    Kundu, Shamik
    Banerjee, Suvadeep
    Raha, Arnab
    Basu, Kanad
    2022 IEEE 40TH VLSI TEST SYMPOSIUM (VTS), 2022,
  • [23] Exploring Quantization and Mapping Synergy in Hardware-Aware Deep Neural Network Accelerators
    Klhufek, Jan
    Safar, Miroslav
    Mrazek, Vojtech
    Vasicek, Zdenek
    Sekanina, Lukas
    2024 27TH INTERNATIONAL SYMPOSIUM ON DESIGN & DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS, DDECS, 2024, : 1 - 6
  • [24] A training method for deep neural network inference accelerators with high tolerance for their hardware imperfection
    Gao, Shuchao
    Ohsawa, Takashi
    JAPANESE JOURNAL OF APPLIED PHYSICS, 2024, 63 (02)
  • [25] Efficient Hardware Approximation for Bit-Decomposition Based Deep Neural Network Accelerators
    Soliman, Taha
    Eldebiky, Amro
    De La Parra, Cecilia
    Guntoro, Andre
    Wehn, Norbert
    2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 77 - 82
  • [26] Fluid catalytic cracking process quality-driven fault detection based on partial least squares and deep feedforward neural network
    Yang, Jiandong
    Li, Jiangsheng
    Yan, Shifu
    Wang, Yangfeng
    Zhang, Ying
    Yan, Xuefeng
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024, 46 (01) : 78 - 92
  • [27] Design and implementation of hybrid low power wide area network architecture for IoT applications
    Shilpa, B.
    Jha, Rajesh Kumar
    Naware, Vaibhav
    Vattem, Anuradha
    Hussain, Aftab M.
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2024, 16 (02) : 201 - 213
  • [28] Understanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and Applications
    Li, Guanpeng
    Hari, Siva Kumar Sastry
    Sullivan, Michael
    Tsai, Timothy
    Pattabiraman, Karthik
    Emer, Joel
    Keckler, Stephen W.
    SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
  • [29] Constrained deep neural network architecture search for IoT devices accounting for hardware calibration
    Scheidegger, Florian
    Benini, Luca
    Bekas, Costas
    Malossi, Cristiano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [30] Modulation Recognition with Untrained Deep Neural Network for IoT and Mobile Applications
    Woo, Jongseok
    Jung, Kuchul
    Mukhopadhyay, Saibal
    2024 IEEE RADIO AND WIRELESS SYMPOSIUM, RWS, 2024, : 54 - 57