Quality-driven design of deep neural network hardware accelerators for low power CPS and IoT applications

被引：0

作者：

Jan, Yahya ^{[1
]}

Jozwiak, Lech ^{[1
]}

机构：

[1] Eindhoven Univ Technol, Fac Elect Engn, Eindhoven, Netherlands

来源：

MICROPROCESSORS AND MICROSYSTEMS | 2024年 / 111卷

关键词：

Deep Neural Networks (DNN); Cyber-Physical System (CPS); Internet of Things (IoT); Highly-parallel DNN architectures; Design Space Exploration (DSE); Low power design techniques; GENERATION;

D O I：

10.1016/j.micpro.2024.105119

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents the results of our analysis of the main problems that have to be solved in the design of highly parallel high-performance accelerators for Deep Neural Networks (DNNs) used in low power Cyber- Physical System (CPS) and Internet of Things (IoT) devices, in application areas such as smart automotive, health and smart services in social networks (Facebook, Instagram, X/Twitter, etc.). Our analysis demonstrates that to arrive a to high-quality DNN accelerator architecture, complex mutual trade-offs have to be resolved among the accelerator micro- and macro-architecture, and the corresponding memory and communication architectures, as well as among the performance, power consumption and area. Therefore, we developed a multi-processor accelerator design methodology involving an automatic design-space exploration (DSE) framework that enables a very efficient construction and analysis of DNN accelerator architectures, as well as an adequate trade-off exploitation. To satisfy the low power demands of IoT devices, we extend our quality- driven model-based multi-processor accelerator design methodology with some novel power optimization techniques at the Processor's and memory exploration stages. Our proposed power optimization techniques at the processor's exploration stage achieve up to 66.5% reduction in power consumption, while our proposed data reuse techniques avoid up to 85.92% of redundant memory accesses thereby reducing the power consumption of accelerator necessary for low-power IoT applications. Currently, we are beginning to apply this methodology with the proposed power optimization techniques to the design of low-power DNN accelerators for IoT applications.

引用

页数：13

共 50 条

[21] Surrogate Model based Co-Optimization of Deep Neural Network Hardware Accelerators
Woehrle, Hendrik
Alvarez, Mariela De Lucas
Schlenke, Fabian
Walsemann, Alexander
Karagounis, Michael
Kirchner, Frank
2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 40 - 45
[22] Special Session: Effective In-field Testing of Deep Neural Network Hardware Accelerators
Kundu, Shamik
Banerjee, Suvadeep
Raha, Arnab
Basu, Kanad
2022 IEEE 40TH VLSI TEST SYMPOSIUM (VTS), 2022,
[23] Exploring Quantization and Mapping Synergy in Hardware-Aware Deep Neural Network Accelerators
Klhufek, Jan
Safar, Miroslav
Mrazek, Vojtech
Vasicek, Zdenek
Sekanina, Lukas
2024 27TH INTERNATIONAL SYMPOSIUM ON DESIGN & DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS, DDECS, 2024, : 1 - 6
[24] A training method for deep neural network inference accelerators with high tolerance for their hardware imperfection
Gao, Shuchao
Ohsawa, Takashi
JAPANESE JOURNAL OF APPLIED PHYSICS, 2024, 63 (02)
[25] Efficient Hardware Approximation for Bit-Decomposition Based Deep Neural Network Accelerators
Soliman, Taha
Eldebiky, Amro
De La Parra, Cecilia
Guntoro, Andre
Wehn, Norbert
2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 77 - 82
[26] Fluid catalytic cracking process quality-driven fault detection based on partial least squares and deep feedforward neural network
Yang, Jiandong
Li, Jiangsheng
Yan, Shifu
Wang, Yangfeng
Zhang, Ying
Yan, Xuefeng
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024, 46 (01) : 78 - 92
[27] Design and implementation of hybrid low power wide area network architecture for IoT applications
Shilpa, B.
Jha, Rajesh Kumar
Naware, Vaibhav
Vattem, Anuradha
Hussain, Aftab M.
JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2024, 16 (02) : 201 - 213
[28] Understanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and Applications
Li, Guanpeng
Hari, Siva Kumar Sastry
Sullivan, Michael
Tsai, Timothy
Pattabiraman, Karthik
Emer, Joel
Keckler, Stephen W.
SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
[29] Constrained deep neural network architecture search for IoT devices accounting for hardware calibration
Scheidegger, Florian
Benini, Luca
Bekas, Costas
Malossi, Cristiano
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[30] Modulation Recognition with Untrained Deep Neural Network for IoT and Mobile Applications
Woo, Jongseok
Jung, Kuchul
Mukhopadhyay, Saibal
2024 IEEE RADIO AND WIRELESS SYMPOSIUM, RWS, 2024, : 54 - 57

← 1 2 3 4 5 →