Hardware accelerator systems for embedded systems

被引：3

作者：

Song, William J. ^{[1
]}

机构：

[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea

来源：

HARDWARE ACCELERATOR SYSTEMS FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING | 2021年 / 122卷

关键词：

DEEP NEURAL-NETWORKS; POWER;

D O I：

10.1016/bs.adcom.2020.11.004

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This chapter describes various engineering considerations and constraints to deploy neural network applications in embedded systems and presents a variety of processing solutions to accelerate neural network computations in the embedded hardware. Deep learning on embedded systems has potentially many advantages for security, privacy, latency, energy, power, etc. However, deploying the deep neural networks in embedded systems imposes numerous hardware challenges on the resource-limited embedded edge devices. Embedded systems for deep learning typically target on providing rapid inferences, and thus latency rather than throughput in general becomes the primary objective for the executions of embedded hardware. The central point of hardware acceleration in embedded systems is to place neural network computations closer to I/Os and sensors to provide fast inferences. With continued advances in processor technologies, embedded edge devices evolve to become capable of handling compute-intensive workloads at low power. Such a trend propels integrating the hardware acceleration of deep neural networks into the embedded systems. There is not a universal solution for all different kinds of embedded systems. Different embedded processing solutions can be employed to accelerate the neural network applications depending on their performance requirements, operating conditions (e.g., network connectivity, power and thermal constraints), costs, etc. These considerations leave a wide range of hardware options for the embedded systems. The embedded hardware to accelerate neural network applications ranges from single-board devices such as Google Edge TPU to high-performance processors such as Intel Xeon and AMD EPYC CPUs, NVIDIA GPUs with Tensor Cores. All of these hardware choices for the neural network acceleration provide distinct features and computational capabilities in the embedded systems.

引用

页码：23 / 49

页数：27

共 50 条

[1] A CONFIGURABLE SVM HARDWARE ACCELERATOR FOR EMBEDDED SYSTEMS
Yuan, Tengyue
Xu, Gaowei
Zou, Yao
Han, Jun
Zeng, Xiaoyang
[J]. 2014 12TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2014,
[2] A Heterogeneous Hardware Accelerator for Image Classification in Embedded Systems
Perez, Ignacio
Figueroa, Miguel
[J]. SENSORS, 2021, 21 (08)
[3] A Programmable Hardware Accelerator for Simulating Dynamical Systems
Kung, Jaeha
Long, Yun
Kim, Duckhwan
Mukhopadhyay, Saibal
[J]. 44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 403 - 415
[4] Hardware/software codesign for embedded systems
Harrison, J
[J]. IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1998, 145 (03): : 153 - 153
[5] HARDWARE ESTIMATION METHOD FOR EMBEDDED SYSTEMS
BLOMQVIST, H
HAAKER, J
[J]. MECHATRONICS, 1992, 2 (01) : 101 - 114
[6] An Overview of Reconfigurable Hardware in Embedded Systems
Garcia, Philip
Compton, Katherine
Schulte, Michael
Blem, Emily
Fu, Wenyin
[J]. EURASIP JOURNAL ON EMBEDDED SYSTEMS, 2006, (01) : 1 - 19
[7] Cryptographic Hardware & Embedded Systems for Communications
Sklavos, Nicolas
[J]. 2012 IEEE FIRST AESS EUROPEAN CONFERENCE ON SATELLITE TELECOMMUNICATIONS (ESTEL), 2012,
[8] An Open Architecture for Embedded Systems: Hardware Open Systems Technologies
Benjamin, William
Kromer, Eric
Schnelle, Christopher
[J]. SOUTHEASTCON 2017, 2017,
[9] Hardware/software partitioning of embedded systems with multiple hardware processes
Hendry, DC
Sananikone, DS
[J]. IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1997, 144 (05): : 285 - 294
[10] Reconfigurable hardware accelerator for embedded DSP
Reeves, K
Sienski, K
Field, C
[J]. HIGH-SPEED COMPUTING, DIGITAL SIGNAL PROCESSING, AND FILTERING USING RECONFIGURABLE LOGIC, 1996, 2914 : 332 - 340

← 1 2 3 4 5 →