Hardware accelerator systems for embedded systems

被引:3
|
作者
Song, William J. [1 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
关键词
DEEP NEURAL-NETWORKS; POWER;
D O I
10.1016/bs.adcom.2020.11.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This chapter describes various engineering considerations and constraints to deploy neural network applications in embedded systems and presents a variety of processing solutions to accelerate neural network computations in the embedded hardware. Deep learning on embedded systems has potentially many advantages for security, privacy, latency, energy, power, etc. However, deploying the deep neural networks in embedded systems imposes numerous hardware challenges on the resource-limited embedded edge devices. Embedded systems for deep learning typically target on providing rapid inferences, and thus latency rather than throughput in general becomes the primary objective for the executions of embedded hardware. The central point of hardware acceleration in embedded systems is to place neural network computations closer to I/Os and sensors to provide fast inferences. With continued advances in processor technologies, embedded edge devices evolve to become capable of handling compute-intensive workloads at low power. Such a trend propels integrating the hardware acceleration of deep neural networks into the embedded systems. There is not a universal solution for all different kinds of embedded systems. Different embedded processing solutions can be employed to accelerate the neural network applications depending on their performance requirements, operating conditions (e.g., network connectivity, power and thermal constraints), costs, etc. These considerations leave a wide range of hardware options for the embedded systems. The embedded hardware to accelerate neural network applications ranges from single-board devices such as Google Edge TPU to high-performance processors such as Intel Xeon and AMD EPYC CPUs, NVIDIA GPUs with Tensor Cores. All of these hardware choices for the neural network acceleration provide distinct features and computational capabilities in the embedded systems.
引用
收藏
页码:23 / 49
页数:27
相关论文
共 50 条
  • [1] A CONFIGURABLE SVM HARDWARE ACCELERATOR FOR EMBEDDED SYSTEMS
    Yuan, Tengyue
    Xu, Gaowei
    Zou, Yao
    Han, Jun
    Zeng, Xiaoyang
    [J]. 2014 12TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2014,
  • [2] A Heterogeneous Hardware Accelerator for Image Classification in Embedded Systems
    Perez, Ignacio
    Figueroa, Miguel
    [J]. SENSORS, 2021, 21 (08)
  • [3] A Programmable Hardware Accelerator for Simulating Dynamical Systems
    Kung, Jaeha
    Long, Yun
    Kim, Duckhwan
    Mukhopadhyay, Saibal
    [J]. 44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 403 - 415
  • [4] Hardware/software codesign for embedded systems
    Harrison, J
    [J]. IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1998, 145 (03): : 153 - 153
  • [5] HARDWARE ESTIMATION METHOD FOR EMBEDDED SYSTEMS
    BLOMQVIST, H
    HAAKER, J
    [J]. MECHATRONICS, 1992, 2 (01) : 101 - 114
  • [6] An Overview of Reconfigurable Hardware in Embedded Systems
    Garcia, Philip
    Compton, Katherine
    Schulte, Michael
    Blem, Emily
    Fu, Wenyin
    [J]. EURASIP JOURNAL ON EMBEDDED SYSTEMS, 2006, (01) : 1 - 19
  • [7] Cryptographic Hardware & Embedded Systems for Communications
    Sklavos, Nicolas
    [J]. 2012 IEEE FIRST AESS EUROPEAN CONFERENCE ON SATELLITE TELECOMMUNICATIONS (ESTEL), 2012,
  • [8] An Open Architecture for Embedded Systems: Hardware Open Systems Technologies
    Benjamin, William
    Kromer, Eric
    Schnelle, Christopher
    [J]. SOUTHEASTCON 2017, 2017,
  • [9] Hardware/software partitioning of embedded systems with multiple hardware processes
    Hendry, DC
    Sananikone, DS
    [J]. IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1997, 144 (05): : 285 - 294
  • [10] Reconfigurable hardware accelerator for embedded DSP
    Reeves, K
    Sienski, K
    Field, C
    [J]. HIGH-SPEED COMPUTING, DIGITAL SIGNAL PROCESSING, AND FILTERING USING RECONFIGURABLE LOGIC, 1996, 2914 : 332 - 340