Resource and Data Optimization for Hardware Implementation of Deep Neural Networks Targeting FPGA-based Edge Devices

被引:2
|
作者
Liu, Xinheng [1 ,2 ]
Kim, Dae Hee [1 ]
Wu, Chang [3 ]
Chen, Deming [1 ,2 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] Inspirit IoT Inc, Champaign, IL 61822 USA
[3] Fudan Univ, Shanghai, Peoples R China
来源
2018 ACM/IEEE INTERNATIONAL WORKSHOP ON SYSTEM LEVEL INTERCONNECT PREDICTION (SLIP) | 2018年
关键词
FPGA; Convolutional Neural Network; Optimization; Acceleration; High-Level Synthesis;
D O I
10.1145/3225209.3225214
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, as machine learning algorithms have become more practical, there has been much effort to implement them on edge devices that can be used in our daily lives. However, unlike server-scale devices, edge devices are relatively small and thus have much more limited resources. Therefore, control of resource usage and hardware optimization play an important role when we implement machine learning algorithms on an edge device. In this paper, we target convolutional neural networks (CNN) and explore various optimization and design techniques to realize them on FPGA devices. The key idea explored in this paper is Backward Pipeline Scheduling together with Latency Balancing which optimize the pipeline between CNN layers in order to significantly reduce the overall latency for processing a single image. We also develop a batch processing design to improve the throughput of the FPGA solution. We have achieved latency of 175.7 mu s for classifying one image in the MNIST data set using LeNet and 653.4 mu s for classifying one image in Cifar-10 data set using CifarNet. Without retraining, we are still able to maintain high accuracy of 97.6% for MNIST data set and 83.6% for the Cifar-10 data set. Our achieved single-image latency is 5.2x faster for LeNet and 1.95x faster for CifarNet compared to the NVIDIA Jetson TX1 solution.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] The Impact of Hardware Folding on Dependability in Spaceborne FPGA-based Neural Networks
    Souvatzoglou, Ioanna
    Agiakatsikas, Dimitris
    Antonopoulos, George
    Vlagkoulis, Vasileios
    Sari, Aitzan
    Papadimitriou, Athanasios
    Psarakis, Mihalis
    2022 21ST INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2022), 2022, : 324 - 324
  • [12] A survey of FPGA-based hardware implementation of ANNs
    Liu, JH
    Liang, DQ
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 915 - 918
  • [13] Efficient FPGA-Based Convolutional Neural Network Implementation for Edge Computing
    Cuong, Pham-Quoc
    Thinh, Tran Ngoc
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (03) : 479 - 487
  • [14] Composite FPGA-based Accelerator for Deep Convolutional Neural Networks
    HuanZhang
    YuanYang
    YangXiao
    2019 IEEE INTERNATIONAL CONFERENCE ON ELECTRON DEVICES AND SOLID-STATE CIRCUITS (EDSSC), 2019,
  • [15] FPGA-Based Lightweight Hardware Architecture of the PHOTON Hash Function for IoT Edge Devices
    Al-Shatari, Mohammed
    Hussin, Fawnizu Azmadi
    Abd Aziz, Azrina
    Witjaksono, Gunawan
    Xuan-Tu Tran
    IEEE ACCESS, 2020, 8 (08): : 207610 - 207618
  • [16] FPGA-based hardware implementation of chaotic opposition-based arithmetic optimization algorithm
    Zermani, Mohamed Aymen
    Manita, Ghaith
    Chhabra, Amit
    Feki, Elyes
    Mami, Abdelkader
    APPLIED SOFT COMPUTING, 2024, 154
  • [17] Hardware Implementation of Spiking Neural Networks on FPGA
    Han, Jianhui
    Li, Zhaolin
    Zheng, Weimin
    Zhang, Youhui
    TSINGHUA SCIENCE AND TECHNOLOGY, 2020, 25 (04) : 479 - 486
  • [18] Hardware Implementation of Spiking Neural Networks on FPGA
    Jianhui Han
    Zhaolin Li
    Weimin Zheng
    Youhui Zhang
    TsinghuaScienceandTechnology, 2020, 25 (04) : 479 - 486
  • [19] FPGA-based implementation of deep neural network using stochastic computing
    Nobari, Maedeh
    Jahanirad, Hadi
    APPLIED SOFT COMPUTING, 2023, 137
  • [20] FPGA-based design and implementation of the location attention mechanism in neural networks
    Qiao, Ruixiu
    Guo, Xiaozhou
    Mao, Wenyu
    Li, Jixing
    Lu, Huaxiang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (04) : 5309 - 5323