Design and implementation of an efficient CNN accelerator for low-cost FPGAs

Cited by: 7
Authors
Xu Y. [1]
Wang S. [1]
Li N. [1]
Xiao H. [1]
Affiliations
[1] School of Microelectronics, Hefei University of Technology, Hefei
Funding
National Natural Science Foundation of China
Keywords
convolutional neural network; FPGA; hardware accelerator
DOI
10.1587/elex.19.20220370
Abstract
This paper proposes a computation-array-centered dataflow that maps convolutions with different kernel sizes onto a unified computing scheme and reduces the dimension of the computation array from 2D to 1D, so as to maximize the utilization of the computation elements offered by the accelerator. Furthermore, a single unit multiple data (SUMD) strategy is proposed to alleviate the mismatch between quantized data and the fixed-bit-width hardware resources on the FPGA. As a case study, an 8-bit MobileNetV2 model has been implemented on the low-cost ZYNQ XC7Z020 FPGA, where FPS/DSP and GOPS/DSP reach up to 0.55 and 0.35, respectively. © 2022 The Institute of Electronics.
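The abstract does not spell out how SUMD works. As a rough illustration of the mismatch it mentions, the sketch below shows a commonly used operand-packing trick in which one wide multiplier produces two signed 8-bit products that share an operand. This is an assumption made for illustration and may differ from the paper's actual SUMD scheme; the function name `packed_multiply` and the 18-bit shift are hypothetical choices, not taken from the paper.

```python
# Illustrative sketch only: packing two signed 8-bit multiplications that
# share one operand into a single wide multiplication. This is NOT claimed
# to be the paper's SUMD strategy; names and the shift width are assumptions.

SHIFT = 18  # assumed spacing between the two packed operands, in bits


def packed_multiply(a: int, b: int, c: int) -> tuple[int, int]:
    """Return (a * c, b * c) using one wide multiplication."""
    for value in (a, b, c):
        if not -128 <= value <= 127:
            raise ValueError("operands must fit in signed 8 bits")

    # Place a in the upper bits and b in the lower bits of one wide word.
    packed = (a << SHIFT) + b

    # One multiplication yields (a * c) * 2**SHIFT + (b * c).
    product = packed * c

    # Recover b * c as a signed SHIFT-bit value from the low bits.
    low = product & ((1 << SHIFT) - 1)
    if low >= 1 << (SHIFT - 1):
        low -= 1 << SHIFT

    # Remove the low product; what remains is exactly (a * c) << SHIFT.
    high = (product - low) >> SHIFT
    return high, low


if __name__ == "__main__":
    print(packed_multiply(3, -5, 7))
```

The sign correction on the low field is needed because a negative `b * c` borrows from the upper field; subtracting the recovered low product before shifting undoes that borrow.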