Design and implementation of an efficient CNN accelerator for low-cost FPGAs

被引:7
|
作者
Xu Y. [1 ]
Wang S. [1 ]
Li N. [1 ]
Xiao H. [1 ]
机构
[1] School of Microelectronics, Hefei University of Technology, Hefei
基金
中国国家自然科学基金;
关键词
convolutional neural network; FPGA; hardware accelerator;
D O I
10.1587/elex.19.20220370
中图分类号
学科分类号
摘要
This paper proposes a computation-array-centered dataflow, which adjusts the convolution with different kernel sizes to a unified computing manner and reduces the dimension of computation array from 2D to 1D, so as to maximize the utilization of the computation elements offered by the accelerator. Furthermore, a single unit multiple data (SUMD) strategy is proposed to effectively alleviate the mismatch between the quantized data and the hardware resources with fixed bit width on FPGA. As a case study, an 8-bit MobileNetV2 model has been implemented on the low-cost ZYNQ XC7Z020 FPGA, whose FPS/DSP and GOPS/DSP achieve upto 0.55 and 0.35 respectively. © 2022 The Institute of Electronics.
引用
收藏
相关论文
共 50 条
  • [41] Design and Implementation of Low-Cost Multitone Signal Generation Systems
    Cordeiro, Patricio
    Ramalho, Leonardo
    Souza, Vitoria
    Castro, Adalbery
    Klautau, Aldebaro
    2017 2ND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SYSTEMS, CIRCUITS AND TRANSDUCERS (INSCIT), 2017, : 88 - 93
  • [42] Design and implementation of low-cost portable potentiostat based on WeChat
    Shen, Xiaoyan
    Li, Ziqiang
    Ma, Lei
    Bian, Xiongheng
    Cheng, Xingsi
    Lou, Xiongjie
    JOURNAL OF THE SERBIAN CHEMICAL SOCIETY, 2022, 87 (05) : 603 - 614
  • [43] An Efficient CNN Accelerator Using Inter-Frame Data Reuse of Videos on FPGAs
    Li, Shengzhao
    Wang, Qin
    Jiang, Jianfei
    Sheng, Weiguang
    Jing, Naifeng
    Mao, Zhigang
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (11) : 1587 - 1600
  • [44] DESIGNING A LOW-COST SCALABLE GRAPHICS ACCELERATOR
    MALACKOWSKY, C
    HIGH PERFORMANCE SYSTEMS-THE MAGAZINE FOR TECHNOLOGY CHAMPIONS, 1989, 10 (05): : 68 - &
  • [45] Design Exploration of An Energy-Efficient Acceleration System for CNNs on Low-Cost Resource-Constraint SoC-FPGAs
    Wen, Shao-Cheng
    Huang, Po-Tsang
    2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022, : 234 - 237
  • [46] CNN inference simulator for accurate and efficient accelerator design
    Choi, Seong Bin
    Lee, Sang Seol
    Jang, Sung Joon
    2019 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2019, : 283 - 284
  • [47] Efficient Implementation on Low-Cost SoC-FPGAs of TLSv1.2 Protocol with ECC_AES Support for Secure IoT Coordinators
    Bellemou, Ahmed Mohamed
    Garcia, Antonio
    Castillo, Encarnacion
    Benblidia, Nadjia
    Anane, Mohamed
    Antonio Alvarez-Bermejo, Jose
    Parrilla, Luis
    ELECTRONICS, 2019, 8 (11)
  • [48] Low-cost Technique for Measuring Clock Duty Cycle on FPGAs
    Lee, Seongkwan
    Kim, Taehwan
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [49] Self-reconfigurable embedded systems on low-cost FPGAs
    Gonzalez, Ivan
    Aguayo, Estanislao
    Lopez-Buedo, Sergio
    IEEE MICRO, 2007, 27 (04) : 49 - 57
  • [50] Hardware Isolation Support for Low-Cost SoC-FPGAs
    Passaretti, Daniele
    Boehm, Felix
    Wilhelm, Martin
    Pionteck, Thilo
    ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2022, 2022, 13642 : 148 - 163