High-performance Convolutional Neural Network Accelerator Based on Systolic Arrays and Quantization

被引:0
|
作者
Li, Yufeng [1 ]
Lu, Shengli [1 ]
Luo, Jihe [1 ]
Pang, Wei [1 ]
Liu, Hao [1 ]
机构
[1] Southeast Univ, Natl ASIC Syst Engn Res Ctr, Nanjing, Peoples R China
关键词
CNN accelerator; systolic arrays; quantization; high-performance; FPGA;
D O I
10.1109/siprocess.2019.8868327
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years, convolutional neural networks (CNN) has achieved great success in computer vision tasks, such as object detection, face recognition and image classification. With the diversification of CNN models, accelerators that only support a single network can no longer meet the needs of applications. Due to the computational intensiveness of convolution operations, the implementation of CNN on the FPGA platform faces many challenges. In this paper, a convolution unit based on systolic arrays is proposed in the design of CNN accelerator, and the fixed-point quantization method is adopted to save a large amount of storage resources and reduce the required transmission bandwidth, thus improving throughput and power efficiency. The performance density and power efficiency of our design can reach 0.165 GOPS/DSP and 36.3 GOPS/W under 100 MHz clock frequency.
引用
收藏
页码:335 / 339
页数:5
相关论文
共 50 条
  • [1] High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network
    Vardhana, M.
    Pinto, Rohan
    IEEE COMPUTER ARCHITECTURE LETTERS, 2025, 24 (01) : 21 - 24
  • [2] Convolutional Neural Network Accelerator with Vector Quantization
    Lee, Heng
    Wu, Yi-Heng
    Lin, Yu-Sheng
    Chien, Shao-Yi
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [3] Systolic-based pyramidal neuron accelerator blocks for convolutional neural network
    Ahmed, Hossam O.
    Ghoneima, Maged
    Dessouky, Mohamed
    MICROELECTRONICS JOURNAL, 2019, 89 : 16 - 22
  • [4] VHDL Generator for A High Performance Convolutional Neural Network FPGA-Based Accelerator
    Hamdan, Muhammad K.
    Rover, Diane T.
    2017 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2017,
  • [5] HIGH-PERFORMANCE PACKET ROUTING BASED ON SYSTOLIC ARRAYS
    MILLER, PR
    YANTCHEV, JT
    JESSHOPE, CR
    SYSTOLIC ARRAY PROCESSORS, 1989, : 620 - 630
  • [6] A High-Performance Accelerator for Large-Scale Convolutional Neural Networks
    Sun, Fan
    Wang, Chao
    Gong, Lei
    Xu, Chongchong
    Zhang, Yiwei
    Lu, Yuntao
    Li, Xi
    Zhou, Xuehai
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 622 - 629
  • [7] Design of High Performance Convolutional Neural Network Accelerator for Embedded FPGA
    Zeng C.
    Liu Q.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (09): : 1645 - 1652
  • [8] GoSPA: An Energy-efficient High-performance Globally Optimized SParse Convolutional Neural Network Accelerator
    Deng, Chunhua
    Sui, Yang
    Liao, Siyu
    Qian, Xuehai
    Yuan, Bo
    2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 1110 - 1123
  • [9] An energy-efficient convolutional neural network accelerator for speech classification based on FPGA and quantization
    Wen, Dong
    Jiang, Jingfei
    Dou, Yong
    Xu, Jinwei
    Xiao, Tao
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2021, 3 (01) : 4 - 16
  • [10] An energy-efficient convolutional neural network accelerator for speech classification based on FPGA and quantization
    Dong Wen
    Jingfei Jiang
    Yong Dou
    Jinwei Xu
    Tao Xiao
    CCF Transactions on High Performance Computing, 2021, 3 : 4 - 16