RSNN: A Software/Hardware Co-Optimized Framework for Sparse Convolutional Neural Networks on FPGAs

被引:14
|
作者
You, Weijie [1 ]
Wu, Chang [1 ]
机构
[1] Fudan Univ, Sch Microelect, State Key Lab ASIC & Syst, Shanghai 200433, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷 / 09期
关键词
Kernel; Field programmable gate arrays; Hardware; Convolutional neural networks; Sparse matrices; Optimization; Digital signal processors; Accelerator; convolutional neural network; FPGA; sparse neural network;
D O I
10.1109/ACCESS.2020.3047144
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) have been shown to be very useful in image recognition and other Artificial Intelligence (AI) applications, however, at the expense of intensive computation requirement. To address the challenge of overwhelming calculation requirements, researchers have proposed various network pruning techniques. But, due to the irregular sparse patterns, unstructured sparse networks are difficult to compute efficiently on either Graphic processing units (GPUs) or Field Programmable Gate Arrays (FPGAs). In this paper, we propose a software/hardware co-optimized Reconfigurable Sparse convolutional Neural Network accelerator design (RSNN) on FPGAs. A novel sparse convolution dataflow is proposed with simpler control logic than existing mux-based selection logic. To balance the computation load on different Processing Units (PUs), we propose a software-based load-balance aware pruning technique as well as a kernel merging method. Experimental results show that RSNN is 2.41x-7.91x better on Digital Signal Processor (DSP) efficiency than previous dense CNN FPGA accelerators, and 1.23x-2.93x better than previous sparse CNN FPGA accelerators.
引用
收藏
页码:949 / 960
页数:12
相关论文
共 50 条
  • [1] An Efficient Hardware Accelerator for Sparse Convolutional Neural Networks on FPGAs
    Lu, Liqiang
    Xie, Jiaming
    Huang, Ruirui
    Zhang, Jiansong
    Lin, Wei
    Liang, Yun
    [J]. 2019 27TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2019, : 17 - 25
  • [2] Software/Hardware Co-Design Optimization for Sparse Convolutional Neural Networks
    Hu, Wei
    Dong, Yong
    Liu, Fang
    Jiao, Qiang
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2069 - 2074
  • [3] An Efficient Hardware Accelerator for Structured Sparse Convolutional Neural Networks on FPGAs
    Zhu, Chaoyang
    Huang, Kejie
    Yang, Shuyuan
    Zhu, Ziqi
    Zhang, Hejia
    Shen, Haibin
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (09) : 1953 - 1965
  • [4] An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse Transformers
    Fang, Chao
    Zhou, Aojun
    Wang, Zhongfeng
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (11) : 1573 - 1586
  • [5] Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
    DiCecco, Roberto
    Lacey, Griffin
    Vasiljevic, Jasmina
    Chow, Paul
    Taylor, Graham
    Areibi, Shawki
    [J]. 2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 265 - 268
  • [6] fpgaConvNet: A Framework for Mapping Convolutional Neural Networks on FPGAs
    Venieris, Stylianos I.
    Bouganis, Christos-Savvas
    [J]. 2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, : 40 - 47
  • [7] An algorithm/hardware co-optimized method to accelerate CNNs with compressed convolutional weights on FPGA
    Shang, Jiangwei
    Zhang, Zhan
    Zhang, Kun
    Li, Chuanyou
    Qian, Lei
    Liu, Hongwei
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (11):
  • [8] A Systolic SNN Inference Accelerator and its Co-optimized Software Framework
    Guo, Shasha
    Wang, Lei
    Wang, Shuquan
    Deng, Yu
    Yang, Zhijie
    Li, Shiming
    Xie, Zhige
    Dou, Qiang
    [J]. GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 63 - 68
  • [9] SpWA: An Efficient Sparse Winograd Convolutional Neural Networks Accelerator on FPGAs
    Lu, Liqiang
    Liang, Yun
    [J]. 2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [10] Hardware/Software Co-Exploration for Graph Neural Architectures on FPGAs
    Lu, Qing
    Jiang, Weiwen
    Jiang, Meng
    Hu, Jingtong
    Shi, Yiyu
    [J]. 2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 358 - 362