A Dynamically Reconfigurable Accelerator Design Using a Sparse-Winograd Decomposition Algorithm for CNNs

被引:2
|
作者
Zhao, Yunping [1 ]
Lu, Jianzhuang [1 ]
Chen, Xiaowen [1 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 66卷 / 01期
关键词
High performance computing; accelerator architecture; hardware; NEURAL-NETWORK; CONVOLUTION;
D O I
10.32604/cmc.2020.012380
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) are widely used in many fields. Due to their high throughput and high level of computing characteristics, however, an increasing number of researchers are focusing on how to improve the computational efficiency, hardware utilization, or flexibility of CNN hardware accelerators. Accordingly, this paper proposes a dynamically reconfigurable accelerator architecture that implements a Sparse-Winograd F(2 x 2.3 x 3)-based high-parallelism hardware architecture. This approach not only eliminates the pre-calculation complexity associated with the Winograd algorithm, thereby reducing the difficulty of hardware implementation, but also greatly improves the flexibility of the hardware; as a result, the accelerator can realize the calculation of Conventional Convolution, Grouped Convolution (GCONV) or Depthwise Separable Convolution (DSC) using the same hardware architecture. Our experimental results show that the accelerator achieves a 3x-4.14x speedup compared with the designs that do not use the acceleration algorithm on VGG-16 and MobileNet V1. Moreover, compared with previous designs using the traditional Winograd algorithm, the accelerator design achieves 1.4x-1.8x speedup. At the same time, the efficiency of the multiplier improves by up to 142%.
引用
收藏
页码:517 / 535
页数:19
相关论文
共 50 条
  • [1] SpWMM: A High-Performance Sparse-Winograd Matrix-Matrix Multiplication Accelerator for CNNs
    Wu, Di
    Cao, Wei
    Wang, Lingli
    2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 255 - 258
  • [2] An Accelerator Design Using a MTCA Decomposition Algorithm for CNNs
    Zhao, Yunping
    Lu, Jianzhuang
    Chen, Xiaowen
    SENSORS, 2020, 20 (19) : 1 - 15
  • [3] A LOW-LATENCY SPARSE-WINOGRAD ACCELERATOR FOR CONVOLUTIONAL NEURAL NETWORKS
    Wang, Haonan
    Liu, Wenjian
    Xu, Tianyi
    Lin, Jun
    Wang, Zhongfeng
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1448 - 1452
  • [4] SWM: A High-Performance Sparse-Winograd Matrix Multiplication CNN Accelerator
    Wu, Di
    Fan, Xitian
    Cao, Wei
    Wang, Lingli
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (05) : 936 - 949
  • [5] WRA: A 2.2-to-6.3 TOPS Highly Unified Dynamically Reconfigurable Accelerator Using a Novel Winograd Decomposition Algorithm for Convolutional Neural Networks
    Yang, Chen
    Wang, Yizhou
    Wang, Xiaoli
    Geng, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2019, 66 (09) : 3480 - 3493
  • [6] WinTA: An Efficient Reconfigurable CNN Training Accelerator With Decomposition Winograd
    Lu, Jinming
    Wang, Hui
    Lin, Jun
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (02) : 634 - 645
  • [7] Design and Optimization of Target Detection Accelerator Based on Winograd Algorithm
    Li B.
    Qi Y.-R.
    Zhou Q.-L.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (10): : 2387 - 2397
  • [8] A reconfigurable accelerator based on fast Winograd algorithm for convolutional neural network in Internet of Things
    Yang, Chen
    Wang, YiZhou
    Wang, XiaoLi
    Geng, Li
    2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 613 - 615
  • [9] An FPGA-Based Reconfigurable CNN Training Accelerator Using Decomposable Winograd
    Wang, Hui
    Lu, Jinming
    Lin, Jun
    Wang, Zhongfeng
    2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 175 - 180
  • [10] An Operation-Minimized FPGA Accelerator Design by Dynamically Exploiting Sparsity in CNN Winograd Transform
    Di, Xinkai
    Yang, Haigang
    Huang, Zhihong
    Mao, Ning
    32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 50 - 55