A Dynamically Reconfigurable Accelerator Design Using a Sparse-Winograd Decomposition Algorithm for CNNs

Cited by: 2
Authors
Zhao, Yunping [1]
Lu, Jianzhuang [1]
Chen, Xiaowen [1]
Affiliation
[1] Natl Univ Def Technol, Changsha, Peoples R China
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2021 / Vol. 66 / Issue 01
Keywords
High performance computing; accelerator architecture; hardware; NEURAL-NETWORK; CONVOLUTION;
DOI
10.32604/cmc.2020.012380
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Convolutional Neural Networks (CNNs) are widely used in many fields. Because CNN workloads demand high throughput and are computationally intensive, an increasing number of researchers are focusing on how to improve the computational efficiency, hardware utilization, or flexibility of CNN hardware accelerators. Accordingly, this paper proposes a dynamically reconfigurable accelerator with a high-parallelism hardware architecture based on Sparse-Winograd F(2 × 2, 3 × 3). This approach not only eliminates the pre-calculation complexity associated with the Winograd algorithm, thereby reducing the difficulty of hardware implementation, but also greatly improves the flexibility of the hardware; as a result, the accelerator can compute Conventional Convolution, Grouped Convolution (GCONV), or Depthwise Separable Convolution (DSC) using the same hardware architecture. Our experimental results show that, on VGG-16 and MobileNet V1, the accelerator achieves a 3x-4.14x speedup compared with designs that do not use the acceleration algorithm. Moreover, compared with previous designs using the traditional Winograd algorithm, the accelerator achieves a 1.4x-1.8x speedup. At the same time, the efficiency of the multiplier improves by up to 142%.
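For readers unfamiliar with the F(2 × 2, 3 × 3) notation in the abstract, the following is a minimal sketch of the standard (dense) Winograd minimal-filtering tile, which produces a 2 × 2 output from a 4 × 4 input tile and a 3 × 3 filter. It is not the paper's Sparse-Winograd decomposition or its hardware datapath; the function names (`winograd_f2x2_3x3`, `direct_conv`) and the pure-Python matrix helpers are assumptions made for illustration only.

```python
# Standard Winograd F(2x2, 3x3) transform matrices (textbook form, not taken from the paper).
BT = [[1, 0, -1, 0],
      [0, 1, 1, 0],
      [0, -1, 1, 0],
      [0, 1, 0, -1]]
G = [[1.0, 0.0, 0.0],
     [0.5, 0.5, 0.5],
     [0.5, -0.5, 0.5],
     [0.0, 0.0, 1.0]]
AT = [[1, 1, 1, 0],
      [0, 1, -1, -1]]


def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def winograd_f2x2_3x3(d, g):
    """2x2 output tile from a 4x4 input tile d and a 3x3 filter g."""
    u = matmul(matmul(G, g), transpose(G))    # transformed filter, 4x4
    v = matmul(matmul(BT, d), transpose(BT))  # transformed input, 4x4
    # Element-wise product: the only general multiplications (16 of them).
    m = [[u[i][j] * v[i][j] for j in range(4)] for i in range(4)]
    return matmul(matmul(AT, m), transpose(AT))  # inverse transform, 2x2


def direct_conv(d, g):
    """Reference sliding-window result (36 multiplications) for comparison."""
    return [[sum(d[i + r][j + c] * g[r][c] for r in range(3) for c in range(3))
             for j in range(2)] for i in range(2)]
```

In this sketch the element-wise product uses 16 multiplications per tile, against 36 for the direct computation. A sparse variant would additionally skip products whose transformed operand is zero; how the paper organizes that step is not described in this record.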
Pages: 517-535
Page count: 19
Related papers
50 in total
  • [31] Design and Implementation of Blowfish Algorithm using Reconfigurable Platform
    Nalawade, Shraphalya B.
    Gawali, Dhanashri H.
    2017 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN SIGNAL PROCESSING AND EMBEDDED SYSTEMS (RISE), 2017, : 479 - 484
  • [32] Reconfigurable Manufacturing System Design Using a Genetic Algorithm
    Bortolini, Marco
    Cafarella, Cristian
    Ferrari, Emilio
    Galizia, Francesco Gabriele
    Gamberi, Mauro
    SUSTAINABLE DESIGN AND MANUFACTURING, KES-SDM 2021, 2022, 262 : 130 - 139
  • [33] Design and analysis of a layer seven Network Processor accelerator using reconfigurable logic
    Memik, G
    Memik, SO
    Mangione-Smith, WH
    10TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2002, : 131 - 140
  • [34] An Efficient Convolutional Neural Network Accelerator Design on FPGA Using the Layer-to-Layer Unified Input Winograd Architecture
    Li, Jie
    Liang, Yong
    Yang, Zhenhao
    Li, Xinhai
    ELECTRONICS, 2025, 14 (06):
  • [35] Novel Reconfigurable Hardware Accelerator for Protein Sequence Alignment Using Smith-Waterman Algorithm
    Ibrahim, Atef
    Elsimary, Hamed
    Aljumah, Abdullah
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (03) : 683 - 690
  • [36] Design of dynamically reconfigurable real-time software using port-based objects
    Stewart, DB
    Volpe, RA
    Khosla, PK
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1997, 23 (12) : 759 - 776
  • [37] Sanger: A Co-Design Framework for Enabling Sparse Attention using Reconfigurable Architecture
    Lu, Liqiang
    Jin, Yicheng
    Bi, Hangrui
    Luo, Zizhang
    Li, Peng
    Wang, Tao
    Liang, Yun
    PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021, 2021, : 977 - 991
  • [38] A Survey of Cognitive Radio Reconfigurable Antenna Design and Proposed Design using Genetic Algorithm
    Aizaz, Zainab
    Sinha, Poonam
    2016 IEEE STUDENTS' CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER SCIENCE (SCEECS), 2016,
  • [39] Design of Ka band Reconfigurable Beam Antenna using Genetic Algorithm
    Uhm, Manseok
    Yun, Sohyeun
    Lee, Hongyeal
    Kwak, Changsoo
    Shin, Donghwan
    Yom, Inbok
    2013 ASIA-PACIFIC MICROWAVE CONFERENCE PROCEEDINGS (APMC 2013), 2013, : 1082 - 1084
  • [40] Design Method of Single-Flux-Quantum Logic Circuits Using Dynamically Reconfigurable Logic Gates
    Nishimoto, Shohei
    Yamanashi, Yuki
    Yoshikawa, Nobuyuki
    IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY, 2015, 25 (03)