High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network

被引:0
|
作者
Vardhana, M. [1 ,2 ]
Pinto, Rohan [3 ]
机构
[1] Qualcomm India Private Ltd, Bangalore 560037, India
[2] Visvesvaraya Technol Univ, St Joseph Engn Coll, Belagavi 590018, India
[3] Visvesvaraya Technol Univ, St Joseph Engn Coll, Fac Elect & Commun Engn, Belagavi 590018, India
关键词
CNN; accelerator; winograd; inference; ALGORITHM;
D O I
10.1109/LCA.2025.3525970
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks are deployed mostly on GPUs or CPUs. However, due to the increasing complexity of architecture and growing performance requirements, these platforms may not be suitable for deploying inference engines. ASIC and FPGA implementations are appearing as superior alternatives to software-based solutions for achieving the required performance. In this article, an efficient architecture for accelerating convolution using the Winograd transform is proposed and implemented on FPGA. The proposed accelerator consumes 38% less resources as compared with conventional GEMM-based implementation. Analysis results indicate that our accelerator can achieve 3.5 TOP/s, 1.28 TOP/s, and 1.42 TOP/s for VGG16, ResNet18, and MobileNetV2 CNNs, respectively, at 250 MHz. The proposed accelerator demonstrates the best energy efficiency as compared with prior arts.
引用
收藏
页码:21 / 24
页数:4
相关论文
共 50 条
  • [21] SpWA: An Efficient Sparse Winograd Convolutional Neural Networks Accelerator on FPGAs
    Lu, Liqiang
    Liang, Yun
    2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [22] Scalable High-Performance Architecture for Convolutional Ternary Neural Networks on FPGA
    Prost-Boucle, Adrien
    Bourge, Alban
    Petrot, Frederic
    Alemdar, Hande
    Caldwell, Nicholas
    Leroy, Vincent
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [23] Design and implementation of convolution neural network accelerator for Winograd algorithm based on FPGA
    Niu Zhao-xu
    Sun Hai-jiang
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (11) : 1521 - 1530
  • [24] A High Performance Reconfigurable Hardware Architecture for Lightweight Convolutional Neural Network
    An, Fubang
    Wang, Lingli
    Zhou, Xuegong
    ELECTRONICS, 2023, 12 (13)
  • [25] SWM: A High-Performance Sparse-Winograd Matrix Multiplication CNN Accelerator
    Wu, Di
    Fan, Xitian
    Cao, Wei
    Wang, Lingli
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (05) : 936 - 949
  • [26] Improving the Performance of OpenCL-based FPGA Accelerator for Convolutional Neural Network
    Zhang, Jialiang
    Li, Jing
    FPGA'17: PROCEEDINGS OF THE 2017 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2017, : 25 - 34
  • [27] High Speed Winograd Convolutional Circuit for Convolutional Neural Networks
    Chen, Ting-Wei
    Hsieh, Hao-An
    Fan, Yu-Cheng
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 347 - 348
  • [28] High-Performance and Robust Spintronic/CNTFET-Based Binarized Neural Network Hardware Accelerator
    Nasab, Milad Tanavardi
    Amirany, Abdolah
    Moaiyeri, Mohammad Hossein
    Jafari, Kian
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2023, 11 (02) : 527 - 533
  • [29] A LOW-LATENCY SPARSE-WINOGRAD ACCELERATOR FOR CONVOLUTIONAL NEURAL NETWORKS
    Wang, Haonan
    Liu, Wenjian
    Xu, Tianyi
    Lin, Jun
    Wang, Zhongfeng
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1448 - 1452
  • [30] A High Efficient Architecture for Convolution Neural Network Accelerator
    Kong Anmin
    Zhao Bin
    2019 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2019), 2019, : 131 - 134