High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network

被引:0
|
作者
Vardhana, M. [1 ,2 ]
Pinto, Rohan [3 ]
机构
[1] Qualcomm India Private Ltd, Bangalore 560037, India
[2] Visvesvaraya Technol Univ, St Joseph Engn Coll, Belagavi 590018, India
[3] Visvesvaraya Technol Univ, St Joseph Engn Coll, Fac Elect & Commun Engn, Belagavi 590018, India
关键词
CNN; accelerator; winograd; inference; ALGORITHM;
D O I
10.1109/LCA.2025.3525970
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks are deployed mostly on GPUs or CPUs. However, due to the increasing complexity of architecture and growing performance requirements, these platforms may not be suitable for deploying inference engines. ASIC and FPGA implementations are appearing as superior alternatives to software-based solutions for achieving the required performance. In this article, an efficient architecture for accelerating convolution using the Winograd transform is proposed and implemented on FPGA. The proposed accelerator consumes 38% less resources as compared with conventional GEMM-based implementation. Analysis results indicate that our accelerator can achieve 3.5 TOP/s, 1.28 TOP/s, and 1.42 TOP/s for VGG16, ResNet18, and MobileNetV2 CNNs, respectively, at 250 MHz. The proposed accelerator demonstrates the best energy efficiency as compared with prior arts.
引用
收藏
页码:21 / 24
页数:4
相关论文
共 50 条
  • [1] High-performance Convolutional Neural Network Accelerator Based on Systolic Arrays and Quantization
    Li, Yufeng
    Lu, Shengli
    Luo, Jihe
    Pang, Wei
    Liu, Hao
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 335 - 339
  • [2] WRA-SS: A High-Performance Accelerator Integrating Winograd With Structured Sparsity for Convolutional Neural Networks
    Yang, Chen
    Meng, Yishuo
    Xi, Jiawei
    Xiang, Siwei
    Wang, Jianfei
    Mei, Kuizhi
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 32 (01) : 164 - 177
  • [3] A reconfigurable accelerator based on fast Winograd algorithm for convolutional neural network in Internet of Things
    Yang, Chen
    Wang, YiZhou
    Wang, XiaoLi
    Geng, Li
    2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 613 - 615
  • [4] High-Performance CNN Accelerator on FPGA Using Unified Winograd-GEMM Architecture
    Kala, S.
    Jose, Babita R.
    Mathew, Jimson
    Nalesh, S.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (12) : 2816 - 2828
  • [5] A Winograd-Based Integrated Photonics Accelerator for Convolutional Neural Networks
    Mehrabian, Armin
    Miscuglio, Mario
    Alkabani, Yousra
    Sorger, Volker J.
    El-Ghazawi, Tarek
    IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2020, 26 (01)
  • [6] A High-Performance VLSI Architecture for a Self-Feedback Convolutional Neural Network
    Parmar, Yashrajsinh
    Sridharan, K.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (01) : 456 - 460
  • [7] An Efficient Convolutional Neural Network Accelerator Design on FPGA Using the Layer-to-Layer Unified Input Winograd Architecture
    Li, Jie
    Liang, Yong
    Yang, Zhenhao
    Li, Xinhai
    ELECTRONICS, 2025, 14 (06):
  • [8] An Efficient Accelerator with Winograd for Novel Convolutional Neural Networks
    Lin, Zhijian
    Zhang, Meng
    Weng, Dongpeng
    Liu, Fei
    2022 5TH INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS AND SIMULATION (ICCSS 2022), 2022, : 126 - 130
  • [9] VHDL Generator for A High Performance Convolutional Neural Network FPGA-Based Accelerator
    Hamdan, Muhammad K.
    Rover, Diane T.
    2017 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2017,
  • [10] A High-efficiency FPGA-based Accelerator for Convolutional Neural Networks using Winograd Algorithm
    Huang, Y.
    Shen, J.
    Wang, Z.
    Wen, M.
    Zhang, C.
    2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND CONTROL ENGINEERING (ICECC), 2018, 1026