High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network

被引:0
|
作者
Vardhana, M. [1 ,2 ]
Pinto, Rohan [3 ]
机构
[1] Qualcomm India Private Ltd, Bangalore 560037, India
[2] Visvesvaraya Technol Univ, St Joseph Engn Coll, Belagavi 590018, India
[3] Visvesvaraya Technol Univ, St Joseph Engn Coll, Fac Elect & Commun Engn, Belagavi 590018, India
关键词
CNN; accelerator; winograd; inference; ALGORITHM;
D O I
10.1109/LCA.2025.3525970
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks are deployed mostly on GPUs or CPUs. However, due to the increasing complexity of architecture and growing performance requirements, these platforms may not be suitable for deploying inference engines. ASIC and FPGA implementations are appearing as superior alternatives to software-based solutions for achieving the required performance. In this article, an efficient architecture for accelerating convolution using the Winograd transform is proposed and implemented on FPGA. The proposed accelerator consumes 38% less resources as compared with conventional GEMM-based implementation. Analysis results indicate that our accelerator can achieve 3.5 TOP/s, 1.28 TOP/s, and 1.42 TOP/s for VGG16, ResNet18, and MobileNetV2 CNNs, respectively, at 250 MHz. The proposed accelerator demonstrates the best energy efficiency as compared with prior arts.
引用
收藏
页码:21 / 24
页数:4
相关论文
共 50 条
  • [31] Reconfigurable Network-on-Chip based Convolutional Neural Network Accelerator
    Firuzan, Arash
    Modarressi, Mehdi
    Reshadi, Midia
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 129
  • [32] High-Performance FPGA-based Accelerator for Bayesian Neural Networks
    Fan, Hongxiang
    Ferianc, Martin
    Rodrigues, Miguel
    Zhou, Hongyu
    Niu, Xinyu
    Luk, Wayne
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 1063 - 1068
  • [33] High-Performance and Robust Binarized Neural Network Accelerator Based on Modified Content-Addressable Memory
    Choi, Sureum
    Jeon, Youngjun
    Seo, Yeongkyo
    ELECTRONICS, 2022, 11 (17)
  • [34] Convolutional Neural Network Accelerator Architecture Design for Ultimate Edge Computing Scenario
    Wu, Ruidong
    Liu, Bing
    Fu, Ping
    Ji, Xinglong
    Lu, Wenshuai
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 1933 - 1943
  • [35] Memristor based spiking neural network accelerator architecture
    Wu Chang-Chun
    Zhou Pu-Jun
    Wang Jun-Jie
    Li Guo
    Hu Shao-Gang
    Yu Qi
    Liu Yang
    ACTA PHYSICA SINICA, 2022, 71 (14)
  • [36] High performance reconfigurable accelerator for deep convolutional neural networks
    Qiao R.
    Chen G.
    Gong G.
    Lu H.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (03): : 130 - 139
  • [37] Convolutional neural network for high-performance reservoir computing using dynamic memristors
    Byun, Yongjin
    So, Hyojin
    Kim, Sungjun
    CHAOS SOLITONS & FRACTALS, 2024, 188
  • [38] Steganalysis of convolutional neural network based on neural architecture search
    Hongbo Wang
    Xingyu Pan
    Lingyan Fan
    Shuofeng Zhao
    Multimedia Systems, 2021, 27 : 379 - 387
  • [39] Steganalysis of convolutional neural network based on neural architecture search
    Wang, Hongbo
    Pan, Xingyu
    Fan, Lingyan
    Zhao, Shuofeng
    MULTIMEDIA SYSTEMS, 2021, 27 (03) : 379 - 387
  • [40] SpWMM: A High-Performance Sparse-Winograd Matrix-Matrix Multiplication Accelerator for CNNs
    Wu, Di
    Cao, Wei
    Wang, Lingli
    2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 255 - 258