High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network

被引：0

作者：

Vardhana, M. ^{[1
,2
]}

Pinto, Rohan ^{[3
]}

机构：

[1] Qualcomm India Private Ltd, Bangalore 560037, India

[2] Visvesvaraya Technol Univ, St Joseph Engn Coll, Belagavi 590018, India

[3] Visvesvaraya Technol Univ, St Joseph Engn Coll, Fac Elect & Commun Engn, Belagavi 590018, India

来源：

IEEE COMPUTER ARCHITECTURE LETTERS | 2025年 / 24卷 / 01期

关键词：

CNN; accelerator; winograd; inference; ALGORITHM;

D O I：

10.1109/LCA.2025.3525970

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional Neural Networks are deployed mostly on GPUs or CPUs. However, due to the increasing complexity of architecture and growing performance requirements, these platforms may not be suitable for deploying inference engines. ASIC and FPGA implementations are appearing as superior alternatives to software-based solutions for achieving the required performance. In this article, an efficient architecture for accelerating convolution using the Winograd transform is proposed and implemented on FPGA. The proposed accelerator consumes 38% less resources as compared with conventional GEMM-based implementation. Analysis results indicate that our accelerator can achieve 3.5 TOP/s, 1.28 TOP/s, and 1.42 TOP/s for VGG16, ResNet18, and MobileNetV2 CNNs, respectively, at 250 MHz. The proposed accelerator demonstrates the best energy efficiency as compared with prior arts.

引用

页码：21 / 24

页数：4

共 50 条

[31] Reconfigurable Network-on-Chip based Convolutional Neural Network Accelerator
Firuzan, Arash
Modarressi, Mehdi
Reshadi, Midia
JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 129
[32] High-Performance FPGA-based Accelerator for Bayesian Neural Networks
Fan, Hongxiang
Ferianc, Martin
Rodrigues, Miguel
Zhou, Hongyu
Niu, Xinyu
Luk, Wayne
2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 1063 - 1068
[33] High-Performance and Robust Binarized Neural Network Accelerator Based on Modified Content-Addressable Memory
Choi, Sureum
Jeon, Youngjun
Seo, Yeongkyo
ELECTRONICS, 2022, 11 (17)
[34] Convolutional Neural Network Accelerator Architecture Design for Ultimate Edge Computing Scenario
Wu, Ruidong
Liu, Bing
Fu, Ping
Ji, Xinglong
Lu, Wenshuai
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 1933 - 1943
[35] Memristor based spiking neural network accelerator architecture
Wu Chang-Chun
Zhou Pu-Jun
Wang Jun-Jie
Li Guo
Hu Shao-Gang
Yu Qi
Liu Yang
ACTA PHYSICA SINICA, 2022, 71 (14)
[36] High performance reconfigurable accelerator for deep convolutional neural networks
Qiao R.
Chen G.
Gong G.
Lu H.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (03): : 130 - 139
[37] Convolutional neural network for high-performance reservoir computing using dynamic memristors
Byun, Yongjin
So, Hyojin
Kim, Sungjun
CHAOS SOLITONS & FRACTALS, 2024, 188
[38] Steganalysis of convolutional neural network based on neural architecture search
Hongbo Wang
Xingyu Pan
Lingyan Fan
Shuofeng Zhao
Multimedia Systems, 2021, 27 : 379 - 387
[39] Steganalysis of convolutional neural network based on neural architecture search
Wang, Hongbo
Pan, Xingyu
Fan, Lingyan
Zhao, Shuofeng
MULTIMEDIA SYSTEMS, 2021, 27 (03) : 379 - 387
[40] SpWMM: A High-Performance Sparse-Winograd Matrix-Matrix Multiplication Accelerator for CNNs
Wu, Di
Cao, Wei
Wang, Lingli
2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 255 - 258

← 1 2 3 4 5 →