High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network

被引：0

作者：

Vardhana, M. ^{[1
,2
]}

Pinto, Rohan ^{[3
]}

机构：

[1] Qualcomm India Private Ltd, Bangalore 560037, India

[2] Visvesvaraya Technol Univ, St Joseph Engn Coll, Belagavi 590018, India

[3] Visvesvaraya Technol Univ, St Joseph Engn Coll, Fac Elect & Commun Engn, Belagavi 590018, India

来源：

IEEE COMPUTER ARCHITECTURE LETTERS | 2025年 / 24卷 / 01期

关键词：

CNN; accelerator; winograd; inference; ALGORITHM;

D O I：

10.1109/LCA.2025.3525970

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional Neural Networks are deployed mostly on GPUs or CPUs. However, due to the increasing complexity of architecture and growing performance requirements, these platforms may not be suitable for deploying inference engines. ASIC and FPGA implementations are appearing as superior alternatives to software-based solutions for achieving the required performance. In this article, an efficient architecture for accelerating convolution using the Winograd transform is proposed and implemented on FPGA. The proposed accelerator consumes 38% less resources as compared with conventional GEMM-based implementation. Analysis results indicate that our accelerator can achieve 3.5 TOP/s, 1.28 TOP/s, and 1.42 TOP/s for VGG16, ResNet18, and MobileNetV2 CNNs, respectively, at 250 MHz. The proposed accelerator demonstrates the best energy efficiency as compared with prior arts.

引用

页码：21 / 24

页数：4

共 50 条

[21] SpWA: An Efficient Sparse Winograd Convolutional Neural Networks Accelerator on FPGAs
Lu, Liqiang
Liang, Yun
2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
[22] Scalable High-Performance Architecture for Convolutional Ternary Neural Networks on FPGA
Prost-Boucle, Adrien
Bourge, Alban
Petrot, Frederic
Alemdar, Hande
Caldwell, Nicholas
Leroy, Vincent
2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
[23] Design and implementation of convolution neural network accelerator for Winograd algorithm based on FPGA
Niu Zhao-xu
Sun Hai-jiang
CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (11) : 1521 - 1530
[24] A High Performance Reconfigurable Hardware Architecture for Lightweight Convolutional Neural Network
An, Fubang
Wang, Lingli
Zhou, Xuegong
ELECTRONICS, 2023, 12 (13)
[25] SWM: A High-Performance Sparse-Winograd Matrix Multiplication CNN Accelerator
Wu, Di
Fan, Xitian
Cao, Wei
Wang, Lingli
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (05) : 936 - 949
[26] Improving the Performance of OpenCL-based FPGA Accelerator for Convolutional Neural Network
Zhang, Jialiang
Li, Jing
FPGA'17: PROCEEDINGS OF THE 2017 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2017, : 25 - 34
[27] High Speed Winograd Convolutional Circuit for Convolutional Neural Networks
Chen, Ting-Wei
Hsieh, Hao-An
Fan, Yu-Cheng
2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 347 - 348
[28] High-Performance and Robust Spintronic/CNTFET-Based Binarized Neural Network Hardware Accelerator
Nasab, Milad Tanavardi
Amirany, Abdolah
Moaiyeri, Mohammad Hossein
Jafari, Kian
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2023, 11 (02) : 527 - 533
[29] A LOW-LATENCY SPARSE-WINOGRAD ACCELERATOR FOR CONVOLUTIONAL NEURAL NETWORKS
Wang, Haonan
Liu, Wenjian
Xu, Tianyi
Lin, Jun
Wang, Zhongfeng
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1448 - 1452
[30] A High Efficient Architecture for Convolution Neural Network Accelerator
Kong Anmin
Zhao Bin
2019 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2019), 2019, : 131 - 134

← 1 2 3 4 5 →