High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network

被引：0

作者：

Vardhana, M. ^{[1
,2
]}

Pinto, Rohan ^{[3
]}

机构：

[1] Qualcomm India Private Ltd, Bangalore 560037, India

[2] Visvesvaraya Technol Univ, St Joseph Engn Coll, Belagavi 590018, India

[3] Visvesvaraya Technol Univ, St Joseph Engn Coll, Fac Elect & Commun Engn, Belagavi 590018, India

来源：

IEEE COMPUTER ARCHITECTURE LETTERS | 2025年 / 24卷 / 01期

关键词：

CNN; accelerator; winograd; inference; ALGORITHM;

D O I：

10.1109/LCA.2025.3525970

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional Neural Networks are deployed mostly on GPUs or CPUs. However, due to the increasing complexity of architecture and growing performance requirements, these platforms may not be suitable for deploying inference engines. ASIC and FPGA implementations are appearing as superior alternatives to software-based solutions for achieving the required performance. In this article, an efficient architecture for accelerating convolution using the Winograd transform is proposed and implemented on FPGA. The proposed accelerator consumes 38% less resources as compared with conventional GEMM-based implementation. Analysis results indicate that our accelerator can achieve 3.5 TOP/s, 1.28 TOP/s, and 1.42 TOP/s for VGG16, ResNet18, and MobileNetV2 CNNs, respectively, at 250 MHz. The proposed accelerator demonstrates the best energy efficiency as compared with prior arts.

引用

页码：21 / 24

页数：4

共 50 条

[41] High-performance one-stage detector for SiC crystal defects based on convolutional neural network
Shi, Haochen
Jin, Zhiyuan
Tang, Wenjing
Wang, Jing
Jiang, Kai
Xu, Mingsheng
Xia, Wei
Xu, Xiangang
KNOWLEDGE-BASED SYSTEMS, 2023, 280
[42] A High-Performance Downlink Synchronization Algorithm Based on Convolutional Neural Network for 5G Systems
Li, Xiaohui
Wang, Xianwen
Fan, Tao
Liu, Jiawen
Wan, Hongjie
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2022, 45 (02): : 117 - 123
[43] FPGA-based Convolutional Neural Network Accelerator design using High Level Synthesize
Ghaffari, Sina
Sharifian, Saeed
2016 2ND INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2016, : 29 - 34
[44] A Convolutional Neural Network Accelerator Architecture with Fine-Granular Mixed Precision Configurability
Zhou, Xian
Zhang, Li
Guo, Chuliang
Yin, Xunzhao
Zhuo, Cheng
2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
[45] High energy efficiency convolutional neural network accelerator based on switched-capacitor matrix
Li, Dawei
Miao, Rong
Han, Xiao
Yan, Bonan
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2024, 52 (09): : 23 - 28
[46] A High Utilization FPGA-Based Accelerator for Variable-Scale Convolutional Neural Network
Li, Xin
Cai, Yujie
Han, Jun
Zeng, Xiaoyang
2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 944 - 947
[47] High-performance pipeline architecture for packet classification accelerator in DPU
Tan, Jing
Lv, GaoFeng
Ma, Yanni
Qiao, GuanJie
2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT), 2021, : 286 - 289
[48] Hybrid Accelerator with MapReduce Architecture for Convolutional Neural Networks
Mihaita, David
Stefan, Gheorghe M.
ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2017, 20 (03): : 186 - 197
[49] Design of a Safe Convolutional Neural Network Accelerator
Xu, Zheng
Abraham, Jacob
2019 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2019), 2019, : 248 - 253
[50] Convolutional Neural Network Accelerator with Vector Quantization
Lee, Heng
Wu, Yi-Heng
Lin, Yu-Sheng
Chien, Shao-Yi
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,

← 1 2 3 4 5 →