High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network

被引:0
|
作者
Vardhana, M. [1 ,2 ]
Pinto, Rohan [3 ]
机构
[1] Qualcomm India Private Ltd, Bangalore 560037, India
[2] Visvesvaraya Technol Univ, St Joseph Engn Coll, Belagavi 590018, India
[3] Visvesvaraya Technol Univ, St Joseph Engn Coll, Fac Elect & Commun Engn, Belagavi 590018, India
关键词
CNN; accelerator; winograd; inference; ALGORITHM;
D O I
10.1109/LCA.2025.3525970
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks are deployed mostly on GPUs or CPUs. However, due to the increasing complexity of architecture and growing performance requirements, these platforms may not be suitable for deploying inference engines. ASIC and FPGA implementations are appearing as superior alternatives to software-based solutions for achieving the required performance. In this article, an efficient architecture for accelerating convolution using the Winograd transform is proposed and implemented on FPGA. The proposed accelerator consumes 38% less resources as compared with conventional GEMM-based implementation. Analysis results indicate that our accelerator can achieve 3.5 TOP/s, 1.28 TOP/s, and 1.42 TOP/s for VGG16, ResNet18, and MobileNetV2 CNNs, respectively, at 250 MHz. The proposed accelerator demonstrates the best energy efficiency as compared with prior arts.
引用
收藏
页码:21 / 24
页数:4
相关论文
共 50 条
  • [41] High-performance one-stage detector for SiC crystal defects based on convolutional neural network
    Shi, Haochen
    Jin, Zhiyuan
    Tang, Wenjing
    Wang, Jing
    Jiang, Kai
    Xu, Mingsheng
    Xia, Wei
    Xu, Xiangang
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [42] A High-Performance Downlink Synchronization Algorithm Based on Convolutional Neural Network for 5G Systems
    Li, Xiaohui
    Wang, Xianwen
    Fan, Tao
    Liu, Jiawen
    Wan, Hongjie
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2022, 45 (02): : 117 - 123
  • [43] FPGA-based Convolutional Neural Network Accelerator design using High Level Synthesize
    Ghaffari, Sina
    Sharifian, Saeed
    2016 2ND INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2016, : 29 - 34
  • [44] A Convolutional Neural Network Accelerator Architecture with Fine-Granular Mixed Precision Configurability
    Zhou, Xian
    Zhang, Li
    Guo, Chuliang
    Yin, Xunzhao
    Zhuo, Cheng
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [45] High energy efficiency convolutional neural network accelerator based on switched-capacitor matrix
    Li, Dawei
    Miao, Rong
    Han, Xiao
    Yan, Bonan
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2024, 52 (09): : 23 - 28
  • [46] A High Utilization FPGA-Based Accelerator for Variable-Scale Convolutional Neural Network
    Li, Xin
    Cai, Yujie
    Han, Jun
    Zeng, Xiaoyang
    2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 944 - 947
  • [47] High-performance pipeline architecture for packet classification accelerator in DPU
    Tan, Jing
    Lv, GaoFeng
    Ma, Yanni
    Qiao, GuanJie
    2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT), 2021, : 286 - 289
  • [48] Hybrid Accelerator with MapReduce Architecture for Convolutional Neural Networks
    Mihaita, David
    Stefan, Gheorghe M.
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2017, 20 (03): : 186 - 197
  • [49] Design of a Safe Convolutional Neural Network Accelerator
    Xu, Zheng
    Abraham, Jacob
    2019 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2019), 2019, : 248 - 253
  • [50] Convolutional Neural Network Accelerator with Vector Quantization
    Lee, Heng
    Wu, Yi-Heng
    Lin, Yu-Sheng
    Chien, Shao-Yi
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,