Efficient Hardware Acceleration of Convolutional Neural Networks

被引:0
|
作者
Kala, S. [1 ]
Jose, Babita R. [1 ]
Mathew, Jimson [2 ]
Nalesh, S. [3 ]
机构
[1] Cochin Univ Sci & Technol, Sch Engn, Kochi, Kerala, India
[2] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
[3] Cochin Univ Sci & Technol, Dept Elect, Cochin, Kerala, India
关键词
Convolutional neural networks; FPGA; high performance; Winograd algorithm;
D O I
10.1109/SOCC46988.2019.1570573948
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNNs) have emerged as the most efficient technique for solving a host of machine learning tasks. Compute and memory intensive nature of CNN has stimulated lot of work in hardware acceleration of these network models. FPGAs have emerged as a promising approach for accelerating CNNs, due to its high performance, flexibility and energy efficiency. We propose a unified architecture named UniWiG, where both Winograd based convolution and general matrix multiplication (GEMM) can be accelerated using the same set of processing elements. Proposed architecture has been used to accelerate AlexNet and VGG-16 models on FPGA with a performance of 433.63 GOPS and 407.23 GOPS respectively. We have also analyzed the performance with varying Winograd tile sizes and found out the most appropriate tile sizes for maximizing the performance while reducing on-chip memory resource.
引用
收藏
页码:191 / 192
页数:2
相关论文
共 50 条
  • [1] A Survey on Efficient Convolutional Neural Networks and Hardware Acceleration
    Ghimire, Deepak
    Kil, Dayoung
    Kim, Seong-heum
    [J]. ELECTRONICS, 2022, 11 (06)
  • [2] Efficient Hardware Acceleration of Sparsely Active Convolutional Spiking Neural Networks
    Sommer, Jan
    Ozkan, M. Akif
    Keszocze, Oliver
    Teich, Juergen
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) : 3767 - 3778
  • [3] Binarized Convolutional Neural Networks with Separable Filters for Efficient Hardware Acceleration
    Lin, Jeng-Hau
    Xing, Tianwei
    Zhao, Ritchie
    Zhang, Zhiru
    Srivastava, Mani
    Tu, Zhuowen
    Gupta, Rajesh K.
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 344 - 352
  • [4] Compressing Sparse Ternary Weight Convolutional Neural Networks for Efficient Hardware Acceleration
    Wi, Hyeonwook
    Kim, Hyeonuk
    Choi, Seungkyu
    Kim, Lee-Sup
    [J]. 2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
  • [5] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
    Qin, Huabiao
    Cao, Qinping
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2019, 41 (11): : 2599 - 2605
  • [6] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
    Qin Huabiao
    Cao Qinping
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2599 - 2605
  • [7] A Configurable and Versatile Architecture for Low Power, Energy Efficient Hardware Acceleration of Convolutional Neural Networks
    Christensen, Steinar Thune
    Aunet, Snorre
    Qadir, Omer
    [J]. 2019 IEEE NORDIC CIRCUITS AND SYSTEMS CONFERENCE (NORCAS) - NORCHIP AND INTERNATIONAL SYMPOSIUM OF SYSTEM-ON-CHIP (SOC), 2019,
  • [8] An Efficient Reconfigurable Hardware Accelerator for Convolutional Neural Networks
    Ansari, Anaam
    Gunnam, Kiran
    Ogunfunmi, Tokunbo
    [J]. 2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 1337 - 1341
  • [9] Snowflake: An Efficient Hardware Accelerator for Convolutional Neural Networks
    Gokhale, Vinayak
    Zaidy, Aliasger
    Chang, Andre Xian Ming
    Culurciello, Eugenio
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 2082 - 2085
  • [10] Efficient Reconfigurable Hardware Core for Convolutional Neural Networks
    Wang, Haonan
    Lin, Jun
    Xie, Yi
    Yuan, Bo
    Wang, Zhongfeng
    [J]. 2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 777 - 781