A Pipelined and Scalable Dataflow Implementation of Convolutional Neural Networks on FPGA

被引:11
|
作者
Bacis, Marco [1 ]
Natale, Giuseppe [1 ]
Del Sozzo, Emanuele [1 ]
Santambrogio, Marco Domenico [1 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, Milan, Italy
关键词
Field Programmable Gate Arrays; Convolutional Neural Networks; Dataflow Architectures; COPROCESSOR; PERFORMANCE;
D O I
10.1109/IPDPSW.2017.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Network (CNN) is a deep learning algorithm extended from Artificial Neural Network (ANN) and widely used for image classification and recognition, thanks to its invariance to distortions. The recent rapid growth of applications based on deep learning algorithms, especially in the context of Big Data analytics, has dramatically improved both industrial and academic research and exploration of optimized implementations of CNNs on accelerators such as GPUs, FPGAs and ASICs, as general purpose processors can hardly meet the ever increasing performance and energy-efficiency requirements. FPGAs in particular are one of the most attractive alternative, as they allow the exploitation of the implicit parallelism of the algorithm and the acceleration of the different layers of a CNN with custom optimizations, while retaining extreme flexibility thanks to their reconfigurability. In this work, we propose a methodology to implement CNNs on FPGAs in a modular, scalable way. This is done by exploiting the dataflow pattern of convolutions, using an approach derived from previous work on the acceleration of Iterative Stencil Loops (ISLs), a computational pattern that shares some characteristics with convolutions. Furthermore, this approach allows the implementation of a high-level pipeline between the different network layers, resulting in an increase of the overall performance when the CNN is employed to process batches of multiple images, as it would happen in real-life scenarios.
引用
收藏
页码:90 / 97
页数:8
相关论文
共 50 条
  • [41] An Adaptive Row-based Weight Reuse Scheme for FPGA Implementation of Convolutional Neural Networks
    Je, Hyeonseung
    Duy Thanh Nguyen
    Lee, Kyujoong
    Lee, Hyuk-Jae
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [42] DSIP: A Scalable Inference Accelerator for Convolutional Neural Networks
    Jo, Jihyuck
    Cha, Soyoung
    Rho, Dayoung
    Park, In-Cheol
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2018, 53 (02) : 605 - 618
  • [43] Optimizing Fully Spectral Convolutional Neural Networks on FPGA
    Liu, Shuanglong
    Luk, Wayne
    2020 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2020), 2020, : 39 - 47
  • [44] Efficient Utilization of FPGA Multipliers for Convolutional Neural Networks
    Boulasikis, M. A.
    Birbas, M.
    Tsafas, N.
    Kanakaris, N.
    2021 10TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2021,
  • [45] Efficient Design of Pruned Convolutional Neural Networks on FPGA
    Vestias, Mario
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (05): : 531 - 544
  • [46] Optimized Compression for Implementing Convolutional Neural Networks on FPGA
    Zhang, Min
    Li, Linpeng
    Wang, Hai
    Liu, Yan
    Qin, Hongbo
    Zhao, Wei
    ELECTRONICS, 2019, 8 (03)
  • [47] FastWave: Accelerating Autoregressive Convolutional Neural Networks on FPGA
    Hussain, Shehzeen
    Javaheripi, Mojan
    Neekhara, Paarth
    Kastner, Ryan
    Koushanfar, Farinaz
    2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,
  • [48] Efficient Design of Pruned Convolutional Neural Networks on FPGA
    Mário Véstias
    Journal of Signal Processing Systems, 2021, 93 : 531 - 544
  • [49] Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
    DiCecco, Roberto
    Lacey, Griffin
    Vasiljevic, Jasmina
    Chow, Paul
    Taylor, Graham
    Areibi, Shawki
    2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 265 - 268
  • [50] A Method for Accelerating Convolutional Neural Networks Based on FPGA
    Zhao, Mengxing
    Li, Xiang
    Zhu, Shunyi
    Zhou, Li
    2019 4TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION SYSTEMS (ICCIS 2019), 2019, : 241 - 246