A Pipelined and Scalable Dataflow Implementation of Convolutional Neural Networks on FPGA

被引:11
|
作者
Bacis, Marco [1 ]
Natale, Giuseppe [1 ]
Del Sozzo, Emanuele [1 ]
Santambrogio, Marco Domenico [1 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, Milan, Italy
关键词
Field Programmable Gate Arrays; Convolutional Neural Networks; Dataflow Architectures; COPROCESSOR; PERFORMANCE;
D O I
10.1109/IPDPSW.2017.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Network (CNN) is a deep learning algorithm extended from Artificial Neural Network (ANN) and widely used for image classification and recognition, thanks to its invariance to distortions. The recent rapid growth of applications based on deep learning algorithms, especially in the context of Big Data analytics, has dramatically improved both industrial and academic research and exploration of optimized implementations of CNNs on accelerators such as GPUs, FPGAs and ASICs, as general purpose processors can hardly meet the ever increasing performance and energy-efficiency requirements. FPGAs in particular are one of the most attractive alternative, as they allow the exploitation of the implicit parallelism of the algorithm and the acceleration of the different layers of a CNN with custom optimizations, while retaining extreme flexibility thanks to their reconfigurability. In this work, we propose a methodology to implement CNNs on FPGAs in a modular, scalable way. This is done by exploiting the dataflow pattern of convolutions, using an approach derived from previous work on the acceleration of Iterative Stencil Loops (ISLs), a computational pattern that shares some characteristics with convolutions. Furthermore, this approach allows the implementation of a high-level pipeline between the different network layers, resulting in an increase of the overall performance when the CNN is employed to process batches of multiple images, as it would happen in real-life scenarios.
引用
下载
收藏
页码:90 / 97
页数:8
相关论文
共 50 条
  • [1] A Scalable FPGA Accelerator for Convolutional Neural Networks
    Xu, Ke
    Wang, Xiaoyun
    Fu, Shihang
    Wang, Dong
    ADVANCED COMPUTER ARCHITECTURE, 2018, 908 : 3 - 14
  • [2] Efficient Implementation of Convolutional Neural Networks on FPGA
    Hadnagy, A.
    Feher, B.
    Kovacshazy, T.
    2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2018, : 359 - 364
  • [3] FPGA Implementation and Acceleration of Convolutional Neural Networks
    Pisharody, Jayanth N.
    Pranav, K. B.
    Ranjitha, M.
    Rajeshwari, B.
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [4] Noise Convolutional Neural Networks and FPGA Implementation
    Munakala, Atsuki
    Nakahara, IIiroki
    Sato, Shimpei
    2019 IEEE 49TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL), 2019, : 85 - 90
  • [5] A Scalable Pipelined Dataflow Accelerator for Object Region Proposals on FPGA Platform
    Fu, Wenzhi
    Yang, Jianlei
    Dai, Pengcheng
    Chen, Yiran
    Zhao, Weisheng
    2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 349 - 352
  • [6] Fully Pipelined FPGA Acceleration of Binary Convolutional Neural Networks with Neural Architecture Search
    Ji, Mengfei
    Al-Ars, Zaid
    Chang, Yuchun
    Zhang, Baolin
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (10)
  • [7] Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks
    Ma, Yufei
    Cao, Yu
    Vrudhula, Sarma
    Seo, Jae-sun
    FPGA'17: PROCEEDINGS OF THE 2017 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2017, : 45 - 54
  • [8] Fast and Efficient Implementation of Convolutional Neural Networks on FPGA
    Podili, Abhinav
    Zhang, Chi
    Prasanna, Viktor
    2017 IEEE 28TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2017, : 11 - 18
  • [9] Acceleration and implementation of convolutional neural networks based on FPGA
    Zhao, Sijie
    Gao, Shangshang
    Wang, Rugang
    Wang, Yuanyuan
    Zhou, Feng
    Guo, Naihong
    DIGITAL SIGNAL PROCESSING, 2023, 141
  • [10] Convolutional Neural Networks on Dataflow Engines
    Voss, Nils
    Bacis, Marco
    Mencer, Oskar
    Gaydadjiev, Georgi
    Luk, Wayne
    2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2017, : 435 - 438