Implementation of deep neural networks on FPGA-CPU platform using Xilinx SDSOC

Cited by: 5
Authors
Hassan, Rania O. [1]
Mostafa, Hassan [1, 2]
Affiliations
[1] Cairo Univ, Fac Engn, Elect & Elect Commun, Giza, Egypt
[2] Univ Sci & Technol, Nanotechnol & Nanoelect Program, Giza 12578, Egypt
Keywords
Convolutional neural networks (CNNs); Alex-Net; Accelerating CNNs; FPGA; Virtex; HW/SW co-design partitioning; SDSOC; HLS
DOI
10.1007/s10470-020-01638-5
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Deep convolutional neural networks (CNNs) are the state of the art for image classification because of their high accuracy, but their high computational complexity is costly. Accelerating them so that they can be used in real-time applications is a current goal in this field. Graphics processing units (GPUs) are one solution, but their high power consumption prevents their use in everyday equipment, whereas the field-programmable gate array (FPGA) has low power consumption and a flexible architecture that suits CNN implementations better. This work addresses this problem and provides a solution that trades off the speed of the CNN against the power consumption of the FPGA. The solution relies on two main speed-up techniques: parallelism of layer resources and pipelining inside some layers. In addition, we add a new methodology that balances area requirements against speed and design time by implementing the CNN with the Xilinx SDSOC tool (with the processor and the FPGA on the same board). Implementing the design with HW/SW partitioning, based on a high-level language (C or C++) in Vivado HLS (High-Level Synthesis), shortens design time. It also accommodates larger designs than an FPGA-only implementation.
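The two speed-up techniques named in the abstract, pipelining inside a layer and parallelism of layer resources, can be illustrated with a minimal C++ sketch in the style accepted by Vivado HLS. This is not the authors' code: the function name `conv1d`, the array sizes, and the choice of a 1-D convolution are assumptions made only for illustration. `PIPELINE` and `UNROLL` are standard Vivado HLS directives; an ordinary C++ compiler ignores them, so the same source also runs in software.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical sizes chosen for illustration; not taken from the paper.
constexpr std::size_t kInputLen = 8;
constexpr std::size_t kKernelLen = 3;
constexpr std::size_t kOutputLen = kInputLen - kKernelLen + 1;

// 1-D "valid" convolution in the cross-correlation form common in CNNs.
void conv1d(const int input[kInputLen],
            const int kernel[kKernelLen],
            int output[kOutputLen]) {
    // Precondition check for the software build.
    assert(input != nullptr && kernel != nullptr && output != nullptr);

    for (std::size_t i = 0; i < kOutputLen; ++i) {
        // Pipelining: asks the tool to start a new output every clock cycle.
#pragma HLS PIPELINE II=1
        int acc = 0;
        for (std::size_t k = 0; k < kKernelLen; ++k) {
            // Parallelism: asks the tool to replicate the multiply-accumulate
            // hardware so all kernel taps are computed concurrently.
#pragma HLS UNROLL
            acc += input[i + k] * kernel[k];
        }
        output[i] = acc;
    }
}
```

Whether the tool actually reaches the requested initiation interval depends on memory ports and available resources, which is the area-versus-speed trade-off the abstract describes.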
Pages: 399-408
Number of pages: 10