PhD Forum: Towards Embedded Heterogeneous FPGA-GPU Smart Camera Architectures for CNN Inference

Cited by: 0
Authors
Carballo-Hernandez, Walther [1]
Berry, Francois [1]
Pelcat, Maxime [2]
Arias-Estrada, Miguel [3]
Affiliations
[1] Inst Pascal, Dept Images Percept Syst & Robot, Aubiere, France
[2] UMR CNRS, Inst Natl Sci Appliquees INSA Rennes, IETR, Dept Images, Rennes, France
[3] INAOE, Dept Comp Sci, Puebla, Mexico
Keywords
Heterogeneous Computing; Edge Computing; Internet of Things; Parallel Programming; Single Instruction Multiple Data; Pipelining; Models of Computation and Architecture
DOI
10.1145/3349801.3357136
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The success of Deep Learning (DL) algorithms in computer vision tasks has created an ongoing demand for dedicated hardware architectures that can keep up with their computation and memory requirements. Meeting this demand is particularly challenging on embedded smart camera platforms, where resources such as power, Processing Elements (PEs), and communication are constrained. This article describes a heterogeneous system embedding an FPGA and a GPU for executing CNN inference in computer vision applications. The system addresses several challenges of embedded CNN inference, such as task and data partitioning and workload balancing. The selected heterogeneous platform combines an Nvidia (R) Jetson TX2 on the CPU-GPU side with an Intel Altera (R) Cyclone10GX on the FPGA side, interconnected by PCIe Gen2, and uses a MIPI-CSI camera for prototyping. This test environment will support future work on a methodology for optimized model partitioning.
Pages: 2