Embedded GPU Cluster Computing Framework for Inference of Convolutional Neural Networks

Cited by: 0
Authors
Kain, Evan [1]
Wildenstein, Diego [2]
Pineda, Andrew C. [3]
Affiliations
[1] Univ Pittsburgh, Dept Elect & Comp Engn, NSF Ctr Space Highperformance & Resilient Comp SH, Pittsburgh, PA 15260 USA
[2] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ USA
[3] US Air Force, Spacecraft Component Technol Branch, Space Vehicles Directorate, Res Lab, Kirtland AFB, NM USA
Keywords
DOI
Not available
CLC classification number
TP3 [Computing technology, computer technology];
Subject classification code
0812;
Abstract
The growing need for on-board image processing for space vehicles requires computing solutions that are both low-power and high-performance. Parallel computation using low-power embedded Graphics Processing Units (GPUs) satisfies both requirements. Our experiment uses OpenMPI domain decomposition of an image processing algorithm based on a pre-trained convolutional neural network (CNN) developed by the U.S. Air Force Research Laboratory (AFRL). Our testbed consists of six NVIDIA Jetson TX2 development boards operating in parallel. This parallel framework achieves a speedup of 4.3x on six processing nodes. The approach also leads to a linear decay in parallel efficiency as more processing nodes are added to the network. By replicating the data across processors in addition to distributing it, we also characterize the best-case impact of adding triple modular redundancy (TMR) to our application.
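The quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses no MPI, and the function names (`parallel_efficiency`, `split_rows`, `tmr_vote`), the row-band decomposition, and the 100-row image height are all hypothetical choices made for illustration. Only the speedup of 4.3x and the node count of six come from the abstract.

```python
from collections import Counter


def parallel_efficiency(speedup, nodes):
    """Parallel efficiency: achieved speedup divided by the number of nodes."""
    return speedup / nodes


def split_rows(n_rows, n_parts):
    """Split rows [0, n_rows) into n_parts contiguous (start, stop) bands.

    Assumed decomposition: one horizontal band per node, with any
    remainder rows given to the lowest-numbered bands.
    """
    base, extra = divmod(n_rows, n_parts)
    bands, start = [], 0
    for rank in range(n_parts):
        stop = start + base + (1 if rank < extra else 0)
        bands.append((start, stop))
        start = stop
    return bands


def tmr_vote(a, b, c):
    """Majority vote over three replica outputs; None if all three disagree."""
    value, count = Counter([a, b, c]).most_common(1)[0]
    return value if count >= 2 else None


if __name__ == "__main__":
    # Figures from the abstract: 4.3x speedup on six nodes.
    print(round(parallel_efficiency(4.3, 6), 3))  # 0.717

    # Hypothetical 100-row image divided among six nodes.
    print(split_rows(100, 6))

    # Two of three replicas agree, so the majority label wins.
    print(tmr_vote("ship", "ship", "cloud"))  # ship
```

With the reported figures, the efficiency works out to roughly 72% on six nodes. The voter assumes each replica produces a hashable result such as a class label; comparing floating-point tensors would need a tolerance-based comparison instead.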
Pages: 7