Embedded GPU Cluster Computing Framework for Inference of Convolutional Neural Networks

被引:0
|
作者
Kain, Evan [1 ]
Wildenstein, Diego [2 ]
Pineda, Andrew C. [3 ]
机构
[1] Univ Pittsburgh, Dept Elect & Comp Engn, NSF Ctr Space Highperformance & Resilient Comp SH, Pittsburgh, PA 15260 USA
[2] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ USA
[3] US Air Force, Spacecraft Component Technol Branch, Space Vehicles Directorate, Res Lab, Kirtland AFB, NM USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The growing need for on-board image processing for space vehicles requires computing solutions that are both low-power and high-performance. Parallel computation using low-power embedded Graphics Processing Units (GPUs) satisfy both requirements. Our experiment involves the use of OpenMPI domain decomposition of an image processing algorithm based upon a pre-trained convolutional neural network (CNN) developed by the U.S. Air Force Research Laboratory (AFRL). Our testbed consists of six NVIDIA Jetson TX2 development boards operating in parallel. This parallel framework results in a speedup of 4.3x on six processing nodes. This approach also leads to a linear decay in parallel efficiency as more processing nodes are added to the network. By replicating the data across processors in addition to distributing, we also characterize the best-case impact of adding triple modular redundancy (TMR) to our application.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Convolutional Neural Networks Inference Accelerator Design using Selective Convolutional Layer
    Huang, Tzu-Huan
    Goh, Emil
    Wey, I-Chyn
    Teo, T. Hui
    2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 166 - 170
  • [42] Compression and Speed-up of Convolutional Neural Networks Through Dimensionality Reduction for Efficient Inference on Embedded Multiprocessor
    Lucas Fernández Brillet
    Nicolas Leclaire
    Stéphane Mancini
    Marina Nicolas
    Sébastien Cleyet-Merle
    Jean-Paul Henriques
    Claude Delnondedieu
    Journal of Signal Processing Systems, 2022, 94 : 263 - 281
  • [43] Compression and Speed-up of Convolutional Neural Networks Through Dimensionality Reduction for Efficient Inference on Embedded Multiprocessor
    Fernandez Brillet, Lucas
    Leclaire, Nicolas
    Mancini, Stephane
    Nicolas, Marina
    Cleyet-Merle, Sebastien
    Henriques, Jean-Paul
    Delnondedieu, Claude
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (03): : 263 - 281
  • [44] Complexity of Deep Convolutional Neural Networks in Mobile Computing
    Naeem, Saad
    Jamil, Noreen
    Khan, Habib Ullah
    Nazir, Shah
    COMPLEXITY, 2020, 2020
  • [45] A Survey of Convolutional Neural Networks on Edge with Reconfigurable Computing
    Vestias, Mario P.
    ALGORITHMS, 2019, 12 (08)
  • [46] CLUSTER CONVOLUTIONAL NEURAL NETWORKS FOR FACIAL AGE ESTIMATION
    Shang, Chong
    Ai, Haizhou
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1817 - 1821
  • [47] Latency Estimation Tool and Investigation of Neural Networks Inference on Mobile GPU
    Ponomarev, Evgeny
    Matveev, Sergey
    Oseledets, Ivan
    Glukhov, Valery
    COMPUTERS, 2021, 10 (08)
  • [48] Calculating the Credibility of Test Samples at Inference by a Layer-wise Activation Cluster Analysis of Convolutional Neural Networks
    Lehmann, Daniel
    Ebner, Marc
    DELTA: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS, 2022, : 34 - 43
  • [49] Simulation-based inference of dynamical galaxy cluster masses with 3D convolutional neural networks
    Ramanah, Doogesh Kodi
    Wojtak, Radoslaw
    Arendse, Nikki
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2021, 501 (03) : 4080 - 4091
  • [50] The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference
    Flagel, Lex
    Brandvain, Yaniv
    Schrider, Daniel R.
    MOLECULAR BIOLOGY AND EVOLUTION, 2019, 36 (02) : 220 - 238