Embedded GPU Cluster Computing Framework for Inference of Convolutional Neural Networks

被引:0
|
作者
Kain, Evan [1 ]
Wildenstein, Diego [2 ]
Pineda, Andrew C. [3 ]
机构
[1] Univ Pittsburgh, Dept Elect & Comp Engn, NSF Ctr Space Highperformance & Resilient Comp SH, Pittsburgh, PA 15260 USA
[2] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ USA
[3] US Air Force, Spacecraft Component Technol Branch, Space Vehicles Directorate, Res Lab, Kirtland AFB, NM USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The growing need for on-board image processing for space vehicles requires computing solutions that are both low-power and high-performance. Parallel computation using low-power embedded Graphics Processing Units (GPUs) satisfy both requirements. Our experiment involves the use of OpenMPI domain decomposition of an image processing algorithm based upon a pre-trained convolutional neural network (CNN) developed by the U.S. Air Force Research Laboratory (AFRL). Our testbed consists of six NVIDIA Jetson TX2 development boards operating in parallel. This parallel framework results in a speedup of 4.3x on six processing nodes. This approach also leads to a linear decay in parallel efficiency as more processing nodes are added to the network. By replicating the data across processors in addition to distributing, we also characterize the best-case impact of adding triple modular redundancy (TMR) to our application.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] An Embedded Inference Framework for Convolutional Neural Network Applications
    Bi, Sheng
    Zhang, Yingjie
    Dong, Min
    Min, Huaqing
    IEEE ACCESS, 2019, 7 : 171084 - 171094
  • [2] Fast Computing Framework for Convolutional Neural Networks
    Korytkowski, Marcin
    Staszewski, Pawel
    Woldan, Piotr
    Scherer, Rafal
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 118 - 123
  • [3] Accelerating Inference of Convolutional Neural Networks Using In-memory Computing
    Dazzi, Martino
    Sebastian, Abu
    Benini, Luca
    Eleftheriou, Evangelos
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15
  • [4] Optimizing Stochastic Computing for Low Latency Inference of Convolutional Neural Networks
    Chen, Zhiyuan
    Ma, Yufei
    Wang, Zhongfeng
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
  • [5] SkippyNN: An Embedded Stochastic-Computing Accelerator for Convolutional Neural Networks
    Hojabr, Reza
    Givaki, Kamyar
    Tayaranian, S. M. Reza
    Esfahanian, Parsa
    Khonsari, Ahmad
    Rahmati, Dara
    Najafi, M. Hassan
    PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,
  • [6] A MapReduce Computing Framework Based on GPU Cluster
    Gao, Heng
    Tang, Jie
    Wu, Gangshan
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1902 - 1907
  • [7] Acceleration of Neural Network Inference for Embedded GPU Systems
    Terakura, Kei
    Chang, Qiong
    Miyazaki, Jun
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 361 - 362
  • [8] Neural Networks Integer Computation: Quantizing Convolutional Neural Networks of Inference and Training for Object Detection in Embedded Systems
    Xiao, Penghao
    Zhang, Chunjie
    Guo, Qian
    Xiao, Xiayang
    Wang, Haipeng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 15862 - 15884
  • [9] CNNParted: An open source framework for efficient Convolutional Neural Network inference partitioning in embedded systems
    Kress, Fabian
    Sidorenko, Vladimir
    Schmidt, Patrick
    Hoefer, Julian
    Hotfilter, Tim
    Walter, Iris
    Harbaum, Tanja
    Becker, Jurgen
    COMPUTER NETWORKS, 2023, 229
  • [10] Simulating quantized inference on convolutional neural networks
    Finotti, Vitor
    Albertini, Bruno
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 95