Embedded GPU Cluster Computing Framework for Inference of Convolutional Neural Networks

Citations: 0

Authors:

Kain, Evan [1]

Wildenstein, Diego [2]

Pineda, Andrew C. [3]

Affiliations:

[1] Univ Pittsburgh, Dept Elect & Comp Engn, NSF Ctr Space Highperformance & Resilient Comp SH, Pittsburgh, PA 15260 USA

[2] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ USA

[3] US Air Force, Spacecraft Component Technol Branch, Space Vehicles Directorate, Res Lab, Kirtland AFB, NM USA

Source:

2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC) | 2019

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP3 [Computing Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

The growing need for on-board image processing for space vehicles requires computing solutions that are both low-power and high-performance. Parallel computation using low-power embedded Graphics Processing Units (GPUs) satisfies both requirements. Our experiment involves the use of OpenMPI domain decomposition of an image processing algorithm based upon a pre-trained convolutional neural network (CNN) developed by the U.S. Air Force Research Laboratory (AFRL). Our testbed consists of six NVIDIA Jetson TX2 development boards operating in parallel. This parallel framework results in a speedup of 4.3x on six processing nodes. This approach also leads to a linear decay in parallel efficiency as more processing nodes are added to the network. By replicating the data across processors in addition to distributing it, we also characterize the best-case impact of adding triple modular redundancy (TMR) to our application.
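To put the reported figures in context, the short sketch below works out the parallel efficiency implied by a 4.3x speedup on six nodes (roughly 72%), using the standard definition of efficiency as speedup divided by node count. It also includes a generic three-way majority voter of the kind commonly used for TMR. The function names `parallel_efficiency` and `majority_vote` are illustrative assumptions and are not taken from the paper, and the voter is not the authors' implementation.

```python
def parallel_efficiency(speedup: float, nodes: int) -> float:
    """Standard definition: efficiency = speedup / number of processing nodes."""
    if nodes <= 0:
        raise ValueError("nodes must be positive")
    return speedup / nodes


def majority_vote(a, b, c):
    """Generic TMR voter (illustrative only): return the value that at
    least two of the three replica results agree on."""
    if a == b or a == c:
        return a
    if b == c:
        return b
    raise ValueError("no two replicas agree")


# Figures reported in the abstract: 4.3x speedup on six Jetson TX2 nodes.
efficiency = parallel_efficiency(4.3, 6)
print(f"parallel efficiency: {efficiency:.1%}")

# Hypothetical replica outputs: one replica disagrees and is outvoted.
print(majority_vote("cat", "cat", "dog"))
```

With TMR, each piece of work is computed three times, so in the best case six nodes deliver the useful throughput of about two independent workers; the abstract describes this as a best-case characterization.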


Pages: 7

50 entries in total

[41] Convolutional Neural Networks Inference Accelerator Design using Selective Convolutional Layer
Huang, Tzu-Huan
Goh, Emil
Wey, I-Chyn
Teo, T. Hui
2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 166 - 170
[42] Compression and Speed-up of Convolutional Neural Networks Through Dimensionality Reduction for Efficient Inference on Embedded Multiprocessor
Lucas Fernández Brillet
Nicolas Leclaire
Stéphane Mancini
Marina Nicolas
Sébastien Cleyet-Merle
Jean-Paul Henriques
Claude Delnondedieu
Journal of Signal Processing Systems, 2022, 94 : 263 - 281
[43] Compression and Speed-up of Convolutional Neural Networks Through Dimensionality Reduction for Efficient Inference on Embedded Multiprocessor
Fernandez Brillet, Lucas
Leclaire, Nicolas
Mancini, Stephane
Nicolas, Marina
Cleyet-Merle, Sebastien
Henriques, Jean-Paul
Delnondedieu, Claude
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (03): : 263 - 281
[44] Complexity of Deep Convolutional Neural Networks in Mobile Computing
Naeem, Saad
Jamil, Noreen
Khan, Habib Ullah
Nazir, Shah
COMPLEXITY, 2020, 2020
[45] A Survey of Convolutional Neural Networks on Edge with Reconfigurable Computing
Vestias, Mario P.
ALGORITHMS, 2019, 12 (08)
[46] CLUSTER CONVOLUTIONAL NEURAL NETWORKS FOR FACIAL AGE ESTIMATION
Shang, Chong
Ai, Haizhou
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1817 - 1821
[47] Latency Estimation Tool and Investigation of Neural Networks Inference on Mobile GPU
Ponomarev, Evgeny
Matveev, Sergey
Oseledets, Ivan
Glukhov, Valery
COMPUTERS, 2021, 10 (08)
[48] Calculating the Credibility of Test Samples at Inference by a Layer-wise Activation Cluster Analysis of Convolutional Neural Networks
Lehmann, Daniel
Ebner, Marc
DELTA: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS, 2022, : 34 - 43
[49] Simulation-based inference of dynamical galaxy cluster masses with 3D convolutional neural networks
Ramanah, Doogesh Kodi
Wojtak, Radoslaw
Arendse, Nikki
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2021, 501 (03) : 4080 - 4091
[50] The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference
Flagel, Lex
Brandvain, Yaniv
Schrider, Daniel R.
MOLECULAR BIOLOGY AND EVOLUTION, 2019, 36 (02) : 220 - 238
