Neural Networks Training on Graphics Processing Unit (GPU) Using Dynamic Parallelism (DP)

Cited by: 0
Authors
Hall, Will [1]
Tian, Yun [1]
Affiliations
[1] Eastern Washington Univ, Spokane, WA 99201 USA
Keywords
Neural network training; GPU; CUDA; Performance; Dynamic parallelism; MEMORY
DOI
10.1007/978-3-031-16078-3_56
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Artificial Neural Networks (ANNs) are a crucial foundation for deep learning and many machine learning algorithms. Training an ANN is computationally intensive and inherently parallel, and can therefore be accelerated by a Graphics Processing Unit (GPU). However, the Back Propagation (BP) algorithm creates dependencies across ANN layers, which makes it challenging to design a highly efficient ANN training algorithm on a GPU. In this work, we investigate Dynamic Parallelism (DP) and demonstrate that it can further speed up ANN training on a GPU. We implemented a generic ANN framework on the GPU that supports an arbitrary number of layers and an arbitrary number of nodes in each layer. In two sets of experiments, we trained the generic ANN on the GPU for handwritten digit recognition, once with DP enabled and once with DP disabled. Training with DP enabled achieved up to a 12.7x performance gain over training with DP disabled on the same GPU. After being trained on the GPU, our neural network achieved an accuracy of 96% in handwritten digit recognition.
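The abstract does not show how DP is applied, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a fully connected network with sigmoid activations and covers the forward pass only. The kernel names (`network_forward`, `layer_forward`), the flat weight and activation layout, and the toy layer sizes are all hypothetical. A single parent thread on the device walks the layers and launches one child grid per layer. Because the launches go into the same stream from the same thread, they run in order, which respects the layer-to-layer dependency without returning to the host between layers.

```cuda
// Build (assumption: device of compute capability 3.5 or newer):
//   nvcc -rdc=true dp_forward.cu -o dp_forward -lcudadevrt
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Child kernel: one thread per output node of one fully connected layer.
// Weights are row-major [n_out x n_in]; the sigmoid activation is an assumption.
__global__ void layer_forward(const float* w, const float* b, const float* in,
                              float* out, int n_in, int n_out) {
    int j = blockIdx.x * blockDim.x + threadIdx.x;
    if (j >= n_out) return;
    float z = b[j];
    for (int i = 0; i < n_in; ++i) z += w[j * n_in + i] * in[i];
    out[j] = 1.0f / (1.0f + expf(-z));
}

// Parent kernel: a single device thread launches one child grid per layer.
// Child grids launched into the same stream by the same thread run in order.
__global__ void network_forward(const float* w, const float* b, float* act,
                                const int* sizes, int n_layers) {
    if (blockIdx.x != 0 || threadIdx.x != 0) return;
    int w_off = 0, b_off = 0, a_off = 0;
    for (int l = 0; l + 1 < n_layers; ++l) {
        int n_in = sizes[l], n_out = sizes[l + 1];
        int threads = 128;
        int blocks = (n_out + threads - 1) / threads;
        layer_forward<<<blocks, threads>>>(w + w_off, b + b_off, act + a_off,
                                           act + a_off + n_in, n_in, n_out);
        w_off += n_in * n_out;  // advance to the next layer's weights
        b_off += n_out;         // advance to the next layer's biases
        a_off += n_in;          // next layer reads what this layer wrote
    }
}

int main() {
    std::vector<int> sizes = {4, 3, 2};  // toy layer sizes, chosen arbitrarily
    int n_layers = static_cast<int>(sizes.size());
    int n_w = 0, n_b = 0, n_a = 0;
    for (int l = 0; l < n_layers; ++l) {
        n_a += sizes[l];
        if (l + 1 < n_layers) { n_w += sizes[l] * sizes[l + 1]; n_b += sizes[l + 1]; }
    }
    // Placeholder parameters and input; a real run would load trained values.
    std::vector<float> h_w(n_w, 0.1f), h_b(n_b, 0.0f), h_a(n_a, 0.0f);
    for (int i = 0; i < sizes[0]; ++i) h_a[i] = 1.0f;

    float *d_w, *d_b, *d_a;
    int* d_sizes;
    cudaMalloc(&d_w, n_w * sizeof(float));
    cudaMalloc(&d_b, n_b * sizeof(float));
    cudaMalloc(&d_a, n_a * sizeof(float));
    cudaMalloc(&d_sizes, n_layers * sizeof(int));
    cudaMemcpy(d_w, h_w.data(), n_w * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, h_b.data(), n_b * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(d_a, h_a.data(), n_a * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(d_sizes, sizes.data(), n_layers * sizeof(int), cudaMemcpyHostToDevice);

    // One host launch; the parent grid completes only after all child grids finish.
    network_forward<<<1, 1>>>(d_w, d_b, d_a, d_sizes, n_layers);
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        std::fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    cudaMemcpy(h_a.data(), d_a, n_a * sizeof(float), cudaMemcpyDeviceToHost);
    int out_off = n_a - sizes.back();
    for (int j = 0; j < sizes.back(); ++j)
        std::printf("output[%d] = %f\n", j, h_a[out_off + j]);

    cudaFree(d_w); cudaFree(d_b); cudaFree(d_a); cudaFree(d_sizes);
    return 0;
}
```

Without DP, the same loop over layers would typically sit on the host and issue one kernel launch per layer. Moving the loop into a parent kernel is one plausible way to cut host-side launch overhead, though the abstract does not say whether this is the source of the reported speedup.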
Pages: 811-818
Number of pages: 8