Energy-efficient Stencil Computations on Distributed GPUs using Dynamic Parallelism and GPU-controlled Communication

被引:1
|
作者
Oden, Lena [1 ]
Klenk, Benjamin [2 ]
Froening, Holger [2 ]
机构
[1] Fraunhofer Inst Ind Math, Competence Ctr High Perfomance Comp, Kaiserslautern, Germany
[2] Heidelberg Univ, Inst Comp Engn, Heidelberg, Germany
关键词
GPUs; Energy Efficient; Dynamic Parallelism; Communication; Data Transfer; Infiniband;
D O I
10.1109/E2SC.2014.14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
GPUs are widely used in high performance computing, due to their high computational power and high performance per Watt. Still, one of the main bottlenecks of GPU-accelerated cluster computing is the data transfer between distributed GPUs. This not only affects performance, but also power consumption. The most common way to utilize a GPU cluster is a hybrid model, in which the GPU is used to accelerate the computation while the CPU is responsible for the communication. This approach always requires a dedicated CPU thread, which consumes additional CPU cycles and therefore increases the power consumption of the complete application. In recent work we have shown that the GPU is able to control the communication independently of the CPU. Still, there are several problems with GPU-controlled communication. The main problem is intra-GPU synchronization, since GPU blocks are non-preemptive. Therefore, the use of communication requests within a GPU can easily result in a deadlock. In this work we show how Dynamic Parallelism solves this problem. GPU-controlled communication in combination with Dynamic Parallelism allows keeping the control flow of multi-GPU applications on the GPU and bypassing the CPU completely. Although the performance of applications using GPU-controlled communication is still slightly worse than the performance of hybrid applications, we will show that performance per Watt increases by up to 10% while still using commodity hardware.
引用
收藏
页码:31 / 40
页数:10
相关论文
共 50 条
  • [21] An Opportunistic MAC Protocol for Energy-Efficient Wireless Communication in a Dynamic, Cyclical Channel
    Wymore, Mathew L.
    Qiao, Daji
    [J]. IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2018, 2 (02): : 533 - 544
  • [22] A Domain-Specific Processor Microarchitecture for Energy-Efficient, Dynamic IoT Communication
    Muzaffar, Shahzad
    Elfadel, Ibrahim M.
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (09) : 2074 - 2087
  • [23] Energy-Efficient Distributed Machine Learning at Wireless Edge with Device-to-Device Communication
    Hu, Rui
    Guo, Yuanxiong
    Gong, Yanmin
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 5208 - 5213
  • [24] Energy-efficient scheme using multiple antennas in secure distributed detection
    Lee, Yonggu
    Choi, Jinho
    [J]. IET SIGNAL PROCESSING, 2018, 12 (05) : 652 - 658
  • [25] Energy-Efficient Distributed Target Tracking Using Wireless Relay Robots
    Ooi, Chia Ching
    Schindelhauer, Christian
    [J]. DISTRIBUTED AUTONOMOUS ROBOTIC SYSTEMS 8, 2009, : 39 - 50
  • [26] Energy-Efficient Stochastic Computing for Convolutional Neural Networks by Using Kernel-wise Parallelism
    Xie, Zaipeng
    Yuan, Chenyu
    Li, Likun
    Wu, Jiahao
    [J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [27] EEMIP: Energy-Efficient Communication Using Timing Channels and Prioritization in ZigBee
    Gocal, Pavol
    Macko, Dominik
    [J]. SENSORS, 2019, 19 (10):
  • [28] Controlled Hopwise Averaging: Bandwidth/Energy-Efficient Asynchronous Distributed Averaging for Wireless Networks
    Tang, Choon Yik
    Lu, Jie
    [J]. 2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 1561 - 1568
  • [29] An Energy-Efficient Distributed Dynamic Bandwidth Allocation Algorithm for Passive Optical Access Networks
    Khalili, Hamzeh
    Rincon, David
    Sallent, Sebastia
    Ramon Piney, Jose
    [J]. SUSTAINABILITY, 2020, 12 (06)
  • [30] Energy-Efficient Distributed Topology Control Algorithm for Low-Power loT Communication Networks
    Yi, Gangman
    Park, Jong Hyuk
    Choi, Sangil
    [J]. IEEE ACCESS, 2016, 4 : 9193 - 9203