Regularity versus Load-Balancing on GPU for treefix computations

被引:3
|
作者
Defour, David [1 ]
Marin, Manuel
机构
[1] Univ Perpignan, DALI, Via Domitia, F-66860 Perpignan, France
关键词
GPU computing; regular versus irregular algorithms; numerical quality; ALGORITHMS;
D O I
10.1016/j.procs.2013.05.194
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The use of GPUs has enabled us to achieve substantial acceleration in highly regular data parallel applications. The trend is now to look at irregular applications, as it requires advanced load balancing technics. However, it is well known that the use of regular computation is preferable and more suitable when working with these architectures. An alternative to the use of load balancing is to rely on scan and other GPU friendly parallel primitives to build the desired result; however implying in return, the involvement of extra memory storage and computation. This article discusses of both solutions for treefix operations, which consist of applying a certain operation while performing a tree traversal. They can be performed by traversing the tree from top to bottom or from bottom to top, applying the proper operation at each vertex. It can be accelerated using either load balancing which maintains a pool of tasks while performing only the necessary amount of computation or using a vector friendly representation that will involve twice the amount of computation than the first solution. We will explore these two approaches and compare them in terms of performance and accuracy. We will show that the vectorial approach is always faster for any category of trees, but it raises accuracy issues when working with floating-point data.
引用
收藏
页码:309 / 318
页数:10
相关论文
共 50 条
  • [31] Load-Balancing in Distributed Selective Search
    Kim, Yubin
    Callan, Jamie
    Culpepper, J. Shane
    Moffat, Alistair
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 905 - 908
  • [32] A Fair and Dynamic Load-Balancing Mechanism
    Larroca, Federico
    Rougier, Jean-Louis
    [J]. TRAFFIC MANAGEMENT AND TRAFFIC ENGINEERING FOR THE FUTURE INTERNET, 2009, 5464 : 36 - 52
  • [33] Multiple Service Load-Balancing with OpenFlow
    Koerner, Marc
    Kao, Odej
    [J]. 2012 IEEE 13TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (HPSR), 2012,
  • [34] Troodon: A machine-learning based load-balancing application scheduler for CPU-GPU system
    Khalid, Yasir Noman
    Aleem, Muhammad
    Ahmed, Usman
    Islam, Muhammad Arshad
    Lqbal, Muhammad Azhar
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 132 : 79 - 94
  • [35] The convergence of realistic distributed load-balancing algorithms
    Cedo, F.
    Cortes, A.
    Ripoll, A.
    Senar, M. A.
    Luque, E.
    [J]. THEORY OF COMPUTING SYSTEMS, 2007, 41 (04) : 609 - 618
  • [36] Improved Analysis of Deterministic Load-Balancing Schemes
    Berenbrink, Petra
    Klasing, Ralf
    Kosowski, Adrian
    Mallmann-Trenn, Frederik
    Uznanski, Przemyslaw
    [J]. ACM TRANSACTIONS ON ALGORITHMS, 2019, 15 (01)
  • [37] Dynamic load-balancing for BSP time warp
    Low, MYH
    [J]. 35TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2002, : 267 - 274
  • [38] Load-balancing routing for wireless access networks
    Hsiao, PH
    Hwang, A
    Kung, HT
    Vlah, D
    [J]. IEEE INFOCOM 2001: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-3, PROCEEDINGS: TWENTY YEARS INTO THE COMMUNICATIONS ODYSSEY, 2001, : 986 - 995
  • [39] Fair Load-Balancing on Parallel Systems for QoS
    Orleans, L. F.
    Furtado, P. N.
    [J]. 2007 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPP), 2007, : 180 - 187
  • [40] Minimum-Cost Load-Balancing Partitions
    Boris Aronov
    Paz Carmi
    Matthew J. Katz
    [J]. Algorithmica, 2009, 54 : 318 - 336