Regularity versus Load-Balancing on GPU for treefix computations

被引:3
|
作者
Defour, David [1 ]
Marin, Manuel
机构
[1] Univ Perpignan, DALI, Via Domitia, F-66860 Perpignan, France
关键词
GPU computing; regular versus irregular algorithms; numerical quality; ALGORITHMS;
D O I
10.1016/j.procs.2013.05.194
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The use of GPUs has enabled us to achieve substantial acceleration in highly regular data parallel applications. The trend is now to look at irregular applications, as it requires advanced load balancing technics. However, it is well known that the use of regular computation is preferable and more suitable when working with these architectures. An alternative to the use of load balancing is to rely on scan and other GPU friendly parallel primitives to build the desired result; however implying in return, the involvement of extra memory storage and computation. This article discusses of both solutions for treefix operations, which consist of applying a certain operation while performing a tree traversal. They can be performed by traversing the tree from top to bottom or from bottom to top, applying the proper operation at each vertex. It can be accelerated using either load balancing which maintains a pool of tasks while performing only the necessary amount of computation or using a vector friendly representation that will involve twice the amount of computation than the first solution. We will explore these two approaches and compare them in terms of performance and accuracy. We will show that the vectorial approach is always faster for any category of trees, but it raises accuracy issues when working with floating-point data.
引用
收藏
页码:309 / 318
页数:10
相关论文
共 50 条
  • [42] Brokering and Load-Balancing Mechanism in the Cloud - Revisited
    Naha, Ranesh Kumar
    Othman, Mohamed
    [J]. IETE TECHNICAL REVIEW, 2014, 31 (04) : 271 - 276
  • [43] A three-stage load-balancing switch
    Wang, Xiaolin
    Cai, Yan
    Xiao, Sheng
    Gong, Weibo
    [J]. 27TH IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (INFOCOM), VOLS 1-5, 2008, : 96 - 100
  • [44] Dynamic load-balancing via a genetic algorithm
    Greene, WA
    [J]. ICTAI 2001: 13TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2001, : 121 - 128
  • [45] Packet Loss Minimization in Load-Balancing Switch
    Audzevich, Yury
    Bodrog, Levente
    Ofek, Yoram
    Telek, Miklos
    [J]. ANALYTICAL AND STOCHASTIC MODELING TECHNIQUES AND APPLICATIONS, PROCEEDINGS, 2010, 6148 : 44 - +
  • [46] On direct routing in the Valiant Load-Balancing architecture
    Liu, H
    Rui, ZS
    [J]. GLOBECOM '05: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6: DISCOVERY PAST AND FUTURE, 2005, : 721 - 726
  • [47] Minimum-Cost Load-Balancing Partitions
    Aronov, Boris
    Carmi, Paz
    Katz, Matthew J.
    [J]. ALGORITHMICA, 2009, 54 (03) : 318 - 336
  • [48] Randomized Algorithms for Dynamic Storage Load-Balancing
    Liu, Liang
    Fortnow, Lance
    Li, Jin
    Wang, Yating
    Xu, Jun
    [J]. PROCEEDINGS OF THE SEVENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC 2016), 2016, : 210 - 222
  • [49] A load-balancing scheme based on Bloom Filters
    Gou, Chengcheng
    Zhao, Rongcai
    Diao, Jing
    [J]. SECOND INTERNATIONAL CONFERENCE ON FUTURE NETWORKS: ICFN 2010, 2010, : 404 - 407
  • [50] Beowulf parallel processing for dynamic load-balancing
    Bennett, BH
    Davis, E
    Kunau, T
    [J]. 2000 IEEE AEROSPACE CONFERENCE PROCEEDINGS, VOL 4, 2000, : 389 - 395