Optimized Distributed Work-Stealing

被引:0
|
作者
Kumar, Vivek [1 ]
Murthy, Karthik [1 ]
Sarkar, Vivek [1 ]
Zheng, Yili [2 ]
机构
[1] Rice Univ, Houston, TX 77251 USA
[2] Lawrence Berkeley Natl Lab, Berkeley, CA USA
关键词
Distributed work-stealing; Habanero; PGAS;
D O I
10.1109/IA3.2016.19
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Work-stealing is a popular approach for dynamic load balancing of task-parallel programs. However, as has been widely studied, the use of classical work-stealing algorithms on massively parallel and distributed supercomputers introduces several performance issues. One such issue is the overhead of failed steals (communicating with a victim that has no work), which is far more severe in the distributed context than within a single SMP node. Due to the cost of inter-node communication, it is critical to reduce the number of failed steals in a distributed context. Prior work has demonstrated that load-aware victim processor selection can reduce the number of failed steals, but it cannot eliminate the failed steals completely. In this paper, we present two different load-aware implementations of distributed work-stealing algorithm in HabaneroUPC++ PGAS library - BaselineWS and SuccessOnlyWS. BaselineWS follows prior work in implementing a distributed work-stealing strategy. SuccessOnlyWS implements a novel distributed work-stealing strategy that completely eliminate inter-node failed attempts by introducing a new policy for moving work from busy to idle processors. This strategy also avoids querying the same processor multiple times with failed steals. We evaluate both BaselineWS and SuccessOnlyWS by using up to 12288 cores of Edison, a CRAY-XC30 supercomputer and by using dynamic irregular applications, as exemplified by the UTS and NQueens benchmarks. We demonstrate that SuccessOnlyWS provides performance improvements up to 7% over BaselineWS.
引用
收藏
页码:74 / 77
页数:4
相关论文
共 50 条
  • [1] Hierarchical Work-Stealing
    Quintin, Jean-Noel
    Wagner, Frederic
    [J]. EURO-PAR 2010 PARALLEL PROCESSING, PT I, 2010, 6271 : 217 - 229
  • [2] Work-Stealing Without The Baggage
    Kumar, Vivek
    Frampton, Daniel
    Blackburn, Stephen M.
    Grove, David
    Tardieu, Olivier
    [J]. ACM SIGPLAN NOTICES, 2012, 47 (10) : 297 - 313
  • [3] Limits of Work-Stealing Scheduling
    Vrba, Zeljko
    Espeland, Havard
    Halvorsen, Pal
    Griwodz, Carsten
    [J]. JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2009, 5798 : 280 - 299
  • [4] Work-stealing with Configurable Scheduling Strategies
    Wimmer, Martin
    Traeff, Jesper Larsson
    Cederman, Daniel
    Tsigas, Philippas
    [J]. ACM SIGPLAN NOTICES, 2013, 48 (08) : 315 - 316
  • [5] On the Merits of Distributed Work-stealing on Selective Locality-aware Tasks
    Paudel, Jeeva
    Tardieu, Olivier
    Amaral, Jose Nelson
    [J]. 2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 100 - 109
  • [6] The natural work-stealing algorithm is stable
    Berenbrink, P
    Friedetzky, T
    Goldberg, LA
    [J]. 42ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2001, : 178 - 187
  • [7] Dynamic memory ABP work-stealing
    Hendler, D
    Lev, Y
    Shavit, N
    [J]. DISTRIBUTED COMPUTING, PROCEEDINGS, 2004, 3274 : 188 - 200
  • [8] Efficient Work-Stealing with Blocking Deques
    Liu, Chi
    Song, Ping
    Liu, Yi
    Hao, Qinfen
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 149 - 152
  • [9] Adaptive work-stealing with parallelism feedback
    Agrawal, Kunal
    Leiserson, Charles E.
    He, Yuxiong
    Hsu, Wen Jing
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2008, 26 (03):
  • [10] The natural work-stealing algorithm is stable
    Berenbrink, P
    Friedetzky, T
    Goldberg, LA
    [J]. SIAM JOURNAL ON COMPUTING, 2003, 32 (05) : 1260 - 1279