Improving Performance of Distributed Graph Traversals via Application-Aware Plug-In Work Scheduler

Cited by: 0
Authors
Firoz, Jesun Sahariar [1]
Zalewski, Marcin [1]
Barnas, Martina [1]
Lumsdaine, Andrew [1]
Affiliations
[1] Indiana Univ, Ctr Res Extreme Scale Technol CREST, Bloomington, IN 47405 USA
Keywords
DOI
10.1007/978-3-319-58943-5_44
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Unordered graph algorithms can offer efficient resource utilization, which is advantageous for performance in a distributed setting. Unordered execution allows for parallel computation without synchronization. In unordered algorithms, work is data-driven and can be performed in any order, refining the result as the algorithm progresses. Unfortunately, a sub-optimal work ordering may lead to more time spent on correcting results than on useful work. On HPC systems, the issue is compounded by the irregular nature of distributed graph algorithms, which makes them sensitive to the whole software/hardware stack, collectively referred to as the runtime. In this paper, we consider an example of such an algorithm: Distributed Control (DC) single-source shortest paths (SSSP). DC relies on performance gains stemming from the inherent asynchrony of unordered algorithms while optimizing work ordering locally. We demonstrate that the distributed runtime's scheduling policy can prevent effective work ordering optimization. We show that lifting and delegating some scheduling decisions to the algorithm level can result in significantly better performance. We propose that this strategy can be useful for performance engineering.
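The effect of work ordering described in the abstract can be illustrated with a minimal single-process sketch. This is not the paper's Distributed Control implementation and involves no distributed runtime; the function name `sssp_label_correcting`, the `ordered` flag, and the toy graph are assumptions made purely for illustration. The sketch runs a label-correcting SSSP in which pending work is either taken in arrival (FIFO) order or ordered locally by tentative distance, and it counts how many times a vertex's outgoing edges are scanned.

```python
import heapq
from collections import deque


def sssp_label_correcting(graph, source, ordered):
    """Label-correcting SSSP on non-negative edge weights.

    graph:   dict mapping vertex -> list of (neighbor, weight) pairs
    ordered: True  -> pending work is popped smallest-distance-first
             False -> pending work is processed in arrival (FIFO) order
    Returns (dist, expansions), where expansions counts how many times
    a vertex's outgoing edges were scanned.
    """
    dist = {u: float("inf") for u in graph}
    dist[source] = 0
    pending = [(0, source)] if ordered else deque([(0, source)])
    expansions = 0
    while pending:
        d, u = heapq.heappop(pending) if ordered else pending.popleft()
        if d > dist[u]:
            continue  # stale work item: a better distance is already known
        expansions += 1
        for v, w in graph[u]:
            nd = d + w
            if nd < dist[v]:
                dist[v] = nd  # refine the result; may be corrected again later
                if ordered:
                    heapq.heappush(pending, (nd, v))
                else:
                    pending.append((nd, v))
    return dist, expansions


# Hypothetical toy graph: the direct edge s->a is much worse than s->b->a.
toy_graph = {
    "s": [("a", 10), ("b", 1)],
    "a": [("c", 1)],
    "b": [("a", 1)],
    "c": [],
}

dist_ordered, work_ordered = sssp_label_correcting(toy_graph, "s", ordered=True)
dist_fifo, work_fifo = sssp_label_correcting(toy_graph, "s", ordered=False)
```

Both runs converge to the same distances, but on this toy graph the FIFO run scans edges 6 times against 4 for the locally ordered run, because it first propagates the poor distance to `a` and then has to correct both `a` and `c`.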
Pages: 545-556
Page count: 12