Dynamic Reconfiguration of Data Parallel Programs

被引:2
|
作者
Dias, Vinicius [1 ]
Meira, Wagner, Jr. [1 ]
Guedes, Dorgival [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
关键词
D O I
10.1109/SBAC-PAD.2016.32
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Given the large amount of data from different sources that have become available to researchers in multiple fields, Data Science has emerged as a new paradigm for exploring and getting value from that data. In that context, new parallel processing environments with abstract programming interfaces, like Spark, were proposed to try to simplify the development of distributed programs. Although such solutions have become widely used, achieving the best performance with them is still not always straight-forward, despite the multiple run-time strategies they use. In this work we analyze some of the causes of performance degradation in such systems and, based on that analysis, we propose a tool to improve performance by dynamically adjusting data partitioning and parallelism degree in recurrent applications based on previous executions. Our results applying that methodology show consistent reductions in execution time for the applications considered, with gains of up to 50%.
引用
收藏
页码:190 / 197
页数:8
相关论文
共 50 条