Dynamic Reconfiguration of Data Parallel Programs

Cited by: 2
Authors
Dias, Vinicius [1]
Meira, Wagner, Jr. [1]
Guedes, Dorgival [1]
Affiliation
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
DOI
10.1109/SBAC-PAD.2016.32
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Given the large amount of data from different sources that has become available to researchers in multiple fields, Data Science has emerged as a new paradigm for exploring and extracting value from that data. In that context, new parallel processing environments with abstract programming interfaces, such as Spark, were proposed to simplify the development of distributed programs. Although such solutions have become widely used, achieving the best performance with them is still not always straightforward, despite the multiple run-time strategies they use. In this work we analyze some of the causes of performance degradation in such systems and, based on that analysis, propose a tool that improves performance by dynamically adjusting data partitioning and the degree of parallelism in recurrent applications, based on previous executions. Our results applying that methodology show consistent reductions in execution time for the applications considered, with gains of up to 50%.
Pages: 190-197
Page count: 8