Diagnosing Performance Bottlenecks in Massive Data Parallel Programs

被引:5
|
作者
Dias, Vinicius [1 ]
Moreira, Rubens [1 ]
Meira, Wagner, Jr. [1 ]
Guedes, Dorgival [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
关键词
D O I
10.1109/CCGrid.2016.81
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The increasing amount of data being stored and the variety of applications being proposed recently to make use of those data enabled a whole new generation of parallel programming environments and paradigms. Although most of these novel environments provide abstract programming interfaces and embed several run-time strategies that simplify several typical tasks in parallel and distributed systems, achieving good performance is still a challenge. In this paper we identify some common sources of performance degradation in the Spark programming environment and discuss some diagnosis dimensions that can be used to better understand such degradation. We then describe our experience in the use of those dimensions to drive the identification performance problems, and suggest how their impact may be minimized considering real applications.
引用
收藏
页码:273 / 276
页数:4
相关论文
共 50 条
  • [31] PARTITIONING OF MASSIVE REAL-TIME PROGRAMS FOR PARALLEL PROCESSING
    LEE, I
    PRYWES, N
    SZYMANSKI, B
    ADVANCES IN COMPUTERS, 1986, 25 : 215 - 275
  • [32] Refinement of data parallel programs in PEI
    Violard, E
    Genaud, S
    Perrin, GR
    ALGORITHMIC LANGUAGES AND CALCULI, 1997, : 107 - 131
  • [33] Dynamic Reconfiguration of Data Parallel Programs
    Dias, Vinicius
    Meira, Wagner, Jr.
    Guedes, Dorgival
    PROCEEDINGS OF 28TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, (SBAC-PAD 2016), 2016, : 190 - 197
  • [34] On Data Distributions in the Construction of Parallel Programs
    Virginia Niculescu
    The Journal of Supercomputing, 2004, 29 : 5 - 25
  • [35] Fault tolerance for data parallel programs
    Bertolli, C.
    Vanneschi, M.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (06): : 595 - 632
  • [36] On data distributions in the construction of parallel programs
    Niculescu, V
    JOURNAL OF SUPERCOMPUTING, 2004, 29 (01): : 5 - 25
  • [37] A performance adviser for the development of parallel programs
    Li, KC
    Zhang, K
    INTERNATIONAL JOURNAL OF HIGH SPEED COMPUTING, 1996, 8 (03): : 205 - 227
  • [38] Performance challenges in modular parallel programs
    Acar U.A.
    Aksenov V.
    Charguéraud A.
    Rainey M.
    ACM SIGPLAN Notices, 2018, 53 (01): : 381 - 382
  • [39] Instrument parallel programs for performance visualisation
    Zhang, K
    Zhang, DQ
    AUSTRALIAN COMPUTER JOURNAL, 1998, 30 (01): : 30 - 38
  • [40] Performance Evaluation of Parallel Heapsort Programs
    Kitano, Hikaru
    Nunome, Atsushi
    Hirata, Hiroaki
    2019 20TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2019, : 435 - 442