Diagnosing Performance Bottlenecks in Massive Data Parallel Programs

被引:5
|
作者
Dias, Vinicius [1 ]
Moreira, Rubens [1 ]
Meira, Wagner, Jr. [1 ]
Guedes, Dorgival [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
关键词
D O I
10.1109/CCGrid.2016.81
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The increasing amount of data being stored and the variety of applications being proposed recently to make use of those data enabled a whole new generation of parallel programming environments and paradigms. Although most of these novel environments provide abstract programming interfaces and embed several run-time strategies that simplify several typical tasks in parallel and distributed systems, achieving good performance is still a challenge. In this paper we identify some common sources of performance degradation in the Spark programming environment and discuss some diagnosis dimensions that can be used to better understand such degradation. We then describe our experience in the use of those dimensions to drive the identification performance problems, and suggest how their impact may be minimized considering real applications.
引用
收藏
页码:273 / 276
页数:4
相关论文
共 50 条
  • [1] Executing parallel programs with synchronization bottlenecks efficiently
    Oyama, Y
    Taura, K
    Yonezawa, A
    PARALLEL AND DISTRIBUTED COMPUTING FOR SYMBOLIC AND IRREGULAR APPLICATIONS, 2000, : 182 - 204
  • [2] Diagnosing Performance Bottlenecks in Emerging Petascale Applications
    Tallent, Nathan R.
    Mellor-Crummey, John M.
    Adhianto, Laksono
    Fagan, Michael W.
    Krentel, Mark
    PROCEEDINGS OF THE CONFERENCE ON HIGH PERFORMANCE COMPUTING NETWORKING, STORAGE AND ANALYSIS, 2009,
  • [3] Synthesizing Programs That Expose Performance Bottlenecks
    Della Toffola, Luca
    Pradel, Michael
    Gross, Thomas R.
    PROCEEDINGS OF THE 2018 INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO'18), 2018, : 314 - 326
  • [4] CHECKING THE PERFORMANCE OF PARALLEL PROGRAMS WITH COMMON DATA
    PARSHENTSEV, VV
    AUTOMATION AND REMOTE CONTROL, 1986, 47 (03) : 437 - 442
  • [5] Detection and optimization approaches for synchronization bottlenecks in parallel programs
    Zhang Y.
    Li L.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2022, 44 (05): : 92 - 101
  • [6] Performance analysis for massive problem data parallel computing
    Shu, Jiwu
    Zheng, Weimin
    Shen, Meiming
    Wang, Dongsheng
    Ruan Jian Xue Bao/Journal of Software, 2000, 11 (05): : 628 - 633
  • [7] Pinpointing and repairing performance bottlenecks in concurrent programs
    Tingting Yu
    Michael Pradel
    Empirical Software Engineering, 2018, 23 : 3034 - 3071
  • [8] Pinpointing and repairing performance bottlenecks in concurrent programs
    Yu, Tingting
    Pradel, Michael
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (05) : 3034 - 3071
  • [9] ScaAnalyzer: A Tool to Identify Memory Scalability Bottlenecks in Parallel Programs
    Liu, Xu
    Wu, Bo
    PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2015,
  • [10] VISUALIZING THE PERFORMANCE OF SPMD AND DATA-PARALLEL PROGRAMS
    ROVER, DT
    WRIGHT, CT
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 18 (02) : 129 - 146