SPARK Job Performance Analysis and Prediction Tool(DEMO)

被引:0
|
作者
Singhal, Rekha [1 ]
Phalak, Chetan [1 ]
Kumar, Praveen [1 ]
机构
[1] TCS Res, Chennai, Tamil Nadu, India
关键词
Spark; Performance; Prediction;
D O I
10.1145/3185768.3185772
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Spark is one of most widely deployed in-memory big data technology for parallel data processing across cluster of machines. The availability of these big data platforms on commodity machines has raised the challenge of assuring performance of applications with increase in data size. We have build a tool to assist application developer and tester to estimate an application execution time for larger data size before deployment. Conversely, the tool may also be used to estimate the competent cluster size for desired application performance in production environment. The tool can be used for detailed profiling of Spark job, post execution, to understand performance bottleneck. This tool incorporates different configurations of Spark cluster to estimate application performance. Therefore, it can also be used with optimization techniques to get tuned value of Spark parameters for an optimal performance. The tool's key innovations are support for different configurations of Spark platform for performance prediction and simulator to estimate Spark stage execution time which includes task execution variability due to HDFS, data skew and cluster nodes heterogeneity. The tool using model [3] has been shown to predict within 20% error bound for Wordcount, Terasort,Kmeans and few SQL workloads.
引用
收藏
页码:49 / 50
页数:2
相关论文
共 50 条
  • [21] PREDICTION OF JOB REQUIREMENTS BY A STRUCTURED JOB ANALYSIS PROCEDURE
    MCCORMICK, EJ
    CUNNINGHAM, JW
    THORNTON, GC
    [J]. PERSONNEL PSYCHOLOGY, 1967, 20 (04) : 431 - 440
  • [22] THE KOMPLEX PERFORMANCE PREDICTION TOOL
    LEHMANN, D
    BERGHOLZ, G
    [J]. SYSTEMS ANALYSIS MODELLING SIMULATION, 1988, 5 (03): : 243 - 251
  • [23] DACUM - AN ALTERNATIVE JOB ANALYSIS TOOL
    KOSIDLAK, JG
    [J]. PERSONNEL, 1987, 64 (03) : 14 - &
  • [24] Job analysis is organization's tool
    Voorhies, Darrell H.
    [J]. LIBRARY JOURNAL, 1947, 72 (22) : 1737 - 1738
  • [25] DEMO: CNN Performance Prediction on a CPU-based Edge Platform
    Velasco-Montero, D.
    Fernandez-Berni, J.
    Carmona-Galan, R.
    Rodriguez-Vazquez, A.
    [J]. ICDSC 2019: 13TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS, 2019,
  • [26] JOB SATISFACTION AND JOB PERFORMANCE - THEORETICAL ANALYSIS
    LOCKE, EA
    [J]. ORGANIZATIONAL BEHAVIOR AND HUMAN PERFORMANCE, 1970, 5 (05): : 484 - 500
  • [27] Machine Learning for Performance Prediction of Spark Cloud Applications
    Maros, Alexandre
    Murai, Fabricio
    Couto da Silva, Ana Paula
    Almeida, Jussara M.
    Lattuada, Marco
    Gianniti, Eugenio
    Hosseini, Marjan
    Ardagna, Danilo
    [J]. 2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 99 - 106
  • [28] Performance Prediction for Convolutional Neural Network on Spark Cluster
    Myung, Rohyoung
    Yu, Heonchang
    [J]. ELECTRONICS, 2020, 9 (09) : 1 - 22
  • [29] Software Defect Prediction with Bug-Code Analyzer - a Data Collection Tool Demo
    Mausa, Goran
    Grbac, Tihana Galinac
    Basic, Bojana Dalbelo
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2014,
  • [30] The Validity of Conscientiousness Is Overestimated in the Prediction of Job Performance
    Kepes, Sven
    McDaniel, Michael A.
    [J]. PLOS ONE, 2015, 10 (10):