SimCost: cost-effective resource provision prediction and recommendation for spark workloads

被引:0
|
作者
Yuxing Chen
Mohammad A. Hoque
Pengfei Xu
Jiaheng Lu
Sasu Tarkoma
机构
[1] Univeristy of Helsinki,Department of Computer Science
来源
关键词
Parameter tuning; Cost modeling; Spark; Resource provisioning;
D O I
暂无
中图分类号
学科分类号
摘要
Spark is one of the most popular big data analytical platforms. To save time, achieve high resource utilization, and remain cost-effective for Spark jobs, it is challenging but imperative for data scientists to configure suitable resource portions.In this paper, we investigate the proper parameter values that meet workloads’ performance requirements with minimized resource cost and resource utilization time. We propose SimCost, a simulation-based cost model, to predict the performance of jobs accurately. We achieve low-cost training by taking advantage of simulation framework, i.e., Monte Carlo simulation, which uses a small amount of data and resources to make a reliable prediction for larger datasets and clusters. Our method’s salient feature is that it allows us to invest low training costs while obtaining an accurate prediction. Through empirical experiments with 12 benchmark workloads, we show that the cost model yields less than 5% error on average prediction accuracy, and the recommendation achieves up to 6x resource cost saving.
引用
收藏
页码:73 / 102
页数:29
相关论文
共 50 条
  • [1] SimCost: cost-effective resource provision prediction and recommendation for spark workloads
    Chen, Yuxing
    Hoque, Mohammad A.
    Xu, Pengfei
    Lu, Jiaheng
    Tarkoma, Sasu
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2024, 42 (01) : 73 - 102
  • [2] Cost-effective Resource Provisioning for Spark Workloads
    Chen, Yuxing
    Lu, Jiaheng
    Chen, Chen
    Hoque, Mohammad
    Tarkoma, Sasu
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2477 - 2480
  • [3] Cost-effective charity provision
    Jarvis, Suzanne
    [J]. VETERINARY RECORD, 2022, 190 (03) : 89 - 89
  • [4] A recommendation for cost-effective preparation of phenylephrine
    Bhatia, Pradeep
    Kumar, Rakesh
    Sharma, Ankur
    Chhabra, Swati
    Mohammed, Sadik
    [J]. JOURNAL OF ANAESTHESIOLOGY CLINICAL PHARMACOLOGY, 2018, 34 (03) : 424 - 425
  • [5] Dynamic Cost-effective Emergency Network Provision
    Sriramulu, Roop K.
    Park, Younghee
    Chang, Sang-Yoon
    Liu, Kaikai
    [J]. PROCEEDINGS OF THE 2017 FIRST CONEXT WORKSHOP ON ICT TOOLS FOR EMERGENCY NETWORKS AND DISASTER RELIEF (I-TENDER '17), 2017, : 37 - 43
  • [6] Barriers to the provision of cost-effective technical education in Bangladesh
    Oxtoby, R
    [J]. INTERNATIONAL JOURNAL OF EDUCATIONAL DEVELOPMENT, 1997, 17 (01) : 91 - 99
  • [7] Cost-effective prediction of reading difficulties
    Heath, SM
    Hogben, JH
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2004, 47 (04): : 751 - 765
  • [8] Cost-Effective Resource Provisioning for MapReduce in a Cloud
    Palanisamy, Balaji
    Singh, Aameek
    Liu, Ling
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (05) : 1265 - 1279
  • [9] Approach to Intensive Care Costing and Provision of Cost-effective Care
    Chacko, Binila
    Ramakrishnan, Nagarajan
    Peter, John Victor
    [J]. INDIAN JOURNAL OF CRITICAL CARE MEDICINE, 2023, 27 (12) : 876 - 887
  • [10] Cost-effective innovations in low-resource settings
    Chang, O. H.
    Wilkinson, J. P.
    [J]. BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2018, 125 (09) : 1185 - 1185