Scalability and performance analysis of BDPS in clouds

被引:0
|
作者
Yuegang Li
Dongyang Ou
Xin Zhou
Congfeng Jiang
Christophe Cérin
机构
[1] Hangzhou Dianzi University,School of Computer Science and Technology
[2] Université Sorbonne Paris Nord,undefined
[3] LIPN UMR CNRS 7030,undefined
来源
Computing | 2022年 / 104卷
关键词
Big data processing platforms; Scalability; Performance optimization; Cloud computing; Hadoop; Spark; 68M14;
D O I
暂无
中图分类号
学科分类号
摘要
The increasing demand for big data processing leads to commercial off-the-shelf (COTS) and cloud-based big data analytics services. Giant cloud service vendors provide customized big data processing systems (BDPS), which are more cost-effective for operation and maintenance than self-owned platforms. End users can rent big data analytics services with a pay-as-you-go cost model. However, when users’ data size increases, they need to scale their rental BDPS in order to achieve approximately the same performance, such as task completion time and normalized system throughput. Unfortunately, there is no effective way to help end-users to choose between scale-up direction and scale-out direction to expand their existing rental BDPS. Moreover, there is no any metric to measure the scalability of BDPS, either. Furthermore, the performance of BDPS services at different time slots is not consistent due to co-location and workload placement policies in modern internet data centers. To this end, this paper proposes scalability metric for BDPS in clouds, which can mitigate the aforementioned issues. This scalability metric quantifies the scalability of BDPS consistently under different system expansion configurations. This paper also conducts experiments on real BDPS platforms and derives optimization approaches for better scalability of BDPS, such as file compression during Shuffle process in MapReduce. The experiment results demonstrate the validity of the proposed optimization strategies.
引用
收藏
页码:1425 / 1460
页数:35
相关论文
共 50 条
  • [31] On the performance and power consumption analysis of elastic clouds
    Guo, KunYin
    Yu, Ke
    Pang, ShanChen
    Yang, Dan
    Huang, Jun
    Xia, YunNi
    Luo, Xin
    Li, Jia
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (17): : 4367 - 4384
  • [32] SensorConnect performance and scalability experiments
    Hoag, Joseph
    Phillips, Reid
    Thompson, Craig
    Huetter, Ray
    Veizades, John
    2006 INTERNATIONAL SYMPOSIUM ON INDUSTRIAL EMBEDDED SYSTEMS, 2006, : 121 - +
  • [33] Performance and scalability of EJB applications
    Cecchet, E
    Marguerite, J
    Zwaenepoel, W
    ACM SIGPLAN NOTICES, 2002, 37 (11) : 246 - 261
  • [34] Evaluating Scalability Performance in Azure
    Gusev, Marjan
    Ristov, Sasko
    Kolic, Kristina
    FUTURE ACCESS ENABLERS FOR UBIQUITOUS AND INTELLIGENT INFRASTRUCTURES, 2015, 159 : 241 - 247
  • [35] PERFORMANCE AND SCALABILITY OF FINITE-ELEMENT ANALYSIS FOR DISTRIBUTED PARALLEL COMPUTATION
    BARRAGY, E
    CAREY, GF
    VANDEGEIJN, R
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 21 (02) : 202 - 212
  • [36] Computing Resources Scalability Performance Analysis in Cloud Computing Data Center
    Oumaima Ghandour
    Said El Kafhali
    Mohamed Hanini
    Journal of Grid Computing, 2023, 21
  • [37] Scalability and Performance Analysis in 5G Core Network Slicing
    Arteaga, Carlos Hernan Tobar
    Ordonez, Armando
    Rendon, Oscar Mauricio Caicedo
    IEEE ACCESS, 2020, 8 : 142086 - 142100
  • [38] AUTOMOD®: PERFORMANCE, SCALABILITY AND ACCURACY
    Muller, Daniel J.
    2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 4435 - 4447
  • [39] Scalability of Multifinger HEMT Performance
    Crupi, Giovanni
    Raffo, Antonio
    Vadala, Valeria
    Vannini, Giorgio
    Schreurs, Dominique M. M. -P.
    Caddemi, Alina
    IEEE MICROWAVE AND WIRELESS COMPONENTS LETTERS, 2020, 30 (09) : 869 - 872
  • [40] Sensorconnect performance and scalability experiments
    Hoag, Joseph
    Phillips, Reid
    Thompson, Craig
    Huetter, Ray
    Veizades, John
    2007 IEEE INTERNATIONAL CONFERENCE ON RFID, 2007, : 230 - +