Scalability and performance analysis of BDPS in clouds

Authors
Yuegang Li
Dongyang Ou
Xin Zhou
Congfeng Jiang
Christophe Cérin
Affiliations
[1] Hangzhou Dianzi University, School of Computer Science and Technology
[2] Université Sorbonne Paris Nord
[3] LIPN UMR CNRS 7030
Source
Computing | 2022 / Vol. 104
Keywords
Big data processing platforms; Scalability; Performance optimization; Cloud computing; Hadoop; Spark; 68M14
DOI
Not available
Abstract
The increasing demand for big data processing has led to commercial off-the-shelf (COTS) and cloud-based big data analytics services. Large cloud service vendors provide customized big data processing systems (BDPS), which are more cost-effective to operate and maintain than self-owned platforms. End users can rent big data analytics services under a pay-as-you-go cost model. However, as users’ data size increases, they need to scale their rented BDPS to maintain approximately the same performance, measured for example by task completion time and normalized system throughput. Unfortunately, there is no effective way to help end users choose between scaling up and scaling out when expanding their existing rented BDPS. Moreover, there is no metric for measuring the scalability of BDPS. Furthermore, the performance of BDPS services is not consistent across time slots because of co-location and workload placement policies in modern internet data centers. To address these issues, this paper proposes a scalability metric for BDPS in clouds. The metric quantifies the scalability of BDPS consistently under different system expansion configurations. The paper also reports experiments on real BDPS platforms and derives optimization approaches for better BDPS scalability, such as file compression during the Shuffle process in MapReduce. The experimental results demonstrate the validity of the proposed optimization strategies.
Pages: 1425-1460
Page count: 35