Scalability and performance analysis of BDPS in clouds

被引:0
|
作者
Yuegang Li
Dongyang Ou
Xin Zhou
Congfeng Jiang
Christophe Cérin
机构
[1] Hangzhou Dianzi University,School of Computer Science and Technology
[2] Université Sorbonne Paris Nord,undefined
[3] LIPN UMR CNRS 7030,undefined
来源
Computing | 2022年 / 104卷
关键词
Big data processing platforms; Scalability; Performance optimization; Cloud computing; Hadoop; Spark; 68M14;
D O I
暂无
中图分类号
学科分类号
摘要
The increasing demand for big data processing leads to commercial off-the-shelf (COTS) and cloud-based big data analytics services. Giant cloud service vendors provide customized big data processing systems (BDPS), which are more cost-effective for operation and maintenance than self-owned platforms. End users can rent big data analytics services with a pay-as-you-go cost model. However, when users’ data size increases, they need to scale their rental BDPS in order to achieve approximately the same performance, such as task completion time and normalized system throughput. Unfortunately, there is no effective way to help end-users to choose between scale-up direction and scale-out direction to expand their existing rental BDPS. Moreover, there is no any metric to measure the scalability of BDPS, either. Furthermore, the performance of BDPS services at different time slots is not consistent due to co-location and workload placement policies in modern internet data centers. To this end, this paper proposes scalability metric for BDPS in clouds, which can mitigate the aforementioned issues. This scalability metric quantifies the scalability of BDPS consistently under different system expansion configurations. This paper also conducts experiments on real BDPS platforms and derives optimization approaches for better scalability of BDPS, such as file compression during Shuffle process in MapReduce. The experiment results demonstrate the validity of the proposed optimization strategies.
引用
收藏
页码:1425 / 1460
页数:35
相关论文
共 50 条
  • [1] Scalability and performance analysis of BDPS in clouds
    Li, Yuegang
    Ou, Dongyang
    Zhou, Xin
    Jiang, Congfeng
    Cerin, Christophe
    COMPUTING, 2022, 104 (06) : 1425 - 1460
  • [2] Noise in the Clouds: Influence of Network Performance Variability on Application Scalability
    De Sensi, Daniele
    De Matteis, Tiziano
    Taranov, Konstantin
    Di Girolamo, Salvatore
    Rahn, Tobias
    Hoefler, Torsten
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2022, 6 (03)
  • [3] Performance and Scalability Analysis of Ethereum and Hyperledger Fabric
    Ucbas, Yusuf
    Eleyan, Amna
    Hammoudeh, Mohammad
    Alohaly, Manar
    IEEE ACCESS, 2023, 11 : 67156 - 67167
  • [4] Variations in Performance and Scalability: An Experimental Study in IaaS Clouds Using Multi-Tier Workloads
    Jayasinghe, Deepal
    Malkowski, Simon
    Li, Jack
    Wang, Qingyang
    Wang, Zhikui
    Pu, Calton
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2014, 7 (02) : 293 - 306
  • [5] Performance-Based Analysis of Blockchain Scalability Metric
    Yadav, Jyoti
    Shevkar, Ranjana
    TEHNICKI GLASNIK-TECHNICAL JOURNAL, 2021, 15 (01): : 133 - 142
  • [6] Scalability and performance analysis of a probabilistic domain decomposition method
    Acebron, Juan A.
    Spigler, Renato
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 1257 - +
  • [7] A Scalability and Performance Analysis of Preauthentication Algorithms for Wireless Networks
    Christakos, Constantine
    Allen, Patrick D.
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2012, 61 (07) : 3166 - 3176
  • [8] Performance Optimization and Scalability Analysis of the MGB Hydrological Model
    Freitas, Henrique R. A.
    Mendes, Celso L.
    Ilic, Aleksandar
    2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 31 - 40
  • [9] H.264/SVC scalability performance analysis
    Ben Rhaiem, Olfa
    Fourati, Lamia Chaari
    2013 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2013), 2013, : 203 - 208
  • [10] Analysis of Scalability and Performance in Passive Optical CDMA Networks
    Karbassian, M. Massoud
    Ghafouri-Shiraz, Hooshang
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2009, 27 (17) : 3896 - 3903