Scalability and performance analysis of BDPS in clouds

被引:0
|
作者
Yuegang Li
Dongyang Ou
Xin Zhou
Congfeng Jiang
Christophe Cérin
机构
[1] Hangzhou Dianzi University,School of Computer Science and Technology
[2] Université Sorbonne Paris Nord,undefined
[3] LIPN UMR CNRS 7030,undefined
来源
Computing | 2022年 / 104卷
关键词
Big data processing platforms; Scalability; Performance optimization; Cloud computing; Hadoop; Spark; 68M14;
D O I
暂无
中图分类号
学科分类号
摘要
The increasing demand for big data processing leads to commercial off-the-shelf (COTS) and cloud-based big data analytics services. Giant cloud service vendors provide customized big data processing systems (BDPS), which are more cost-effective for operation and maintenance than self-owned platforms. End users can rent big data analytics services with a pay-as-you-go cost model. However, when users’ data size increases, they need to scale their rental BDPS in order to achieve approximately the same performance, such as task completion time and normalized system throughput. Unfortunately, there is no effective way to help end-users to choose between scale-up direction and scale-out direction to expand their existing rental BDPS. Moreover, there is no any metric to measure the scalability of BDPS, either. Furthermore, the performance of BDPS services at different time slots is not consistent due to co-location and workload placement policies in modern internet data centers. To this end, this paper proposes scalability metric for BDPS in clouds, which can mitigate the aforementioned issues. This scalability metric quantifies the scalability of BDPS consistently under different system expansion configurations. This paper also conducts experiments on real BDPS platforms and derives optimization approaches for better scalability of BDPS, such as file compression during Shuffle process in MapReduce. The experiment results demonstrate the validity of the proposed optimization strategies.
引用
收藏
页码:1425 / 1460
页数:35
相关论文
共 50 条
  • [21] Performance and scalability analysis on client-server workflow architecture
    Kim, KH
    Han, DS
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, : 179 - 186
  • [22] A general framework for scalability and performance analysis of DHT routing systems
    Kong, Joseph S.
    Bridgewater, Jesse S. A.
    Roychowdhury, Vwani P.
    DSN 2006 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2006, : 343 - 352
  • [23] Scalability analysis and performance prediction for cellular programs on parallel computers
    Folino, G
    Spezzano, G
    THEORETICAL AND PRACTICAL ISSUES ON CELLULAR AUTOMATA, 2001, : 37 - 46
  • [24] SCALABILITY AND PERFORMANCE ANALYSIS OF OPENMP CODES USING THE PERISCOPE TOOLKIT
    Benedict, Shajulin
    Gerndt, Michael
    COMPUTING AND INFORMATICS, 2014, 33 (04) : 921 - 942
  • [25] Understanding the Performance of Bluetooth Mesh: Reliability, Delay, and Scalability Analysis
    Rondon, Raul
    Mahmood, Aamir
    Grimaldi, Simone
    Gidlund, Mikael
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (03) : 2089 - 2101
  • [26] Scalability and performance evaluation of federated learning frameworks: a comparative analysis
    Soudan, Bassel
    Abbas, Sohail
    Kubba, Ahmed
    Abu Waraga, Omnia
    Abu Talib, Manar
    Nasir, Qassim
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
  • [27] On the Suitability of Bluetooth 5 for the Internet of Things: Performance and Scalability Analysis
    Boecker, Stefan
    Arendt, Christian
    Wietfeld, Christian
    2017 IEEE 28TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2017,
  • [28] Performance Scalability Analysis of Java']JavaScript Applications with Web Workers
    Verdu, Javier
    Pajuelo, Alex
    IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (02) : 105 - 108
  • [29] In-silico Research Platform in the Cloud - Performance and Scalability Analysis
    Ivanovic, Milos
    Zivic, Andreja
    Tachos, Nikolaos
    Gois, George
    Filipovic, Nenad
    Fotiadis, Dimitrios, I
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (IEEE BIBE 2021), 2021,
  • [30] Stochastic bounding models for performance analysis of clouds
    Ait-Salaht, Farah
    Castel-Taleb, Hind
    CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING, 2015, : 603 - 610