Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services

被引:1
|
作者
Bento, Andre [1 ]
Araujo, Filipe [1 ]
Barbosa, Raul [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
关键词
Cloud services; Microservices; Availability modeling; Cost-effectiveness; Multi-objective optimization; Autoscaling; MULTIOBJECTIVE OPTIMIZATION; MICROSERVICES;
D O I
10.1007/s10723-023-09718-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud services have become increasingly popular for developing large-scale applications due to the abundance of resources they offer. The scalability and accessibility of these resources have made it easier for organizations of all sizes to develop and implement sophisticated and demanding applications to meet demand instantly. As monetary fees are involved in the use of the cloud, one of the challenges for application developers and operators is to balance their budget constraints with crucial quality attributes, such as availability. Industry standards usually default to simplified solutions that cannot simultaneously consider competing objectives. Our research addresses this challenge by proposing a Cost-Availability Aware Scaling (CAAS) approach that uses multi-objective optimization of availability and cost. We evaluate CAAS using two open-source microservices applications, yielding improved results compared to the industry standard CPU-based Autoscaler (AS). CAAS can find optimal system configurations with higher availability, between 1 and 2 nines on average, and reduced costs, 6% on average, with the first application, and 1 nine of availability on average, and reduced costs up to 18% on average, with the second application. The gap in the results between our model and the default AS suggests that operators can significantly improve the operation of their applications.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services
    Andre Bento
    Filipe Araujo
    Raul Barbosa
    Journal of Grid Computing, 2023, 21
  • [2] Towards Efficient Automatic Scaling and Adaptive cost-optimized eHealth Services in Cloud
    Rachkidi, Elie
    Cherkaoui, El Hadi
    Ait-idir, Mustapha
    Agoulmine, Nazim
    Taher, Nada Chendeb
    Santos, Marcelo
    Fernandes, Stenio
    2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,
  • [3] Infra: SLO Aware Elastic Auto Scaling in the Cloud for Cost Reduction
    Sidhanta, Subhajit
    Mukhopadhyay, Supratik
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 141 - 148
  • [4] Scaling the Performance and Cost While Scaling the Load and Resources in the Cloud
    Simjanoska, Monika
    Ristov, Sasko
    Velkoski, Goran
    Gusev, Marjan
    2013 36TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2013, : 151 - 156
  • [5] An Availability-aware Virtual Machine Placement Approach for Dynamic Scaling of Cloud Applications
    Wang, Wenting
    Chen, Haopeng
    Chen, Xi
    2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INTELLIGENCE & COMPUTING AND 9TH INTERNATIONAL CONFERENCE ON AUTONOMIC & TRUSTED COMPUTING (UIC/ATC), 2012, : 509 - 516
  • [6] Scaling persistent connections for cloud services
    Lin, Wenjie
    Sharma, Puneet
    Chatterjee, Sarbajit
    Sharma, Deepti
    Lee, David
    Iyer, Subu
    Gupta, Ajay
    COMPUTER NETWORKS, 2015, 93 : 518 - 530
  • [7] Cost-aware Automatic Virtual Machine Scaling in Fine Granularity for Cloud Applications
    Zhao, He
    Peng, Chenglei
    Yu, Yao
    Zhou, Yu
    Wang, Ziqiang
    Du, Sidan
    2013 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2013, : 109 - 116
  • [8] Cost Aware Resource Sizing and Scaling of Microservices
    Agarwal, Preyashi
    Lakshmi, J.
    2019 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2019), 2019, : 66 - 74
  • [9] Web Scaling Frameworks for Web Services in the Cloud
    Fankhauser, Thomas
    Wang, Qi
    Gerlicher, Ansgar
    Grecos, Christos
    Wang, Xinheng
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (05) : 728 - 741
  • [10] Cost-aware automatic scaling and workload-aware replica management for edge-cloud environment
    Li, Chunlin
    Liu, Jun
    Lu, Bo
    Luo, Youlong
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 180