Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services

被引:1
|
作者
Bento, Andre [1 ]
Araujo, Filipe [1 ]
Barbosa, Raul [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
关键词
Cloud services; Microservices; Availability modeling; Cost-effectiveness; Multi-objective optimization; Autoscaling; MULTIOBJECTIVE OPTIMIZATION; MICROSERVICES;
D O I
10.1007/s10723-023-09718-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud services have become increasingly popular for developing large-scale applications due to the abundance of resources they offer. The scalability and accessibility of these resources have made it easier for organizations of all sizes to develop and implement sophisticated and demanding applications to meet demand instantly. As monetary fees are involved in the use of the cloud, one of the challenges for application developers and operators is to balance their budget constraints with crucial quality attributes, such as availability. Industry standards usually default to simplified solutions that cannot simultaneously consider competing objectives. Our research addresses this challenge by proposing a Cost-Availability Aware Scaling (CAAS) approach that uses multi-objective optimization of availability and cost. We evaluate CAAS using two open-source microservices applications, yielding improved results compared to the industry standard CPU-based Autoscaler (AS). CAAS can find optimal system configurations with higher availability, between 1 and 2 nines on average, and reduced costs, 6% on average, with the first application, and 1 nine of availability on average, and reduced costs up to 18% on average, with the second application. The gap in the results between our model and the default AS suggests that operators can significantly improve the operation of their applications.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Adaptive Application Scaling for Improving Fault-Tolerance and Availability in the Cloud
    Radhakrishnan, Ganesan
    BELL LABS TECHNICAL JOURNAL, 2012, 17 (02) : 5 - 14
  • [22] Availability-Aware Container Scheduler for Application Services in Cloud
    Alahmad, Yanal
    Daradkeh, Tariq
    Agarwal, Anjali
    2018 IEEE 37TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2018,
  • [23] Dynamic Scaling of Call-Stateful SIP Services in the Cloud
    Janssens, Nico
    An, Xueli
    Daenen, Koen
    Forlivesi, Claudio
    NETWORKING 2012, PT I, 2012, 7289 : 175 - 189
  • [24] Exploiting User Patience for Scaling Resource Capacity in Cloud Services
    Cunha, Renato L. F.
    Assuncao, Marcos D.
    Cardonha, Carlos
    Netto, Marco A. S.
    2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 449 - 456
  • [25] Elastic Cloud Services: Scaling Snowflake's Control Plane
    Melissaris, Themis
    Nabar, Kunal
    Radut, Rares
    Rehmtulla, Samir
    Shi, Arthur
    Chandrashekar, Samartha
    Papapanagiotou, Ioannis
    PROCEEDINGS OF THE 13TH SYMPOSIUM ON CLOUD COMPUTING, SOCC 2022, 2022, : 142 - 157
  • [26] A Method to Compare Scaling Algorithms for Cloud-Based Services
    De Vleeschauwer, Danny
    Chang, Chia-Yu
    Soto, Paola
    De Bock, Yorick
    Camelo, Miguel
    De Schepper, Koen
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2025, 13 (01) : 34 - 45
  • [27] Auto Scaling Strategy for Amazon Web Services in Cloud Computing
    Liao, Wen-Hwa
    Kuai, Ssu-Chi
    Leau, Yu-Ren
    2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 1059 - 1064
  • [28] A cost-AWARE approach based ON learning automata FOR resource auto-scaling IN cloud computing environment
    Mogoui, Khosro
    Arani, Mostafa Ghobaei
    International Journal of Hybrid Information Technology, 2015, 8 (07): : 389 - 398
  • [29] Heterogeneity-aware elastic scaling of streaming applications on cloud platforms
    Sahni, Jyoti
    Vidyarthi, Deo Prakash
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (09): : 10512 - 10539
  • [30] Energy-Aware Load Balancing and Application Scaling for the Cloud Ecosystem
    Paya, Ashkan
    Marinescu, Dan C.
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2017, 5 (01) : 15 - 27