On the Value of Service Demand Estimation for Auto-scaling

被引:17
|
作者
Bauer, Andre [1 ]
Grohmann, Johannes [1 ]
Herbst, Nikolas [1 ]
Kounev, Samuel [1 ]
机构
[1] Univ Wurzburg, Wurzburg, Germany
关键词
Service demand estimation; Auto-scaling; Online estimation; Elastic cloud computing;
D O I
10.1007/978-3-319-74947-1_10
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the context of performance models, service demands are key model parameters capturing the average time individual requests of different workload classes are actively processed. In a system under load, due to measurement interference, service demands normally cannot be measured directly, however, a number of estimation approaches exist based on high-level performance metrics. In this paper, we show that service demands provide significant benefits for implementing modern auto-scalers. Auto-scaling describes the process of dynamically adjusting the number of allocated virtual resources (e.g., virtual machines) in a data center according to the incoming workload. We demonstrate that even a simple auto-scaler that leverages information about service demands significantly outperforms auto-scalers solely based on CPU utilization measurements. This is shown by testing two approaches in three different scenarios. Our results show that the service demand-based auto-scaler outperforms the CPU utilization-based one in all scenarios. Our results encourage further research on the application of service demand estimates for resource management in data centers.
引用
收藏
页码:142 / 156
页数:15
相关论文
共 50 条
  • [31] Concurrent service auto-scaling for Knative resource quota-based serverless system
    Tran, Minh-Ngoc
    Kim, YoungHan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 160 : 326 - 339
  • [32] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Yang, Jingqi
    Liu, Chuanchang
    Shang, Yanlei
    Cheng, Bo
    Mao, Zexiang
    Liu, Chunhong
    Niu, Lisha
    Chen, Junliang
    INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
  • [33] A Holistic Auto-Scaling Algorithm for Multi-Service Applications Based on Balanced Queuing Network
    Tong, Jingwan
    Wei, Mingchang
    Pan, Maolin
    Yu, Yang
    2021 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2021, 2021, : 531 - 540
  • [34] A survey on auto-scaling: how to exploit cloud elasticity
    Catillo, Marta
    Villano, Umberto
    Rak, Massimiliano
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2023, 14 (01) : 37 - 50
  • [35] An Auto-Scaling Approach for Microservices in Cloud Computing Environments
    Matineh ZargarAzad
    Mehrdad Ashtiani
    Journal of Grid Computing, 2023, 21
  • [36] Multi-objective auto-scaling scheduling for micro-service workflows in hybrid clouds
    Wang, Shijia
    Liu, Xuan
    Gao, Ming
    Chen, Mingxia
    Yung, Kai Leung
    Jiang, Shancheng
    ENTERPRISE INFORMATION SYSTEMS, 2023, 17 (07)
  • [37] Auto-scaling Applications in Edge Computing: Taxonomy and Challenges
    Taherizadeh, Salman
    Stankovski, Vlado
    INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS (BDIOT 2017), 2017, : 158 - 163
  • [38] An Autonomic Auto-scaling Controller for Cloud Based Applications
    Londono-Peldaez, Jorge M.
    Florez-Samur, Carlos A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (09) : 1 - 6
  • [39] Chamulteon: Coordinated Auto-Scaling of Micro-Services
    Bauer, Andre
    Lesch, Veronika
    Versluis, Laurens
    Ilyushkin, Alexey
    Herbst, Nikolas
    Kounev, Samuel
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 2015 - 2025
  • [40] Predictive VNF auto-scaling based on genetic programming
    Rojia Nikbazm
    Mahmood Ahmadi
    Neural Computing and Applications, 2025, 37 (5) : 3129 - 3150