On the Value of Service Demand Estimation for Auto-scaling

被引：17

作者：

Bauer, Andre ^{[1
]}

Grohmann, Johannes ^{[1
]}

Herbst, Nikolas ^{[1
]}

Kounev, Samuel ^{[1
]}

机构：

[1] Univ Wurzburg, Wurzburg, Germany

来源：

MEASUREMENT, MODELLING AND EVALUATION OF COMPUTING SYSTEMS, MMB 2018 | 2018年 / 10740卷

关键词：

Service demand estimation; Auto-scaling; Online estimation; Elastic cloud computing;

D O I：

10.1007/978-3-319-74947-1_10

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In the context of performance models, service demands are key model parameters capturing the average time individual requests of different workload classes are actively processed. In a system under load, due to measurement interference, service demands normally cannot be measured directly, however, a number of estimation approaches exist based on high-level performance metrics. In this paper, we show that service demands provide significant benefits for implementing modern auto-scalers. Auto-scaling describes the process of dynamically adjusting the number of allocated virtual resources (e.g., virtual machines) in a data center according to the incoming workload. We demonstrate that even a simple auto-scaler that leverages information about service demands significantly outperforms auto-scalers solely based on CPU utilization measurements. This is shown by testing two approaches in three different scenarios. Our results show that the service demand-based auto-scaler outperforms the CPU utilization-based one in all scenarios. Our results encourage further research on the application of service demand estimates for resource management in data centers.

引用

页码：142 / 156

页数：15

共 50 条

[31] Concurrent service auto-scaling for Knative resource quota-based serverless system
Tran, Minh-Ngoc
Kim, YoungHan
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 160 : 326 - 339
[32] A cost-aware auto-scaling approach using the workload prediction in service clouds
Yang, Jingqi
Liu, Chuanchang
Shang, Yanlei
Cheng, Bo
Mao, Zexiang
Liu, Chunhong
Niu, Lisha
Chen, Junliang
INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
[33] A Holistic Auto-Scaling Algorithm for Multi-Service Applications Based on Balanced Queuing Network
Tong, Jingwan
Wei, Mingchang
Pan, Maolin
Yu, Yang
2021 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2021, 2021, : 531 - 540
[34] A survey on auto-scaling: how to exploit cloud elasticity
Catillo, Marta
Villano, Umberto
Rak, Massimiliano
INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2023, 14 (01) : 37 - 50
[35] An Auto-Scaling Approach for Microservices in Cloud Computing Environments
Matineh ZargarAzad
Mehrdad Ashtiani
Journal of Grid Computing, 2023, 21
[36] Multi-objective auto-scaling scheduling for micro-service workflows in hybrid clouds
Wang, Shijia
Liu, Xuan
Gao, Ming
Chen, Mingxia
Yung, Kai Leung
Jiang, Shancheng
ENTERPRISE INFORMATION SYSTEMS, 2023, 17 (07)
[37] Auto-scaling Applications in Edge Computing: Taxonomy and Challenges
Taherizadeh, Salman
Stankovski, Vlado
INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS (BDIOT 2017), 2017, : 158 - 163
[38] An Autonomic Auto-scaling Controller for Cloud Based Applications
Londono-Peldaez, Jorge M.
Florez-Samur, Carlos A.
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (09) : 1 - 6
[39] Chamulteon: Coordinated Auto-Scaling of Micro-Services
Bauer, Andre
Lesch, Veronika
Versluis, Laurens
Ilyushkin, Alexey
Herbst, Nikolas
Kounev, Samuel
2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 2015 - 2025
[40] Predictive VNF auto-scaling based on genetic programming
Rojia Nikbazm
Mahmood Ahmadi
Neural Computing and Applications, 2025, 37 (5) : 3129 - 3150

← 1 2 3 4 5 →