Adaptive Auto-Scaling of Delay-Sensitive Serverless Services with Reinforcement Learning

被引：7

作者：

Zhang, Zhiyu ^{[1
,3
]}

Wang, Tao ^{[2
,3
]}

Li, An ^{[3
]}

Zhang, Wenbo ^{[2
,3
]}

机构：

[1] Univ Chinese Acad Sci, Beijing 100190, Peoples R China

[2] Chinese Acad Sci, State Key Lab Comp Sci, Inst Software, Beijing 100190, Peoples R China

[3] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China

来源：

2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022) | 2022年

基金：

中国国家自然科学基金;

关键词：

Serverless Computing; Auto-scaling; Reinforcement Learning; Quality of Service;

D O I：

10.1109/COMPSAC54236.2022.00137

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Serverless services such as image recognition and natural language processing have strict response-time constraints. The incoming workloads and resource requirements of a newly deployed serverless service are always unpredictable due to the lack of available historical tracing data. Therefore, making effective auto-scaling decisions for these services is challenging. Open source serverless platforms often work in a best-effort manner, which cannot guarantee the response delay. Moreover, existing studies usually adopt threshold-based methods by configuring additional resource, which cannot well balance the trade-off between the quality of service and resource efficiency. To address the above issues, we propose an adaptive auto-scaling approach for delay-sensitive serverless services with reinforcement learning. First, we characterize the service's resource profile by exploring the performance improvement of different resource allocations with the reinforcement learning method. Then, we propose an adaptive auto-scaling method combining both horizontal and vertical scaling strategies based on the characterized profile to dynamically adjust the resource allocation. Finally, we select three typical services to validate our approach by comparing with two existing state-of-the-art auto-scaling methods. The experimental results show that our approach can accurately characterize services' resource profile, and effectively ensure the response delay constraints while achieving about 10.50% reduction of cost on average.

引用

页码：866 / 871

页数：6

共 50 条

[1] AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments
Schuler, Lucia
Jamil, Somaya
Kuehl, Niklas
[J]. 21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 804 - 811
[2] Proactive Auto-Scaling for Delay-Sensitive IoT Applications Over Edge Clouds
Wang, Weimeng
Liu, Lei
Yan, Zhongmin
[J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (06) : 9536 - 9546
[3] Reinforcement Learning Applicability for Resource-Based Auto-scaling in Serverless Edge Applications
Benedetti, Priscilla
Femminella, M.
Reali, G.
Steenhaut, Kris
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2022,
[4] Factored Reinforcement Learning for Auto-scaling in Tandem Queues
Tournaire, Thomas
Jin, Yue
Aghasaryan, Armen
Castel-Taleb, Hind
Hyon, Emmanuel
[J]. PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
[5] Auto-scaling mechanisms in serverless computing: A comprehensive review
Tari, Mohammad
Ghobaei-Arani, Mostafa
Pouramini, Jafar
Ghorbian, Mohsen
[J]. COMPUTER SCIENCE REVIEW, 2024, 53
[6] A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling
Arabnejad, Hamid
Pahl, Claus
Jamshidi, Pooyan
Estrada, Giovani
[J]. 2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 64 - 73
[7] Auto-scaling for a Streaming Architecture with Fuzzy Deep Reinforcement Learning
Dong Nguyen Doan
Zaharie, Daniela
Petcu, Dana
[J]. EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 476 - 488
[8] Auto-Scaling with Apprenticeship Learning
Hakimzadeh, Kamal
Nicholson, Patrick K.
Lugones, Diego
[J]. PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '18), 2018, : 512 - 512
[9] Faa$T: A Transparent Auto-Scaling Cache for Serverless Applications
Romero, Francisco
Chaudhry, Gohar Irfan
Goiri, Inigo
Gopa, Pragna
Batum, Paul
Yadwadkar, Neeraja J.
Fonseca, Rodrigo
Kozyrakis, Christos
Bianchini, Ricardo
[J]. PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 122 - 137
[10] Towards Autonomous VNF Auto-scaling using Deep Reinforcement Learning
Soto, Paola
De Vleeschauwer, Danny
Camelo, Miguel
De Bock, Yorick
De Schepper, Koen
Chang, Chia-Yu
Hellinckx, Peter
Botero, Juan F.
Latre, Steven
[J]. 2021 EIGHTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS), 2021, : 74 - 81

← 1 2 3 4 5 →