Adaptive Auto-Scaling of Delay-Sensitive Serverless Services with Reinforcement Learning

Cited by: 7
Authors
Zhang, Zhiyu [1, 3]
Wang, Tao [2, 3]
Li, An [3]
Zhang, Wenbo [2, 3]
Affiliations
[1] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, State Key Lab Comp Sci, Inst Software, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Serverless Computing; Auto-scaling; Reinforcement Learning; Quality of Service
DOI
10.1109/COMPSAC54236.2022.00137
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Serverless services such as image recognition and natural language processing have strict response-time constraints. Because a newly deployed serverless service has no historical trace data, its incoming workload and resource requirements are hard to predict, which makes effective auto-scaling decisions challenging. Open-source serverless platforms often work in a best-effort manner and cannot guarantee response delay. Moreover, existing studies usually adopt threshold-based methods that provision additional resources, which do not balance quality of service against resource efficiency well. To address these issues, we propose an adaptive auto-scaling approach for delay-sensitive serverless services based on reinforcement learning. First, we characterize a service's resource profile by using reinforcement learning to explore the performance improvement obtained from different resource allocations. Then, we propose an adaptive auto-scaling method that combines horizontal and vertical scaling strategies, based on the characterized profile, to adjust resource allocation dynamically. Finally, we validate the approach on three typical services by comparing it with two state-of-the-art auto-scaling methods. The experimental results show that our approach accurately characterizes services' resource profiles and effectively satisfies the response delay constraints while reducing cost by about 10.50% on average.
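The two steps named in the abstract (learning a resource profile by exploration, then combining vertical and horizontal scaling) can be illustrated with a minimal sketch. This is not the authors' algorithm: the CPU options, the latency model `simulated_latency_ms`, the reward shape, the 200 ms constraint, and the one-request-per-replica sizing rule are all hypothetical assumptions chosen only to make the idea concrete.

```python
import math
import random

# Hypothetical values for illustration only; none come from the paper.
CPU_OPTIONS = [0.25, 0.5, 1.0, 2.0]   # candidate per-replica CPU allocations (cores)
SLO_MS = 200.0                        # assumed response-delay constraint


def simulated_latency_ms(cpu):
    """Toy stand-in for a real measurement: latency shrinks as CPU grows."""
    return 40.0 + 100.0 / cpu


def reward(cpu, latency_ms):
    """Penalize delay violations heavily; otherwise prefer cheaper allocations."""
    if latency_ms > SLO_MS:
        return -10.0
    return -cpu  # cost assumed proportional to allocated CPU


def learn_allocation(episodes=500, alpha=0.1, epsilon=0.2, seed=0):
    """Single-state, epsilon-greedy value learning over CPU allocations."""
    rng = random.Random(seed)
    q = {cpu: 0.0 for cpu in CPU_OPTIONS}
    for _ in range(episodes):
        if rng.random() < epsilon:
            cpu = rng.choice(CPU_OPTIONS)   # explore a random allocation
        else:
            cpu = max(q, key=q.get)         # exploit the current best estimate
        r = reward(cpu, simulated_latency_ms(cpu))
        q[cpu] += alpha * (r - q[cpu])      # incremental value update
    return q


def plan_scaling(q, rate_rps):
    """Vertical step: pick the learned CPU size. Horizontal step: size the replica count."""
    cpu = max(q, key=q.get)
    latency_ms = simulated_latency_ms(cpu)
    # Assumes each replica serves one request at a time, so the number of
    # busy replicas is roughly arrival rate times latency (Little's law).
    replicas = max(1, math.ceil(rate_rps * latency_ms / 1000.0))
    return cpu, replicas


if __name__ == "__main__":
    q_values = learn_allocation()
    print(plan_scaling(q_values, rate_rps=60.0))
```

In this toy model the cheapest allocation that meets the assumed 200 ms constraint is 1.0 core, so the learner is expected to settle there. A real system would replace `simulated_latency_ms` with measured response times and would need a richer state, such as the current load.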
Pages: 866-871
Number of pages: 6
Related papers
50 in total
  • [1] AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments
    Schuler, Lucia
    Jamil, Somaya
    Kuehl, Niklas
    [J]. 21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 804 - 811
  • [2] Proactive Auto-Scaling for Delay-Sensitive IoT Applications Over Edge Clouds
    Wang, Weimeng
    Liu, Lei
    Yan, Zhongmin
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (06) : 9536 - 9546
  • [3] Reinforcement Learning Applicability for Resource-Based Auto-scaling in Serverless Edge Applications
    Benedetti, Priscilla
    Femminella, M.
    Reali, G.
    Steenhaut, Kris
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2022
  • [4] Factored Reinforcement Learning for Auto-scaling in Tandem Queues
    Tournaire, Thomas
    Jin, Yue
    Aghasaryan, Armen
    Castel-Taleb, Hind
    Hyon, Emmanuel
    [J]. PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022
  • [5] Auto-scaling mechanisms in serverless computing: A comprehensive review
    Tari, Mohammad
    Ghobaei-Arani, Mostafa
    Pouramini, Jafar
    Ghorbian, Mohsen
    [J]. COMPUTER SCIENCE REVIEW, 2024, 53
  • [6] A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling
    Arabnejad, Hamid
    Pahl, Claus
    Jamshidi, Pooyan
    Estrada, Giovani
    [J]. 2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 64 - 73
  • [7] Auto-scaling for a Streaming Architecture with Fuzzy Deep Reinforcement Learning
    Dong Nguyen Doan
    Zaharie, Daniela
    Petcu, Dana
    [J]. EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 476 - 488
  • [8] Auto-Scaling with Apprenticeship Learning
    Hakimzadeh, Kamal
    Nicholson, Patrick K.
    Lugones, Diego
    [J]. PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '18), 2018, : 512 - 512
  • [9] Faa$T: A Transparent Auto-Scaling Cache for Serverless Applications
    Romero, Francisco
    Chaudhry, Gohar Irfan
    Goiri, Inigo
    Gopa, Pragna
    Batum, Paul
    Yadwadkar, Neeraja J.
    Fonseca, Rodrigo
    Kozyrakis, Christos
    Bianchini, Ricardo
    [J]. PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 122 - 137
  • [10] Towards Autonomous VNF Auto-scaling using Deep Reinforcement Learning
    Soto, Paola
    De Vleeschauwer, Danny
    Camelo, Miguel
    De Bock, Yorick
    De Schepper, Koen
    Chang, Chia-Yu
    Hellinckx, Peter
    Botero, Juan F.
    Latre, Steven
    [J]. 2021 EIGHTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS), 2021, : 74 - 81