Reinforcement Learning Applicability for Resource-Based Auto-scaling in Serverless Edge Applications

被引:10
|
作者
Benedetti, Priscilla [1 ,2 ]
Femminella, M. [3 ]
Reali, G. [3 ]
Steenhaut, Kris [2 ]
机构
[1] Univ Perugia, Dept Engn, Via GDuranti 93, Perugia, Italy
[2] Vrije Univ Brussel, ETRO, Pl Laan 2, Brussels, Belgium
[3] Univ Perugia, Dept Eng, CNIT RU, Via G Duranti 93, Perugia, Italy
关键词
edge computing; serverless computing; reinforcement learning; Kubernetes; OpenFaaS;
D O I
10.1109/PerComWorkshops53856.2022.9767437
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Serverless computing is an alternative deployment paradigm for cloud computing platforms, aimed to provide scalability and cost reduction without requiring any additional deployment overhead from developers. Generally, open-source serverless computing platforms rely on two auto-scaling approaches: workload-based and resource-based. In the former, a designated algorithm scales instances according to the number of incoming requests. In the latter, instances are scaled when a certain resource usage limit, such as maximum Central Processing Unit (CPU) utilization, is reached. Resource-based auto-scaling is usually implemented leveraging Kubernetes Horizontal Pod Autoscaler (HPA). In this work, we investigate the applicability of a reinforcement-based approach to resource-based auto-scaling in OpenFaaS, the most widely used open-source serverless platform. Serverless technologies are particularly convenient when dealing with edge computing on constrained devices or resource-limited machines. Our experimental analysis has been conducted on constrained Kubernetes-based nodes, to simulate such an edge application scenario. Its preliminary results show that our proposed model learns an effective scaling policy, based on CPU utilization, to provide minimal service latency within a limited number of iterations.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments
    Schuler, Lucia
    Jamil, Somaya
    Kuehl, Niklas
    [J]. 21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 804 - 811
  • [2] Adaptive Auto-Scaling of Delay-Sensitive Serverless Services with Reinforcement Learning
    Zhang, Zhiyu
    Wang, Tao
    Li, An
    Zhang, Wenbo
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 866 - 871
  • [3] Optimized resource usage with hybrid auto-scaling system for knative serverless edge computing
    Tran, Minh-Ngoc
    Kim, Younghan
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 152 : 304 - 316
  • [4] Faa$T: A Transparent Auto-Scaling Cache for Serverless Applications
    Romero, Francisco
    Chaudhry, Gohar Irfan
    Goiri, Inigo
    Gopa, Pragna
    Batum, Paul
    Yadwadkar, Neeraja J.
    Fonseca, Rodrigo
    Kozyrakis, Christos
    Bianchini, Ricardo
    [J]. PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 122 - 137
  • [5] Concurrent service auto-scaling for Knative resource quota-based serverless system
    Tran, Minh-Ngoc
    Kim, YoungHan
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 160 : 326 - 339
  • [6] Resource auto-scaling for SQL-like queries in the cloud based on parallel reinforcement learning
    Kandi, Mohamed Mehdi
    Yin, Shaoyi
    Hameurlain, Abdelkader
    [J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2019, 10 (06) : 654 - 671
  • [7] Deep Learning Based Resource Allocation For Auto-Scaling VNFs
    Patel, Yashwant Singh
    Verma, Deepak
    Misra, Rajiv
    [J]. 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATION SYSTEMS (IEEE ANTS), 2019,
  • [8] Resource auto-scaling for SQL-like queries in the cloud based on parallel reinforcement learning
    Kandi, Mohamed Mehdi
    Yin, Shaoyi
    Hameurlain, Abdelkader
    [J]. International Journal of Grid and Utility Computing, 2019, 10 (06): : 654 - 671
  • [9] Auto-Scaling Approach for Cloud based Mobile Learning Applications
    Almutlaq, Amani Nasser
    Daadaa, Yassine
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 472 - 479
  • [10] Machine learning-based auto-scaling for containerized applications
    Imdoukh, Mahmoud
    Ahmad, Imtiaz
    Alfailakawi, Mohammad Gh
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 9745 - 9760