COUNSEL: Cloud Resource Configuration Management using Deep Reinforcement Learning

Cited by: 0

Authors:

Hegde, Adithya [1]

Kulkarni, Sameer G. [2]

Prasad, Abhinandan S. [1]

Affiliations:

[1] Natl Inst Engn, Mysuru, India

[2] Indian Inst Technol Gandhinagar, Gandhinagar, India

Source:

2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID | 2023

Keywords:

Cloud computing; Microservices; Configuration Management; Autoscaling; Deep Reinforcement Learning;

DOI:

10.1109/CCGRID57682.2023.00035

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Internet Clouds are essentially service factories that offer various networked services through different service models, viz., Infrastructure, Platform, Software, and Functions as a Service. Meeting the desired service level objectives (SLOs) while ensuring efficient resource utilization requires significant effort to provision the associated cloud resources correctly and on time. Therefore, one of the critical issues for any cloud service provider is resource configuration management. From the cloud operator's perspective, resource management affects overall resource utilization and efficiency. From the cloud user's (customer's) perspective, resource configuration affects the performance, cost, and offered SLOs. However, the state-of-the-art solutions for finding configurations are limited to a single component or handle only static workloads. Further, these solutions are computationally expensive and introduce profiling overhead, limiting scalability. Therefore, we propose COUNSEL, a deep reinforcement learning-based framework that handles dynamic workloads and efficiently manages the configurations of an arbitrary multi-component service. We evaluate COUNSEL with three initial policies: over-provisioning, under-provisioning, and expert provisioning. In all cases, COUNSEL eliminates the profiling overhead and achieves an average reward between 20% and 60% without violating the SLOs and budget constraints. Moreover, the inference time of COUNSEL has constant time complexity.


Pages: 286-298

Page count: 13

