COUNSEL: Cloud Resource Configuration Management using Deep Reinforcement Learning

被引:0
|
作者
Hegde, Adithya [1 ]
Kulkarni, Sameer G. [2 ]
Prasad, Abhinandan S. [1 ]
机构
[1] Natl Inst Engn, Mysuru, India
[2] Indian Inst Technol Gandhinagar, Gandhinagar, India
关键词
Cloud computing; Microservices; Configuration Management; Autoscaling; Deep Reinforcement Learning;
D O I
10.1109/CCGRID57682.2023.00035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Internet Clouds are essentially service factories that offer various networked services through different service models, viz., Infrastructure, Platform, Software, and Functions as a Service. Meeting the desired service level objectives (SLOs) while ensuring efficient resource utilization requires significant efforts to provision the associated cloud resources correctly and on time. Therefore, one of the critical issues for any cloud service provider is resource configuration management. On one end, i.e., from the cloud operator's perspective, resource management affects overall resource utilization and efficiency. In contrast, from the cloud user/customer perspective, resource configuration affects the performance, cost, and offered SLOs. However, the state-of-the-art solutions for finding the configurations are limited to a single component or handle static workloads. Further, these solutions are computationally expensive and introduce profiling overhead, limiting scalability. Therefore, we propose COUNSEL, a deep reinforcement learning-based framework to handle the dynamic workloads and efficiently manage the configurations of an arbitrary multi-component service. We evaluate COUNSEL with three initial policies: over-provisioning, under-provisioning, and expert provisioning. In all the cases, COUNSEL eliminates the profiling overhead and achieves the average reward between 20- 60% without violating the SLOs and budget constraints. Moreover, the inference time of COUNSEL has a constant time complexity.
引用
收藏
页码:286 / 298
页数:13
相关论文
共 50 条
  • [21] Implementation of Trusted Traceability Query Using Blockchain and Deep Reinforcement Learning in Resource Management
    Jiang, Yunting
    Lei, Yalin
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [22] Resource management of cloud-enabled systems using model-free reinforcement learning
    Yue Jin
    Makram Bouzid
    Dimitre Kostadinov
    Armen Aghasaryan
    Annals of Telecommunications, 2019, 74 : 625 - 636
  • [23] Resource management of cloud-enabled systems using model-free reinforcement learning
    Jin, Yue
    Bouzid, Makram
    Kostadinov, Dimitre
    Aghasaryan, Armen
    ANNALS OF TELECOMMUNICATIONS, 2019, 74 (9-10) : 625 - 636
  • [24] Model-free Resource Management of Cloud-based applications using Reinforcement Learning
    Jin, Yue
    Bouzid, Makram
    Kostadinov, Dimitre
    Aghasaryan, Armen
    2018 21ST CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS AND WORKSHOPS (ICIN), 2018,
  • [25] Resource Management in Multi-Cloud Scenarios via Reinforcement Learning
    Pietrabissa, Antonio
    Battilotti, Stefano
    Facchinei, Francisco
    Giuseppi, Alessandro
    Oddi, Guido
    Panfili, Martina
    Suraci, Vincenzo
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 9084 - 9089
  • [26] Deep-Hill: An Innovative Cloud Resource Optimization Algorithm by Predicting SaaS Instance Configuration Using Deep Learning
    Abouelyazid, Mahmoud
    IEEE ACCESS, 2024, 12 : 92573 - 92584
  • [27] Resource allocation for content distribution in IoT edge cloud computing environments using deep reinforcement learning
    Neelakantan, Puligundla
    Gangappa, Malige
    Rajasekar, Mummalaneni
    Kumar, Talluri Sunil
    Reddy, Gali Suresh
    JOURNAL OF HIGH SPEED NETWORKS, 2024, 30 (03) : 409 - 426
  • [28] Resource Allocation Strategy Using Deep Reinforcement Learning in Cloud-Edge Collaborative Computing Environment
    Cen, Junjie
    Li, Yongbo
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [29] Computational Resource Sharing in a Vehicular Cloud Network via Deep Reinforcement Learning
    Xu, Shilin
    Guo, Caili
    Hu, Rose Qingyang
    Qian, Yi
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [30] Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning
    Chen, Zheyi
    Hu, Jia
    Min, Geyong
    Luo, Chunbo
    El-Ghazawi, Tarek
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (08) : 1911 - 1923