Elastic Management of Cloud Applications using Adaptive Reinforcement Learning

被引:0
|
作者
Lolos, Konstantinos [1 ]
Konstantinou, Ioannis [1 ]
Kantere, Verena [1 ]
Koziris, Nectarios [1 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens, Greece
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern large-scale computing deployments consist of complex applications running over machine clusters. An important issue in these is the offering of elasticity, i.e., the dynamic allocation of resources to applications to meet fluctuating workload demands. Threshold based approaches are typically employed, yet they are difficult to calibrate and optimize. Approaches based on reinforcement learning (RL) have been proposed, but they require a large number of states in order to model complex application behavior. Methods that adaptively partition the state space have been proposed, but their partitioning criteria and strategies are sub-optimal. In this work we present MDP_DT, a novel full-model based reinforcement learning algorithm for elastic resource management that employs adaptive state space partitioning. We propose two novel statistical criteria and three strategies and we experimentally prove that they correctly decide both where and when to partition, outperforming existing approaches. We experimentally evaluate MDP_DT in a real large scale cluster over variable not-encountered workloads and we show that it takes more informed decisions compared to static, model-free and threshold approaches, while requiring a minimal amount of training data. We experimentally show that this adaptation enabled MDP_DT to optimize the achieved profit while being 40% cheaper than calibrated RL and threshold approaches.
引用
收藏
页码:203 / 212
页数:10
相关论文
共 50 条
  • [1] Efficient Adaptive Resource Provisioning for Cloud Applications using Reinforcement Learning
    John, Indu
    Bhatnagar, Shalabh
    Sreekantan, Aiswarya
    [J]. 2019 IEEE 4TH INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W 2019), 2019, : 271 - 272
  • [2] Model-free Resource Management of Cloud-based applications using Reinforcement Learning
    Jin, Yue
    Bouzid, Makram
    Kostadinov, Dimitre
    Aghasaryan, Armen
    [J]. 2018 21ST CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS AND WORKSHOPS (ICIN), 2018,
  • [3] Adaptive Service Management in Mobile Cloud Computing by Means of Supervised and Reinforcement Learning
    Nawrocki, Piotr
    Sniezynski, Bartlomiej
    [J]. JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2018, 26 (01) : 1 - 22
  • [4] Adaptive Service Management in Mobile Cloud Computing by Means of Supervised and Reinforcement Learning
    Piotr Nawrocki
    Bartlomiej Sniezynski
    [J]. Journal of Network and Systems Management, 2018, 26 : 1 - 22
  • [5] An Adaptive Power Management Scheme for WLANs using Reinforcement Learning
    Lim, Tae Hyun
    Rhee, Seung Hyong
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 412 - 415
  • [6] Adaptive Control for Building Energy Management Using Reinforcement Learning
    Eller, Lukas
    Siafara, Lydia C.
    Sauter, Thilo
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2018, : 1562 - 1567
  • [7] COUNSEL: Cloud Resource Configuration Management using Deep Reinforcement Learning
    Hegde, Adithya
    Kulkarni, Sameer G.
    Prasad, Abhinandan S.
    [J]. 2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 286 - 298
  • [8] Adaptive Traffic Grooming Using Reinforcement Learning in Multilayer Elastic Optical Networks
    Tanaka, Takafumi
    [J]. 2023 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION, OFC, 2023,
  • [9] An Adaptive Decision Making Approach based on Reinforcement Learning for Self-Managed Cloud Applications
    Yan, Yongming
    Zhang, Bin
    Guo, Jun
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 720 - 723
  • [10] Adaptive Parameterized Control for Coordinated Traffic Management Using Reinforcement Learning
    Sun, Dingshan
    Jamshidnejad, Anahita
    De Schutter, Bart
    [J]. IFAC PAPERSONLINE, 2023, 56 (02): : 5463 - 5468