Model-Based Reinforcement Learning Framework of Online Network Resource Allocation

被引:1
|
作者
Bakhshi, Bahador [1 ]
Mangues-Bafalluy, Josep [1 ]
机构
[1] Ctr Tecnol Telecomunicac Catalunya CTTC CERCA, Barcelona, Spain
关键词
Online Resource Allocation; Model-based Reinforcement Learning; Service Federation; Continual Learning;
D O I
10.1109/ICC45855.2022.9838782
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Online Network Resource Allocation (ONRA) for service provisioning is a fundamental problem in communication networks. As a sequential decision-making under uncertainty problem, it is promising to approach ONRA via Reinforcement Learning (RL). But, RL solutions suffer from the sample complexity issue; i.e., a large number of interactions with the environment are needed to find an efficient policy. This is a barrier to utilize RL for ONRA as on one hand, it is not practical to train the RL agent offline due to lack of information about future requests, and on the other hand, online training in the real network leads to significant performance loss because of the sub-optimal policy during the prolonged learning time. The performance degradation is even higher in non-stationary ONRA where the agent should continually adapt the policy with the changes in service requests. To address this issue, we develop a general resource allocation framework, RADAR, using model-based RL for a class of ONRA problems with known immediate rewards. RADAR improves sample efficiency via exploring the state space in background and exploiting the policy in decision-time using synthetic samples by the model of the environment, which is trained by real interactions. Applying RADAR on the multi-domain service federation problem to maximize profit via selecting proper domains for service requests deployment, shows its continual learning capability and up to 44% performance improvement w.r.t. the standard model-free RL solution.
引用
收藏
页码:4456 / 4461
页数:6
相关论文
共 50 条
  • [1] Model-Based Reinforcement Learning With Kernels for Resource Allocation in RAN Slices
    Alcaraz, Juan J.
    Losilla, Fernando
    Zanella, Andrea
    Zorzi, Michele
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (01) : 486 - 501
  • [2] Reinforcement learning-based online resource allocation for edge computing network
    Li Y.-J.
    Jiang H.-T.
    Gao M.-H.
    [J]. Kongzhi yu Juece/Control and Decision, 2022, 37 (11): : 2880 - 2886
  • [3] Deep Reinforcement Learning for Online Resource Allocation in Network Slicing
    Cai, Yue
    Cheng, Peng
    Chen, Zhuo
    Ding, Ming
    Vucetic, Branka
    Li, Yonghui
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 7099 - 7116
  • [4] CLARA: A Constrained Reinforcement Learning Based Resource Allocation Framework for Network Slicing
    Liu, Yongshuai
    Ding, Jiaxin
    Zhang, Zhi-Li
    Liu, Xin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1427 - 1437
  • [5] Model-based reinforcement learning approach for federated learning resource allocation and parameter optimization
    Karami, Farzan
    Khalaj, Babak Hossein
    [J]. COMPUTER COMMUNICATIONS, 2024, 228
  • [6] Reinforcement Learning with Model-Based Approaches for Dynamic Resource Allocation in a Tandem Queue
    Tournaire, Thomas
    Barthelemy, Jeanne
    Castel-Taleb, Hind
    Hyon, Emmanuel
    [J]. PERFORMANCE ENGINEERING AND STOCHASTIC MODELING, 2021, 13104 : 243 - 263
  • [7] Online Constrained Model-based Reinforcement Learning
    van Niekerk, Benjamin
    Damianou, Andreas
    Rosman, Benjamin
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [8] Reinforcement learning based monotonic policy for online resource allocation
    Mishra, Pankaj
    Moustafa, Ahmed
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 138 : 313 - 327
  • [9] MIRAS: Model-based Reinforcement Learning for Microservice Resource Allocation over Scientific Workflows
    Yang, Zhe
    Phuong Nguyen
    Jin, Haiming
    Nahrstedt, Klara
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 122 - 132
  • [10] Network Resource Allocation Strategy Based on Deep Reinforcement Learning
    Zhang, Shidong
    Wang, Chao
    Zhang, Junsan
    Duan, Youxiang
    You, Xinhong
    Zhang, Peiying
    [J]. IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2020, 1 (01): : 86 - 94