Delay-Aware Stochastic Resource Management for Mobile Edge Computing Systems via Constrained Reinforcement Learning

被引:1
|
作者
Tian, Chang [1 ]
Liu, An [2 ]
Luo, Wu [1 ]
机构
[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing 100871, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
基金
美国国家科学基金会;
关键词
Task analysis; Resource management; Delays; Servers; Edge computing; Computer architecture; Reinforcement learning; Mobile edge computing; delay-constrained; constrained reinforcement learning; application-specific design; ALLOCATION;
D O I
10.1109/LWC.2021.3112984
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We design a joint radio and computational resource allocation policy for a multi-user mobile edge computing system, such that the expected power consumption is minimized while satisfying long-term delay constraints. The problem is formulated as a constrained Markov decision process (CMDP) that is efficiently solved by the proposed constrained reinforcement learning (CRL) algorithm, called successive convex programming based policy optimization (SCPPO). SCPPO solves a convex objective/feasibility surrogate problem at each update and it can provably converge to a Karush-Kuhn-Tucker (KKT) point of the original CMDP problem almost surely under some mild conditions. Moreover, SCPPO adopts an application-specific policy architecture and employs a data-efficient estimation strategy that can reuse old experiences, such that SCPPO can realize fast learning with low computational complexity.
引用
收藏
页码:2708 / 2712
页数:5
相关论文
共 50 条
  • [21] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
    Fu, Fang
    Zhang, Zhicai
    Yu, Fei Richard
    Yan, Qiao
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (08) : 1875 - 1889
  • [22] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
    Fang Fu
    Zhicai Zhang
    Fei Richard Yu
    Qiao Yan
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 1875 - 1889
  • [23] Energy-efficient and delay-aware multitask offloading for mobile edge computing networks
    Chanyour, Tarik
    El Ghmary, Mohamed
    Hmimz, Youssef
    Malki, Mohammed Oucamah Cherkaoui
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (03)
  • [24] Energy-efficient and delay-aware multitask offloading for mobile edge computing networks
    Chanyour, Tarik
    El Ghmary, Mohamed
    Hmimz, Youssef
    Malki, Mohammed Oucamah Cherkaoui
    [J]. MOLECULES, 2022, 27 (05):
  • [25] Deep Reinforcement Learning for Performance-Aware Adaptive Resource Allocation in Mobile Edge Computing
    Huang, Binbin
    Li, Zhongjin
    Xu, Yunqiu
    Pan, Linxuan
    Wang, Shangguang
    Hu, Haiyang
    Chang, Victor
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [26] Delay-Aware Power Control for Downlink Multi-User MIMO via Constrained Deep Reinforcement Learning
    Tian, Chang
    Huang, Guan
    Liu, An
    Luo, Wu
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [27] PDMA: Probabilistic service migration approach for delay-aware and mobility-aware mobile edge computing
    Xu, Minxian
    Zhou, Qiheng
    Wu, Huaming
    Lin, Weiwei
    Ye, Kejiang
    Xu, Chengzhong
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (02): : 394 - 414
  • [28] Mobile-aware dynamic resource management for edge computing
    Filiposka, Sonja
    Mishev, Anastas
    Gilly, Katja
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2019, 30 (06):
  • [29] Request-Aware Task Offloading in Mobile Edge Computing via Deep Reinforcement Learning
    Sheng, Ziwen
    Mao, Yingchi
    Wang, Jiajun
    Nie, Hua
    Huang, Jianxin
    [J]. 2022 TENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, CBD, 2022, : 294 - 299
  • [30] A Computing Offloading Resource Allocation Scheme Using Deep Reinforcement Learning in Mobile Edge Computing Systems
    Li, Xuezhu
    [J]. JOURNAL OF GRID COMPUTING, 2021, 19 (03)