Delay-Aware Stochastic Resource Management for Mobile Edge Computing Systems via Constrained Reinforcement Learning

Cited by: 1
Authors
Tian, Chang [1]
Liu, An [2]
Luo, Wu [1]
Affiliations
[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing 100871, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
Funding
US National Science Foundation
Keywords
Task analysis; Resource management; Delays; Servers; Edge computing; Computer architecture; Reinforcement learning; Mobile edge computing; delay-constrained; constrained reinforcement learning; application-specific design; ALLOCATION
DOI
10.1109/LWC.2021.3112984
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We design a joint radio and computational resource allocation policy for a multi-user mobile edge computing system that minimizes the expected power consumption while satisfying long-term delay constraints. The problem is formulated as a constrained Markov decision process (CMDP) and solved efficiently by the proposed constrained reinforcement learning (CRL) algorithm, successive convex programming based policy optimization (SCPPO). At each update, SCPPO solves a convex objective or feasibility surrogate problem, and it provably converges almost surely to a Karush-Kuhn-Tucker (KKT) point of the original CMDP under mild conditions. Moreover, SCPPO adopts an application-specific policy architecture and a data-efficient estimation strategy that reuses old experiences, enabling fast learning with low computational complexity.
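For orientation, the kind of problem the abstract describes can be written in a generic average-cost CMDP form. This is only a sketch under assumed notation, not the paper's own formulation: the policy parameter $\theta$, state $s_t$, action $a_t$, per-step power cost $P$, per-user delay cost $D_k$, delay budget $d_k$, number of users $K$, and the surrogate functions $\bar{J}_k^{(n)}$ are all symbols introduced here for illustration.

```latex
% Generic CMDP (assumed notation): minimize long-term average power
% subject to long-term average delay constraints for each user k.
\begin{aligned}
\min_{\theta}\quad & J_0(\theta) = \lim_{T\to\infty}\frac{1}{T}\,
  \mathbb{E}_{\pi_\theta}\!\left[\sum_{t=0}^{T-1} P(s_t,a_t)\right] \\
\text{s.t.}\quad & J_k(\theta) = \lim_{T\to\infty}\frac{1}{T}\,
  \mathbb{E}_{\pi_\theta}\!\left[\sum_{t=0}^{T-1} D_k(s_t,a_t)\right] - d_k \le 0,
  \qquad k = 1,\dots,K.
\end{aligned}

% One plausible reading of the "objective/feasibility surrogate" update
% at iteration n, with convex surrogates \bar{J}_k^{(n)} built from samples:
\begin{aligned}
\text{(objective update)}\quad & \min_{\theta}\ \bar{J}_0^{(n)}(\theta)
  \quad \text{s.t.}\quad \bar{J}_k^{(n)}(\theta) \le 0,\ \ k=1,\dots,K, \\
\text{(feasibility update)}\quad & \min_{\theta,\,\alpha}\ \alpha
  \quad \text{s.t.}\quad \bar{J}_k^{(n)}(\theta) \le \alpha,\ \ k=1,\dots,K.
\end{aligned}
```

In this reading, the feasibility update would be used when the surrogate constraints cannot all be satisfied at the current iterate. The abstract states only that a convex objective or feasibility surrogate problem is solved at each update, so the exact switching rule and surrogate construction shown here are assumptions.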
Pages: 2708-2712
Page count: 5