Delay-Aware Stochastic Resource Management for Mobile Edge Computing Systems via Constrained Reinforcement Learning

被引：1

作者：

Tian, Chang ^{[1
]}

Liu, An ^{[2
]}

Luo, Wu ^{[1
]}

机构：

[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing 100871, Peoples R China

[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China

来源：

IEEE WIRELESS COMMUNICATIONS LETTERS | 2021年 / 10卷 / 12期

基金：

美国国家科学基金会;

关键词：

Task analysis; Resource management; Delays; Servers; Edge computing; Computer architecture; Reinforcement learning; Mobile edge computing; delay-constrained; constrained reinforcement learning; application-specific design; ALLOCATION;

D O I：

10.1109/LWC.2021.3112984

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We design a joint radio and computational resource allocation policy for a multi-user mobile edge computing system, such that the expected power consumption is minimized while satisfying long-term delay constraints. The problem is formulated as a constrained Markov decision process (CMDP) that is efficiently solved by the proposed constrained reinforcement learning (CRL) algorithm, called successive convex programming based policy optimization (SCPPO). SCPPO solves a convex objective/feasibility surrogate problem at each update and it can provably converge to a Karush-Kuhn-Tucker (KKT) point of the original CMDP problem almost surely under some mild conditions. Moreover, SCPPO adopts an application-specific policy architecture and employs a data-efficient estimation strategy that can reuse old experiences, such that SCPPO can realize fast learning with low computational complexity.

引用

页码：2708 / 2712

页数：5

共 50 条

[21] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
Fu, Fang
Zhang, Zhicai
Yu, Fei Richard
Yan, Qiao
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (08) : 1875 - 1889
[22] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
Fang Fu
Zhicai Zhang
Fei Richard Yu
Qiao Yan
[J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 1875 - 1889
[23] Energy-efficient and delay-aware multitask offloading for mobile edge computing networks
Chanyour, Tarik
El Ghmary, Mohamed
Hmimz, Youssef
Malki, Mohammed Oucamah Cherkaoui
[J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (03)
[24] Energy-efficient and delay-aware multitask offloading for mobile edge computing networks
Chanyour, Tarik
El Ghmary, Mohamed
Hmimz, Youssef
Malki, Mohammed Oucamah Cherkaoui
[J]. MOLECULES, 2022, 27 (05):
[25] Deep Reinforcement Learning for Performance-Aware Adaptive Resource Allocation in Mobile Edge Computing
Huang, Binbin
Li, Zhongjin
Xu, Yunqiu
Pan, Linxuan
Wang, Shangguang
Hu, Haiyang
Chang, Victor
[J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
[26] Delay-Aware Power Control for Downlink Multi-User MIMO via Constrained Deep Reinforcement Learning
Tian, Chang
Huang, Guan
Liu, An
Luo, Wu
[J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[27] PDMA: Probabilistic service migration approach for delay-aware and mobility-aware mobile edge computing
Xu, Minxian
Zhou, Qiheng
Wu, Huaming
Lin, Weiwei
Ye, Kejiang
Xu, Chengzhong
[J]. SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (02): : 394 - 414
[28] Mobile-aware dynamic resource management for edge computing
Filiposka, Sonja
Mishev, Anastas
Gilly, Katja
[J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2019, 30 (06):
[29] Request-Aware Task Offloading in Mobile Edge Computing via Deep Reinforcement Learning
Sheng, Ziwen
Mao, Yingchi
Wang, Jiajun
Nie, Hua
Huang, Jianxin
[J]. 2022 TENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, CBD, 2022, : 294 - 299
[30] A Computing Offloading Resource Allocation Scheme Using Deep Reinforcement Learning in Mobile Edge Computing Systems
Li, Xuezhu
[J]. JOURNAL OF GRID COMPUTING, 2021, 19 (03)

← 1 2 3 4 5 →