Delay-Aware Stochastic Resource Management for Mobile Edge Computing Systems via Constrained Reinforcement Learning

被引:1
|
作者
Tian, Chang [1 ]
Liu, An [2 ]
Luo, Wu [1 ]
机构
[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing 100871, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
基金
美国国家科学基金会;
关键词
Task analysis; Resource management; Delays; Servers; Edge computing; Computer architecture; Reinforcement learning; Mobile edge computing; delay-constrained; constrained reinforcement learning; application-specific design; ALLOCATION;
D O I
10.1109/LWC.2021.3112984
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We design a joint radio and computational resource allocation policy for a multi-user mobile edge computing system, such that the expected power consumption is minimized while satisfying long-term delay constraints. The problem is formulated as a constrained Markov decision process (CMDP) that is efficiently solved by the proposed constrained reinforcement learning (CRL) algorithm, called successive convex programming based policy optimization (SCPPO). SCPPO solves a convex objective/feasibility surrogate problem at each update and it can provably converge to a Karush-Kuhn-Tucker (KKT) point of the original CMDP problem almost surely under some mild conditions. Moreover, SCPPO adopts an application-specific policy architecture and employs a data-efficient estimation strategy that can reuse old experiences, such that SCPPO can realize fast learning with low computational complexity.
引用
收藏
页码:2708 / 2712
页数:5
相关论文
共 50 条
  • [41] A Survey on Delay-Aware Resource Control for Wireless Systems-Large Deviation Theory, Stochastic Lyapunov Drift, and Distributed Stochastic Learning
    Cui, Ying
    Lau, Vincent K. N.
    Wang, Rui
    Huang, Huang
    Zhang, Shunqing
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (03) : 1677 - 1701
  • [42] Efficient Algorithms for Delay-Aware NFV-Enabled Multicasting in Mobile Edge Clouds With Resource Sharing
    Ren, Haozhe
    Xu, Zichuan
    Liang, Weifa
    Xia, Qiufen
    Zhou, Pan
    Rana, Omer F.
    Galis, Alex
    Wu, Guowei
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (09) : 2050 - 2066
  • [43] Security-Aware Task Offloading Using Deep Reinforcement Learning in Mobile Edge Computing Systems
    Lu, Haodong
    He, Xiaoming
    Zhang, Dengyin
    [J]. ELECTRONICS, 2024, 13 (15)
  • [44] Spectrum-Aware Mobile Edge Computing for UAVs Using Reinforcement Learning
    Badnava, Babak
    Kim, Taejoon
    Cheung, Kenny
    Ali, Zaheer
    Hashemi, Morteza
    [J]. 2021 ACM/IEEE 6TH SYMPOSIUM ON EDGE COMPUTING (SEC 2021), 2021, : 376 - 380
  • [45] Delay-Aware with Resource Block Management Scheduling Algorithm in LTE
    Kaewmongkol, Korn
    Jansang, Aphirak
    Phonphoem, Anan
    [J]. 2015 INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC), 2015, : 290 - 295
  • [46] Delay-sensitive Task Scheduling with Deep Reinforcement Learning in Mobile-edge Computing Systems
    Meng, Hao
    Chao, Daichong
    Guo, Qianying
    Li, Xiaowei
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2019), 2019, 1229
  • [47] Delay-Aware Routing in Software-Defined Networks via Network Tomography and Reinforcement Learning
    Tao, Xu
    Monaco, Doriana
    Sacco, Alessio
    Silvestri, Simone
    Marchetto, Guido
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (04): : 3383 - 3397
  • [48] DACOM: Learning Delay-Aware Communication for Multi-Agent Reinforcement Learning
    Yuan, Tingting
    Chung, Hwei-Ming
    Yuan, Jie
    Fu, Xiaoming
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11763 - 11771
  • [49] Resource Allocation for Edge Computing in IoT Networks via Reinforcement Learning
    Liu, Xiaolan
    Qin, Zhijin
    Gao, Yue
    [J]. ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [50] Joint DNN partitioning and resource allocation for completion rate maximization of delay-aware DNN inference tasks in wireless powered mobile edge computing
    Tian, Xianzhong
    Xu, Pengcheng
    Shen, Yifan
    Shao, Yuheng
    [J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2023, 16 (06) : 2865 - 2878