Multiuser Resource Control With Deep Reinforcement Learning in IoT Edge Computing

被引:47
|
作者
Lei, Lei [1 ]
Xu, Huijuan [2 ]
Xiong, Xiong [3 ]
Zheng, Kan [3 ]
Xiang, Wei [1 ]
Wang, Xianbin [4 ]
机构
[1] James Cook Univ, Coll Sci & Engn, Cairns, Qld 4878, Australia
[2] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[3] Beijing Univ Posts & Telecommun, Intelligent Comp & Commun Lab, Key Lab Univ Wireless Commun, Minist Educ, Beijing 100876, Peoples R China
[4] Western Univ, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning (DRL); Internet of Things (IoT); mobile edge computing (MEC); INTERNET; THINGS; RADIO; OPTIMIZATION; MANAGEMENT; ALLOCATION;
D O I
10.1109/JIOT.2019.2935543
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
By leveraging the concept of mobile edge computing (MEC), massive amount of data generated by a large number of Internet of Things (IoT) devices could be offloaded to MEC server at the edge of wireless network for further computational intensive processing. However, due to the resource constraint of IoT devices and wireless network, both communications and computation resources need to be allocated and scheduled efficiently for better system performance. In this article, we propose a joint computation offloading and multiuser scheduling algorithm for IoT edge computing system to minimize the long-term average weighted sum of delay and power consumption under stochastic traffic arrival. We formulate the dynamic optimization problem as an infinite-horizon average-reward continuous-time Markov decision process (CTMDP) model. One critical challenge in solving this MDP problem for the multiuser resource control is the curse-of-dimensionality problem, where the state space of the MDP model and the computation complexity increase exponentially with the growing number of users or IoT devices. In order to overcome this challenge, we use the deep reinforcement learning (RL) techniques and propose a neural network architecture to approximate the value functions for the post-decision system states. The designed algorithm to solve the CTMDP problem supports semidistributed auction-based implementation, where the IoT devices submit bids to the BS to make the resource control decisions centrally. The simulation results show that the proposed algorithm provides significant performance improvement over the baseline algorithms, and also outperforms the RL algorithms based on other neural network architectures.
引用
收藏
页码:10119 / 10133
页数:15
相关论文
共 50 条
  • [1] Resource Allocation Based on Deep Reinforcement Learning in IoT Edge Computing
    Xiong, Xiong
    Zheng, Kan
    Lei, Lei
    Hou, Lu
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (06) : 1133 - 1146
  • [2] Blockchain-Based Edge Computing Resource Allocation in IoT: A Deep Reinforcement Learning Approach
    He, Ying
    Wang, Yuhang
    Qiu, Chao
    Lin, Qiuzhen
    Li, Jianqiang
    Ming, Zhong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) : 2226 - 2237
  • [3] Edge Computing Resource Allocation Algorithm for NB-IoT Based on Deep Reinforcement Learning
    Chu, Jiawen
    Pan, Chunyun
    Wang, Yafei
    Yun, Xiang
    LI, Xuehua
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (05) : 439 - 447
  • [4] Resource Allocation for Edge Computing in IoT Networks via Reinforcement Learning
    Liu, Xiaolan
    Qin, Zhijin
    Gao, Yue
    [J]. ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [5] Resource allocation for content distribution in IoT edge cloud computing environments using deep reinforcement learning
    Neelakantan, Puligundla
    Gangappa, Malige
    Rajasekar, Mummalaneni
    Kumar, Talluri Sunil
    Reddy, Gali Suresh
    [J]. JOURNAL OF HIGH SPEED NETWORKS, 2024, 30 (03) : 409 - 426
  • [6] Deep Reinforcement Learning for IoT Network Dynamic Clustering in Edge Computing
    Liu, Qingzhi
    Cheng, Long
    Ozcelebi, Tanir
    Murphy, John
    Lukkien, Johan
    [J]. 2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 600 - 603
  • [7] Resource Allocation in IoT Edge Computing via Concurrent Federated Reinforcement Learning
    Tianqing Zhu
    Zhou, Wei
    Ye, Dayong
    Cheng, Zishuo
    Li, Jin
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (02) : 1414 - 1426
  • [8] Quantum Deep Reinforcement Learning for Dynamic Resource Allocation in Mobile Edge Computing-Based IoT Systems
    Ansere, James Adu
    Gyamfi, Eric
    Sharma, Vishal
    Shin, Hyundong
    Dobre, Octavia A.
    Duong, Trung Q.
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 6221 - 6233
  • [9] Deep Reinforcement Learning for Task Offloading in Edge Computing Assisted Power IoT
    Hu, Jiangyi
    Li, Yang
    Zhao, Gaofeng
    Xu, Bo
    Ni, Yiyang
    Zhao, Haitao
    [J]. IEEE ACCESS, 2021, 9 : 93892 - 93901
  • [10] Deep Reinforcement Learning-Based Task Scheduling in IoT Edge Computing
    Sheng, Shuran
    Chen, Peng
    Chen, Zhimin
    Wu, Lenan
    Yao, Yuxuan
    [J]. SENSORS, 2021, 21 (05) : 1 - 19