Multiuser Resource Control With Deep Reinforcement Learning in IoT Edge Computing

Cited by: 47

Authors:

Lei, Lei [1]

Xu, Huijuan [2]

Xiong, Xiong [3]

Zheng, Kan [3]

Xiang, Wei [1]

Wang, Xianbin [4]

Affiliations:

[1] James Cook Univ, Coll Sci & Engn, Cairns, Qld 4878, Australia

[2] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China

[3] Beijing Univ Posts & Telecommun, Intelligent Comp & Commun Lab, Key Lab Univ Wireless Commun, Minist Educ, Beijing 100876, Peoples R China

[4] Western Univ, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada

Source:

IEEE INTERNET OF THINGS JOURNAL | 2019 / Vol. 6 / No. 6

Funding:

National Natural Science Foundation of China;

Keywords:

Deep reinforcement learning (DRL); Internet of Things (IoT); mobile edge computing (MEC); INTERNET; THINGS; RADIO; OPTIMIZATION; MANAGEMENT; ALLOCATION;

DOI:

10.1109/JIOT.2019.2935543

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

By leveraging the concept of mobile edge computing (MEC), the massive amount of data generated by a large number of Internet of Things (IoT) devices can be offloaded to an MEC server at the edge of the wireless network for further computation-intensive processing. However, due to the resource constraints of IoT devices and the wireless network, both communication and computation resources need to be allocated and scheduled efficiently for better system performance. In this article, we propose a joint computation offloading and multiuser scheduling algorithm for an IoT edge computing system to minimize the long-term average weighted sum of delay and power consumption under stochastic traffic arrival. We formulate the dynamic optimization problem as an infinite-horizon average-reward continuous-time Markov decision process (CTMDP) model. One critical challenge in solving this MDP problem for multiuser resource control is the curse of dimensionality, where the state space of the MDP model and the computational complexity increase exponentially with the number of users or IoT devices. To overcome this challenge, we use deep reinforcement learning (RL) techniques and propose a neural network architecture to approximate the value functions for the post-decision system states. The algorithm designed to solve the CTMDP problem supports a semidistributed auction-based implementation, in which the IoT devices submit bids to the base station (BS), which makes the resource control decisions centrally. The simulation results show that the proposed algorithm provides a significant performance improvement over the baseline algorithms, and also outperforms the RL algorithms based on other neural network architectures.
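To make the auction-based idea in the abstract more concrete, here is a minimal Python sketch. It is not the paper's algorithm. The bid rule, the quadratic stand-in for the trained value network, and the names `device_bid`, `schedule`, and `served` are all hypothetical and chosen only for illustration. Each device evaluates an approximate value function on its own post-decision queue length and reports the result as a bid. The BS then picks the winner centrally.

```python
from typing import Callable, Dict

# Hypothetical type: maps a post-decision queue length to an estimated cost-to-go.
ValueFn = Callable[[int], float]


def device_bid(queue_len: int, served: int, value_fn: ValueFn) -> float:
    """Assumed bid rule: estimated reduction in cost-to-go if this device is scheduled."""
    post_if_idle = queue_len                        # post-decision state when not scheduled
    post_if_served = max(queue_len - served, 0)     # post-decision state when scheduled
    return value_fn(post_if_idle) - value_fn(post_if_served)


def schedule(bids: Dict[str, float]) -> str:
    """Central decision at the BS: schedule the device with the largest bid."""
    return max(bids, key=bids.get)


if __name__ == "__main__":
    # Stand-in for a trained neural network (assumption for this sketch only).
    value_fn = lambda q: 0.5 * q * q
    queues = {"dev_a": 3, "dev_b": 5, "dev_c": 1}
    bids = {name: device_bid(q, served=2, value_fn=value_fn) for name, q in queues.items()}
    print(bids, "->", schedule(bids))
```

Under these assumptions, only scalar bids travel to the BS, while each value-function evaluation stays on the device. That split is one way to read "semidistributed".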


Pages: 10119 - 10133

Page count: 15

50 items in total

[1] Resource Allocation Based on Deep Reinforcement Learning in IoT Edge Computing
Xiong, Xiong
Zheng, Kan
Lei, Lei
Hou, Lu
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (06) : 1133 - 1146
[2] Blockchain-Based Edge Computing Resource Allocation in IoT: A Deep Reinforcement Learning Approach
He, Ying
Wang, Yuhang
Qiu, Chao
Lin, Qiuzhen
Li, Jianqiang
Ming, Zhong
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) : 2226 - 2237
[3] Edge Computing Resource Allocation Algorithm for NB-IoT Based on Deep Reinforcement Learning
Chu, Jiawen
Pan, Chunyun
Wang, Yafei
Yun, Xiang
Li, Xuehua
[J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (05) : 439 - 447
[4] Resource Allocation for Edge Computing in IoT Networks via Reinforcement Learning
Liu, Xiaolan
Qin, Zhijin
Gao, Yue
[J]. ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
[5] Resource allocation for content distribution in IoT edge cloud computing environments using deep reinforcement learning
Neelakantan, Puligundla
Gangappa, Malige
Rajasekar, Mummalaneni
Kumar, Talluri Sunil
Reddy, Gali Suresh
[J]. JOURNAL OF HIGH SPEED NETWORKS, 2024, 30 (03) : 409 - 426
[6] Deep Reinforcement Learning for IoT Network Dynamic Clustering in Edge Computing
Liu, Qingzhi
Cheng, Long
Ozcelebi, Tanir
Murphy, John
Lukkien, Johan
[J]. 2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 600 - 603
[7] Resource Allocation in IoT Edge Computing via Concurrent Federated Reinforcement Learning
Zhu, Tianqing
Zhou, Wei
Ye, Dayong
Cheng, Zishuo
Li, Jin
[J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (02) : 1414 - 1426
[8] Quantum Deep Reinforcement Learning for Dynamic Resource Allocation in Mobile Edge Computing-Based IoT Systems
Ansere, James Adu
Gyamfi, Eric
Sharma, Vishal
Shin, Hyundong
Dobre, Octavia A.
Duong, Trung Q.
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 6221 - 6233
[9] Deep Reinforcement Learning for Task Offloading in Edge Computing Assisted Power IoT
Hu, Jiangyi
Li, Yang
Zhao, Gaofeng
Xu, Bo
Ni, Yiyang
Zhao, Haitao
[J]. IEEE ACCESS, 2021, 9 : 93892 - 93901
[10] Deep Reinforcement Learning-Based Task Scheduling in IoT Edge Computing
Sheng, Shuran
Chen, Peng
Chen, Zhimin
Wu, Lenan
Yao, Yuxuan
[J]. SENSORS, 2021, 21 (05) : 1 - 19
