Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

Citations: 47

Authors:

Gu, Bo [1]

Zhang, Xu [1]

Lin, Ziqi [1]

Alazab, Mamoun [2]

Affiliations:

[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China

[2] Charles Darwin Univ, Coll Engn IT & Environm, Darwin, NT 0810, Australia

Source:

IEEE INTERNET OF THINGS JOURNAL | 2021, Vol. 8, No. 05

Keywords:

DDQN; deep reinforcement learning (DRL); delay critical; device-to-device (D2D); Internet of Things (IoT); spectral efficiency; POWER ALLOCATION; D2D COMMUNICATION; JOINT SUBCARRIER; CHANNEL; INTERFERENCE; ASSIGNMENT; NETWORKS;

DOI:

10.1109/JIOT.2020.3023111

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Ultrareliable and low-latency communication (URLLC) is a prerequisite for the successful implementation of the Internet of Controllable Things. In this article, we investigate the potential of deep reinforcement learning (DRL) for joint subcarrier-power allocation to achieve low latency and high reliability in a general form of device-to-device (D2D) networks, where each subcarrier can be allocated to multiple D2D pairs and each D2D pair is permitted to utilize multiple subcarriers. We first formulate the above problem as a Markov decision process and then propose a double deep Q-network (DQN)-based resource allocation algorithm to learn the optimal policy in the absence of full instantaneous channel state information (CSI). Specifically, each D2D pair acts as a learning agent that adjusts its own subcarrier-power allocation strategy iteratively through interactions with the operating environment in a trial-and-error fashion. Simulation results demonstrate that the proposed algorithm achieves near-optimal performance in real time. It is worth mentioning that the proposed algorithm is especially suitable for cases where the environmental dynamics are not accurate and the CSI delay cannot be ignored.
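The abstract names a double DQN as the learning method. As a rough illustration only, the sketch below computes the standard double DQN bootstrap target for a single transition: the online network picks the greedy next action and the target network evaluates it. The function name, the discount factor of 0.9, and the three-action Q-value lists are hypothetical; the abstract does not give the paper's state, action, or reward definitions, so this should not be read as the authors' implementation.

```python
from typing import Sequence


def double_dqn_target(reward: float,
                      q_online_next: Sequence[float],
                      q_target_next: Sequence[float],
                      gamma: float = 0.9,
                      done: bool = False) -> float:
    """Double DQN target for one transition (generic form, not the paper's code).

    q_online_next: online-network Q-values for every action in the next state.
    q_target_next: target-network Q-values for the same actions.
    gamma: discount factor (0.9 is an assumed value).
    """
    if done:
        # No bootstrapping from a terminal state.
        return reward
    # Action selection uses the online network.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    # Action evaluation uses the target network.
    return reward + gamma * q_target_next[best_action]


# Hypothetical example: each index could stand for one subcarrier-power choice.
q_online_next = [0.2, 0.8, 0.5]
q_target_next = [0.3, 0.4, 0.9]
y = double_dqn_target(1.0, q_online_next, q_target_next, gamma=0.9)
```

Here the online network prefers action 1, so the target bootstraps from the target network's value for action 1 rather than from the target network's own maximum. Decoupling selection from evaluation in this way is what reduces the overestimation bias of a plain DQN.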


Pages: 3066-3074

Number of pages: 9

50 items in total

[31] Reinforcement-Learning-Based Dynamic Spectrum Access for Software-Defined Cognitive Industrial Internet of Things
Liu, Xin
Sun, Can
Yu, Wei
Zhou, Mu
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (06) : 4244 - 4253
[32] Dynamical Resource Allocation in Edge for Trustable Internet-of-Things Systems: A Reinforcement Learning Method
Deng, Shuiguang
Xiang, Zhengzhe
Zhao, Peng
Taheri, Javid
Gao, Honghao
Yin, Jianwei
Zomaya, Albert Y.
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (09) : 6103 - 6113
[33] Dynamic multiple access based on deep reinforcement learning for Internet of Things
Liu, Xin
Li, Zengqi
[J]. COMPUTER COMMUNICATIONS, 2023, 210 : 331 - 341
[34] A Deep Reinforcement Learning-Based Caching Strategy for Internet of Things
Nasehzadeh, Ali
Wang, Ping
[J]. 2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 969 - 974
[35] Dual-attention assisted deep reinforcement learning algorithm for energy-efficient resource allocation in Industrial Internet of Things
Wang, Ying
Shang, Fengjun
Lei, Jianjun
Zhu, Xiangwei
Qin, Haoming
Wen, Jiayu
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 142 : 150 - 164
[36] Decentralized Resource Allocation-Based Multiagent Deep Learning in Vehicular Network
Mafuta, Armeline D.
Maharaj, Bodhaswar T. J.
Alfa, Attahiru S.
[J]. IEEE SYSTEMS JOURNAL, 2023, 17 (01) : 87 - 98
[37] Priority-Aware Reinforcement-Learning-Based Integrated Design of Networking and Control for Industrial Internet of Things
Xu, Hansong
Liu, Xing
Hatcher, William Grant
Xu, Guobin
Liao, Weixian
Yu, Wei
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (06) : 4668 - 4680
[38] Deep Reinforcement Learning for Resource Allocation in Blockchain-based Federated Learning
Dai, Yueyue
Yang, Huijiong
Yang, Huiran
[J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 179 - 184
[39] Deep-Reinforcement-Learning-Based Energy-Efficient Resource Management for Social and Cognitive Internet of Things
Yang, Helin
Zhong, Wen-De
Chen, Chen
Alphones, Arokiaswami
Xie, Xianzhong
[J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (06) : 5677 - 5689
[40] Deep reinforcement learning-based resource reservation method for Power Emergency Internet-of-things Slice
Wen, Mingshi
Hai, Tianxiang
Zhang, Li
Hao, Jiakai
Zhao, Guanghuai
Zhen, Zerui
Zhao, Yikun
Feng, Lei
[J]. IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 63 - 67
