Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

被引:47
|
作者
Gu, Bo [1 ]
Zhang, Xu [1 ]
Lin, Ziqi [1 ]
Alazab, Mamoun [2 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China
[2] Charles Darwin Univ, Coll Engn IT & Environm, Darwin, NT 0810, Australia
关键词
DDQN; deep reinforcement learning (DRL); delay critical; device-to-device (D2D); Internet of Things (IoT); spectral efficiency; POWER ALLOCATION; D2D COMMUNICATION; JOINT SUBCARRIER; CHANNEL; INTERFERENCE; ASSIGNMENT; NETWORKS;
D O I
10.1109/JIOT.2020.3023111
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ultrareliable and low-latency communication (URLLC) is a prerequisite for the successful implementation of the Internet of Controllable Things. In this article, we investigate the potential of deep reinforcement learning (DRL) for joint subcarrier-power allocation to achieve low latency and high reliability in a general form of device-to-device (D2D) networks, where each subcarrier can be allocated to multiple D2D pairs and each D2D pair is permitted to utilize multiple subcarriers. We first formulate the above problem as a Markov decision process and then propose a double deep Q-network (DQN)-based resource allocation algorithm to learn the optimal policy in the absence of full instantaneous channel state information (CSI). Specifically, each D2D pair acts as a learning agent that adjusts its own subcarrier-power allocation strategy iteratively through interactions with the operating environment in a trial-and-error fashion. Simulation results demonstrate that the proposed algorithm achieves near-optimal performance in real time. It is worth mentioning that the proposed algorithm is especially suitable for cases where the environmental dynamics are not accurate and the CSI delay cannot be ignored.
引用
收藏
页码:3066 / 3074
页数:9
相关论文
共 50 条
  • [31] Reinforcement-Learning-Based Dynamic Spectrum Access for Software-Defined Cognitive Industrial Internet of Things
    Liu, Xin
    Sun, Can
    Yu, Wei
    Zhou, Mu
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (06) : 4244 - 4253
  • [32] Dynamical Resource Allocation in Edge for Trustable Internet-of-Things Systems: A Reinforcement Learning Method
    Deng, Shuiguang
    Xiang, Zhengzhe
    Zhao, Peng
    Taheri, Javid
    Gao, Honghao
    Yin, Jianwei
    Zomaya, Albert Y.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (09) : 6103 - 6113
  • [33] Dynamic multiple access based on deep reinforcement learning for Internet of Things
    Liu, Xin
    Li, Zengqi
    [J]. COMPUTER COMMUNICATIONS, 2023, 210 : 331 - 341
  • [34] A Deep Reinforcement Learning-Based Caching Strategy for Internet of Things
    Nasehzadeh, Ali
    Wang, Ping
    [J]. 2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 969 - 974
  • [35] Dual-attention assisted deep reinforcement learning algorithm for energy-efficient resource allocation in Industrial Internet of Things
    Wang, Ying
    Shang, Fengjun
    Lei, Jianjun
    Zhu, Xiangwei
    Qin, Haoming
    Wen, Jiayu
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 142 : 150 - 164
  • [36] Decentralized Resource Allocation-Based Multiagent Deep Learning in Vehicular Network
    Mafuta, Armeline D.
    Maharaj, Bodhaswar T. J.
    Alfa, Attahiru S.
    [J]. IEEE SYSTEMS JOURNAL, 2023, 17 (01): : 87 - 98
  • [37] Priority-Aware Reinforcement-Learning-Based Integrated Design of Networking and Control for Industrial Internet of Things
    Xu, Hansong
    Liu, Xing
    Hatcher, William Grant
    Xu, Guobin
    Liao, Weixian
    Yu, Wei
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (06) : 4668 - 4680
  • [38] Deep Reinforcement Learning for Resource Allocation in Blockchain-based Federated Learning
    Dai, Yueyue
    Yang, Huijiong
    Yang, Huiran
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 179 - 184
  • [39] Deep-Reinforcement-Learning-Based Energy-Efficient Resource Management for Social and Cognitive Internet of Things
    Yang, Helin
    Zhong, Wen-De
    Chen, Chen
    Alphones, Arokiaswami
    Xie, Xianzhong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (06) : 5677 - 5689
  • [40] Deep reinforcement learning-based resource reservation method for Power Emergency Internet-of-things Slice
    Wen, Mingshi
    Hai, Tianxiang
    Zhang, Li
    Hao, Jiakai
    Zhao, Guanghuai
    Zhen, Zerui
    Zhao, Yikun
    Feng, Lei
    [J]. IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 63 - 67