Double Deep Q-Network Based Distributed Resource Matching Algorithm for D2D Communication

被引:16
|
作者
Yuan, Yazhou [1 ,2 ]
Li, Zhijie [1 ,2 ]
Liu, Zhixin [1 ,2 ]
Yang, Yi [1 ,2 ]
Guan, Xinping [3 ]
机构
[1] Minist Educ Intelligent Control Syst & Intelligen, Engn Res Ctr, Qinhuangdao 066004, Hebei, Peoples R China
[2] Yanshan Univ, Sch Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Device-to-device communication; Resource management; Cellular networks; Reinforcement learning; Games; Deep learning; Copper; Device-to-device communications; deep reinforcement learning; communication resource; non-cooperative game; MULTIPLE-ACCESS; ALLOCATION;
D O I
10.1109/TVT.2021.3130159
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Device-to-Device (D2D) communication with short communication distance is an efficient way to improve spectrum efficiency and mitigate interference. To realize the optimal resource configuration including wireless channel matching and power allocation, a distributed resource matching scheme is proposed based on deep reinforcement learning(DRL). The reward is defined as the difference of achieve rate of D2D users and the consumed power, which is limited by the Signal to Interference plus Noise Ratio (SINR) of the other cellular users on the current channel. The proposed algorithm maximizes the D2D throughput and energy efficiency in a distributed manner, without online coordination and message exchange between users. The considered resource allocation problem is formulated as a random non-cooperative game with multiple players (D2D pairs), where each player is a learning agent, whose task is to learn its best strategy based on locally observed information, multi-user communication resource matching algorithm is proposed based on a Double Deep Q-network (DDQN), where the total cellular throughput and user energy efficiency could converge to the Nash equilibrium (NE) under the mixed strategy. Simulation results show that the proposed algorithm can improve the communication rate and energy efficiency of each user by selecting the optimal strategy, and has better convergence performance compared with existing schemes.
引用
收藏
页码:984 / 993
页数:10
相关论文
共 50 条
  • [21] A Resource Allocation Algorithm based on DILA for D2D communication in LTE-A Multi-cell Network
    Wang, Junshe
    Wang, Songhua
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 114 - 117
  • [22] Relay selection algorithm based on social network combined with Q-learning for vehicle D2D communication
    Qian, Hongzhi
    Yu, Jinming
    Hua, Licheng
    IET COMMUNICATIONS, 2019, 13 (20) : 3582 - 3587
  • [23] RESOURCE ALLOCATION FOR D2D COMMUNICATIONS WITH A NOVEL DISTRIBUTED Q-LEARNING ALGORITHM IN HETEROGENEOUS NETWORKS
    Huang, Yung-Fa
    Tan, Tan-Hsu
    Wang, Neng-Chung
    Chen, Young-Long
    Li, Yu-Ling
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 533 - 537
  • [24] Hybrid Optimization Algorithm for Resource Allocation in LTE-Based D2D Communication
    Austine A.
    Suji Pramila R.
    Computer Systems Science and Engineering, 2023, 46 (02): : 2263 - 2276
  • [25] Manufacturing Resource Scheduling Based on Deep Q-Network
    ZHANG Yufei
    ZOU Yuanhao
    ZHAO Xiaodong
    Wuhan University Journal of Natural Sciences, 2022, 27 (06) : 531 - 538
  • [26] Mode Selection and Resource Allocation Algorithm Based on Interference Control for D2D Communication
    Liao, Xiaoqin
    Xu, Yang
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS (ICCCAS 2018), 2018, : 286 - 290
  • [27] Energy efficiency resource allocation for D2D communication network based on relay selection
    Gang Feng
    Xizhong Qin
    Zhenhong Jia
    Shaohua Li
    Wireless Networks, 2021, 27 : 3689 - 3699
  • [28] Resource Sharing For Energy Harvesting Based D2D Communication Underlaying Cellular network
    Khuntia, Partap
    Hazra, Ranjay
    Akhter, Javed
    Ravi, Anuradha
    13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATION SYSTEMS (IEEE ANTS), 2019,
  • [29] Matching Theory for Resource Allocation in Energy Harvesting Aided D2D Communication
    Meng, Yue
    Zhang, Ping
    Huang, Yuzhen
    Zhang, Zhi
    2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2019,
  • [30] Energy efficiency resource allocation for D2D communication network based on relay selection
    Feng, Gang
    Qin, Xizhong
    Jia, Zhenhong
    Li, Shaohua
    WIRELESS NETWORKS, 2021, 27 (05) : 3689 - 3699