Double Deep Q-Network Based Distributed Resource Matching Algorithm for D2D Communication

Cited by: 16
Authors
Yuan, Yazhou [1 ,2 ]
Li, Zhijie [1 ,2 ]
Liu, Zhixin [1 ,2 ]
Yang, Yi [1 ,2 ]
Guan, Xinping [3 ]
Affiliations
[1] Minist Educ Intelligent Control Syst & Intelligen, Engn Res Ctr, Qinhuangdao 066004, Hebei, Peoples R China
[2] Yanshan Univ, Sch Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Device-to-device communication; Resource management; Cellular networks; Reinforcement learning; Games; Deep learning; Copper; Device-to-device communications; deep reinforcement learning; communication resource; non-cooperative game; MULTIPLE-ACCESS; ALLOCATION;
DOI
10.1109/TVT.2021.3130159
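Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes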
0808 ; 0809 ;
Abstract
Device-to-Device (D2D) communication over short distances is an efficient way to improve spectrum efficiency and mitigate interference. To realize the optimal resource configuration, including wireless channel matching and power allocation, a distributed resource matching scheme based on deep reinforcement learning (DRL) is proposed. The reward is defined as the difference between the achievable rate of the D2D users and the consumed power, and it is constrained by the Signal to Interference plus Noise Ratio (SINR) of the other cellular users on the current channel. The proposed algorithm maximizes the D2D throughput and energy efficiency in a distributed manner, without online coordination or message exchange between users. The resource allocation problem is formulated as a random non-cooperative game with multiple players (D2D pairs), where each player is a learning agent whose task is to learn its best strategy from locally observed information. A multi-user communication resource matching algorithm based on a Double Deep Q-network (DDQN) is then proposed, under which the total cellular throughput and user energy efficiency can converge to the Nash equilibrium (NE) under the mixed strategy. Simulation results show that the proposed algorithm improves the communication rate and energy efficiency of each user by selecting the optimal strategy, and converges better than existing schemes.
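The abstract names two ingredients that are easier to follow in code: a reward built from rate minus power under an SINR constraint, and the Double DQN bootstrap target. The sketch below is illustrative only and is not taken from the paper. The function names, the SINR threshold, the power weight, the fixed penalty value, and the discount factor are all assumptions chosen for the example. The target computation follows the standard Double DQN rule, in which the online network selects the next action and the target network evaluates it.

```python
import math


def achievable_rate(sinr_linear):
    """Shannon rate in bit/s/Hz for a linear (not dB) SINR value."""
    return math.log2(1.0 + sinr_linear)


def d2d_reward(rate, tx_power, cellular_sinr_db,
               sinr_threshold_db=5.0, power_weight=1.0, penalty=-1.0):
    """Hypothetical reward: rate minus weighted power, gated by cellular SINR.

    The threshold, weight, and penalty are assumed values, not the paper's.
    """
    if cellular_sinr_db < sinr_threshold_db:
        # The cellular user's SINR constraint is violated on this channel.
        return penalty
    return rate - power_weight * tx_power


def ddqn_target(reward, next_q_online, next_q_target, gamma=0.9, done=False):
    """Standard Double DQN target for one transition.

    next_q_online / next_q_target are the Q-values of the next state, one
    entry per action (here an action would be a channel/power-level pair).
    """
    if done:
        return reward
    # Action selection uses the online network ...
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    # ... while action evaluation uses the target network.
    return reward + gamma * next_q_target[best_action]
```

Separating selection from evaluation is what distinguishes Double DQN from plain DQN, which would take the maximum of the target network's own estimates and tends to overestimate action values.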
Pages: 984-993
Number of pages: 10
Related papers
50 in total
  • [32] Estimation of Distribution Algorithm for Joint Resource Management in D2D Communication
    Ahmad, Mushtaq
    Naeem, Muhammad
    Iqbal, Muhammad
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 108 (02) : 1113 - 1129
  • [33] DYNAMIC RESOURCE ALLOCATIONS BASED ON Q-LEARNING FOR D2D COMMUNICATION IN CELLULAR NETWORKS
    Luo, Yong
    Shi, Zhiping
    Zhou, Xin
    Liu, Qiaoyan
    Yi, Qicong
    2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, : 385 - 388
  • [34] Firefly inspired Improved Distributed Proximity Algorithm for D2D Communication
    Pratap, Ajay
    Misra, Rajiv
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 323 - 328
  • [35] A Simple Distributed Channel Allocation Algorithm for D2D Communication Pairs
    Zhao, Haitao
    Ding, Kaiqi
    Sarkar, Nurul, I
    Wei, Jibo
    Xiong, Jun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (11) : 10960 - 10969
  • [36] Distributed Deep Learning Power Allocation for D2D Network Based on Outdated Information
    Shi, Jiaqi
    Zhang, Qianqian
    Liang, Ying-Chang
    Yuan, Xiaojun
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [37] On Joint Offloading and Resource Allocation: A Double Deep Q-Network Approach
    Khoramnejad, Fahime
    Erol-Kantarci, Melike
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (04) : 1126 - 1141
  • [38] Resource optimization for cellular network assisted multichannel D2D communication
    Wang, Jiaheng
    Zhu, Daohua
    Zhang, Hua
    Zhao, Chunming
    Li, James C. F.
    Lei, Ming
    SIGNAL PROCESSING, 2014, 100 : 23 - 31
  • [39] Quantum-Based Deep Q-Network Bandwidth Resource Allocation Algorithm for UASN
    Gao, Jia
    Wang, Jingjing
    Gu, Jianlei
    Shi, Wei
    IEEE Internet of Things Journal, 2024, 11 (24) : 39932 - 39940
  • [40] Resource Allocation Scheme for D2D Communication Based on ILA
    Gu, Zhifang
    Xu, Pingping
    Wu, Guilu
    Liu, Hao
    AD HOC NETWORKS, ADHOCNETS 2018, 2019, 258 : 39 - 48