Double Deep Q-Network Based Distributed Resource Matching Algorithm for D2D Communication

被引:16
|
作者
Yuan, Yazhou [1 ,2 ]
Li, Zhijie [1 ,2 ]
Liu, Zhixin [1 ,2 ]
Yang, Yi [1 ,2 ]
Guan, Xinping [3 ]
机构
[1] Minist Educ Intelligent Control Syst & Intelligen, Engn Res Ctr, Qinhuangdao 066004, Hebei, Peoples R China
[2] Yanshan Univ, Sch Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Device-to-device communication; Resource management; Cellular networks; Reinforcement learning; Games; Deep learning; Copper; Device-to-device communications; deep reinforcement learning; communication resource; non-cooperative game; MULTIPLE-ACCESS; ALLOCATION;
D O I
10.1109/TVT.2021.3130159
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Device-to-Device (D2D) communication with short communication distance is an efficient way to improve spectrum efficiency and mitigate interference. To realize the optimal resource configuration including wireless channel matching and power allocation, a distributed resource matching scheme is proposed based on deep reinforcement learning(DRL). The reward is defined as the difference of achieve rate of D2D users and the consumed power, which is limited by the Signal to Interference plus Noise Ratio (SINR) of the other cellular users on the current channel. The proposed algorithm maximizes the D2D throughput and energy efficiency in a distributed manner, without online coordination and message exchange between users. The considered resource allocation problem is formulated as a random non-cooperative game with multiple players (D2D pairs), where each player is a learning agent, whose task is to learn its best strategy based on locally observed information, multi-user communication resource matching algorithm is proposed based on a Double Deep Q-network (DDQN), where the total cellular throughput and user energy efficiency could converge to the Nash equilibrium (NE) under the mixed strategy. Simulation results show that the proposed algorithm can improve the communication rate and energy efficiency of each user by selecting the optimal strategy, and has better convergence performance compared with existing schemes.
引用
收藏
页码:984 / 993
页数:10
相关论文
共 50 条
  • [1] Autoencoder and Matching-based Resource Allocation Scheme for D2D Communication
    Rathod, Tejal
    Tanwar, Sudeep
    2023 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS, 2023, : 1271 - 1276
  • [2] An Optimal Algorithm for Resource Allocation in D2D Communication
    Alyousif, Shahad
    Dauwed, Mohammed
    Nader, Rafal
    Ali, Mohammed Hasan
    Jabar, Mustafa Musa
    Alkhayyat, Ahmed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 531 - 546
  • [3] A Joint Resource Allocation Algorithm for D2D Communication
    Hamid, Abdul Kadir
    Widaa, Lamia Osman
    Al-Wesabi, Fahd N.
    Khan, Imran
    Hilal, Anwer Mustafa
    Hamza, Manar Ahmed
    Zaman, Abu Sarwar
    Rizwanullah, Mohammed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 3751 - 3762
  • [4] Semi-Distributed Resource Selection for D2D Communication in LTE-A Network
    Park, Seungil
    Choi, Sunghyun
    2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,
  • [5] Resource allocation algorithm for secure communication in UAV-assisted D2D communication network
    Zeng X.
    Wang H.
    Huang L.
    Ma D.
    Tongxin Xuebao/Journal on Communications, 45 (02): : 115 - 126
  • [6] The Distributed Resource Allocation for D2D Communication with Game Theory
    Dun, Hui
    Ye, Fang
    Jiao, Shuhong
    Li, Yibing
    Jiang, Tao
    PROCEEDINGS OF THE 2019 9TH IEEE-APS TOPICAL CONFERENCE ON ANTENNAS AND PROPAGATION IN WIRELESS COMMUNICATIONS (IEEE APWC' 19), 2019, : 104 - 108
  • [7] Matching Based Two-Timescale Resource Allocation for Cooperative D2D Communication
    Yuan, Yiling
    Yang, Tao
    Hu, Yulin
    Feng, Hui
    Hu, Bo
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [8] Chaotic Deep Network for Mobile D2D Communication
    Li, Lixiang
    Chen, Yixin
    Peng, Haipeng
    Yang, Yixian
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (10): : 8078 - 8096
  • [9] Joint Resource Allocation Algorithm Based on Throughput Maximization in D2D Communication
    Qiu, Yue
    Wang, Yinghe
    Tan, Chong
    Zheng, Min
    Yu, Kai
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1074 - 1078
  • [10] Q- Learning Based Power Control Algorithm for D2D Communication
    Nie, Shiwen
    Fan, Zhicliang
    Zhao, Ming
    Gu, Xinyu
    Zhang, Lin
    2016 IEEE 27TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2016, : 1405 - 1410