User Association and Power Allocation for User-Centric Smart-Duplex Networks via Tree-Structured Deep Reinforcement Learning

被引:4
|
作者
Wang, Dan [1 ]
Li, Ran [2 ,3 ]
Huang, Chuan [2 ,3 ]
Xu, Xiaodong [1 ,4 ]
Chen, Hao [1 ]
机构
[1] Peng Cheng Lab, Dept Broadband Commun, Shenzhen 518055, Peoples R China
[2] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Peoples R China
[3] Chinese Univ Hong Kong, Future Network Intelligence Inst, Shenzhen 518172, Peoples R China
[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
关键词
Deep reinforcement learning (DRL); multiagent tree-structured policy gradient (MATSPG); ultradense network (UDN); user centric (UC); ULTRA-DENSE NETWORKS; RESOURCE-ALLOCATION; JOINT UPLINK; RELAY;
D O I
10.1109/JIOT.2023.3283775
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article considers a smart-duplex (SD) powered user-centric ultra dense networks (UC-UDNs), where each user is served cooperatively by multiple access points (APs) adopting the de-cellular concept to achieve desired Quality-of-Service (QoS). The average QoS satisfaction ratio maximization problem for the considered SD UC-UDN is formulated as a Markov decision process (MDP) with large discrete action space by designing the user association and power allocation. To reduce the action space, user association and power allocation are modeled as a two-layer tree, and selecting an action for each user is equivalent to finding the path from the root to one leaf of the constructed tree. Then, a multiagent tree-structured policy gradient (MATSPG)-based deep reinforcement learning (DRL) algorithm is proposed to solve the MDP problem, whose training process is shown to be equivalent to that of the two-layer neural networks. Next, the time and space complexity of searching one action in the proposed MATSPG are also proved to be lower than the conventional DRL algorithms. Finally, simulations show that the proposed MATSPG algorithm significantly improves the average QoS satisfaction ratio than the conventional multiagent deep deterministic policy gradient and multiagent deep Q-network methods in typical scenarios.
引用
收藏
页码:20216 / 20229
页数:14
相关论文
共 50 条
  • [31] Resource Allocation and User Association Using Reinforcement Learning via Curriculum in a Wireless Network with High User Mobility
    Kim, Dong Uk
    Park, Seong Bae
    Hong, Choong Seon
    Huh, Eui Nam
    2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN, 2023, : 382 - 386
  • [32] Dynamic User Association and Computation Offloading in Satellite Edge Computing Networks via Deep Reinforcement Learning
    Zhang, Hangyu
    Zhao, Hongbo
    Liu, Rongke
    Gao, Xiangqiang
    Xu, Shenzhan
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2024, 8 (04): : 1888 - 1901
  • [33] Intelligent User Association for Symbiotic Radio Networks Using Deep Reinforcement Learning
    Zhang, Qianqian
    Liang, Ying-Chang
    Poor, H. Vincent
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (07) : 4535 - 4548
  • [34] Intelligent User Association for Symbiotic Radio Networks using Deep Reinforcement Learning
    Zhang, Qianqian
    Liang, Ying-Chang
    Poor, H. Vincent
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [35] Deep Learning Based Radio Resource Management in NOMA Networks: User Association, Subchannel and Power Allocation
    Zhang, Haijun
    Zhang, Haisen
    Long, Keping
    Karagiannidis, George K.
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 2406 - 2415
  • [36] Hypergraph-Based SCMA Codebook Allocation in User-Centric Ultra-Dense Networks with Machine Learning
    Yu, Lisu
    Zhang, Hongliang
    Zhang, Long
    Song, Lingyang
    Han, Zhu
    Fan, Pingzhi
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [37] Inverse Reinforcement Learning Meets Power Allocation in Multi-user Cellular Networks
    Zhang, Ruichen
    Xiong, Ke
    Tian, Xingcong
    Lu, Yang
    Fan, Pingyi
    Ben Letaief, Khaled
    IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [38] Adaptive User Scheduling and Resource Allocation in Wireless Federated Learning Networks : A Deep Reinforcement Learning Approach
    Wu, Changxiang
    Ren, Yijing
    So, Daniel K. C.
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 1219 - 1225
  • [39] Parallel Deep Reinforcement Learning based Online User Association Optimization in Heterogeneous Networks
    Li, Zhiyang
    Chen, Ming
    Wang, Kezhi
    Pan, Cunhua
    Huang, Nuo
    Hu, Yuntao
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [40] Deep Reinforcement Learning Based Caching Placement and User Association for Dynamic Cellular Networks
    Wang, Yue
    Feng, Chunyan
    Zhang, Tiankui
    2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,