User Association and Power Allocation for User-Centric Smart-Duplex Networks via Tree-Structured Deep Reinforcement Learning

被引：4

作者：

Wang, Dan ^{[1
]}

Li, Ran ^{[2
,3
]}

Huang, Chuan ^{[2
,3
]}

Xu, Xiaodong ^{[1
,4
]}

Chen, Hao ^{[1
]}

机构：

[1] Peng Cheng Lab, Dept Broadband Commun, Shenzhen 518055, Peoples R China

[2] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Peoples R China

[3] Chinese Univ Hong Kong, Future Network Intelligence Inst, Shenzhen 518172, Peoples R China

[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2023年 / 10卷 / 22期

关键词：

Deep reinforcement learning (DRL); multiagent tree-structured policy gradient (MATSPG); ultradense network (UDN); user centric (UC); ULTRA-DENSE NETWORKS; RESOURCE-ALLOCATION; JOINT UPLINK; RELAY;

D O I：

10.1109/JIOT.2023.3283775

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article considers a smart-duplex (SD) powered user-centric ultra dense networks (UC-UDNs), where each user is served cooperatively by multiple access points (APs) adopting the de-cellular concept to achieve desired Quality-of-Service (QoS). The average QoS satisfaction ratio maximization problem for the considered SD UC-UDN is formulated as a Markov decision process (MDP) with large discrete action space by designing the user association and power allocation. To reduce the action space, user association and power allocation are modeled as a two-layer tree, and selecting an action for each user is equivalent to finding the path from the root to one leaf of the constructed tree. Then, a multiagent tree-structured policy gradient (MATSPG)-based deep reinforcement learning (DRL) algorithm is proposed to solve the MDP problem, whose training process is shown to be equivalent to that of the two-layer neural networks. Next, the time and space complexity of searching one action in the proposed MATSPG are also proved to be lower than the conventional DRL algorithms. Finally, simulations show that the proposed MATSPG algorithm significantly improves the average QoS satisfaction ratio than the conventional multiagent deep deterministic policy gradient and multiagent deep Q-network methods in typical scenarios.

引用

页码：20216 / 20229

页数：14

共 50 条

[31] Resource Allocation and User Association Using Reinforcement Learning via Curriculum in a Wireless Network with High User Mobility
Kim, Dong Uk
Park, Seong Bae
Hong, Choong Seon
Huh, Eui Nam
2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN, 2023, : 382 - 386
[32] Dynamic User Association and Computation Offloading in Satellite Edge Computing Networks via Deep Reinforcement Learning
Zhang, Hangyu
Zhao, Hongbo
Liu, Rongke
Gao, Xiangqiang
Xu, Shenzhan
IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2024, 8 (04): : 1888 - 1901
[33] Intelligent User Association for Symbiotic Radio Networks Using Deep Reinforcement Learning
Zhang, Qianqian
Liang, Ying-Chang
Poor, H. Vincent
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (07) : 4535 - 4548
[34] Intelligent User Association for Symbiotic Radio Networks using Deep Reinforcement Learning
Zhang, Qianqian
Liang, Ying-Chang
Poor, H. Vincent
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[35] Deep Learning Based Radio Resource Management in NOMA Networks: User Association, Subchannel and Power Allocation
Zhang, Haijun
Zhang, Haisen
Long, Keping
Karagiannidis, George K.
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 2406 - 2415
[36] Hypergraph-Based SCMA Codebook Allocation in User-Centric Ultra-Dense Networks with Machine Learning
Yu, Lisu
Zhang, Hongliang
Zhang, Long
Song, Lingyang
Han, Zhu
Fan, Pingzhi
2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
[37] Inverse Reinforcement Learning Meets Power Allocation in Multi-user Cellular Networks
Zhang, Ruichen
Xiong, Ke
Tian, Xingcong
Lu, Yang
Fan, Pingyi
Ben Letaief, Khaled
IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
[38] Adaptive User Scheduling and Resource Allocation in Wireless Federated Learning Networks : A Deep Reinforcement Learning Approach
Wu, Changxiang
Ren, Yijing
So, Daniel K. C.
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 1219 - 1225
[39] Parallel Deep Reinforcement Learning based Online User Association Optimization in Heterogeneous Networks
Li, Zhiyang
Chen, Ming
Wang, Kezhi
Pan, Cunhua
Huang, Nuo
Hu, Yuntao
2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
[40] Deep Reinforcement Learning Based Caching Placement and User Association for Dynamic Cellular Networks
Wang, Yue
Feng, Chunyan
Zhang, Tiankui
2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,

← 1 2 3 4 5 →