Delay-Aware Power Control for Downlink Multi-User MIMO via Constrained Deep Reinforcement Learning

被引：0

作者：

Tian, Chang ^{[1
]}

Huang, Guan ^{[2
]}

Liu, An ^{[2
]}

Luo, Wu ^{[1
]}

机构：

[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing, Peoples R China

[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China

来源：

2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2021年

基金：

国家重点研发计划;

关键词：

D O I：

10.1109/GLOBECOM46510.2021.9685617

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We investigate the downlink transmission for multi-user multi-input multi-out (MU-MIMO) system, in which the regularized zero forcing (RZF) precoder is adopted and the power allocation and regularization factor are optimized. Our aim is to find a power allocation and regularization factor control policy that can minimize the long-term average power consumption subject to long-term delay constraint for each user. The induced optimization problem is formulated as a constrained Markov decision process (CMDP), which is efficiently solved by the proposed constrained deep reinforcement learning algorithm, called successive convex approximation policy optimization (SCAPO). The SCAPO is based on solving a sequence of convex objective/feasibility optimization problems obtained by replacing the objective and constraint functions in the original problems with convex surrogate functions. At each iteration, the SCAPO merely needs to estimate the first-order information and solve a convex surrogate problem that can be efficiently parallel tackled. Moreover, the SCAPO enables to reuse old experiences from previous updates, thereby significantly reducing the implementation cost. Numerical results have shown that the novel SCAPO can achieve the state-of-the-art performance over advanced baselines.

引用

页数：6

共 50 条

[1] Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning
Hu, Pihe
Chen, Yu
Pan, Ling
Fang, Zhixuan
Xiao, Fu
Huang, Longbo
IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (03) : 2344 - 2359
[2] Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning
Hu, Pihe
Pan, Ling
Chen, Yu
Fang, Zhixuan
Huang, Longbo
PROCEEDINGS OF THE 2022 THE TWENTY-THIRD INTERNATIONAL SYMPOSIUM ON THEORY, ALGORITHMIC FOUNDATIONS, AND PROTOCOL DESIGN FOR MOBILE NETWORKS AND MOBILE COMPUTING, MOBIHOC 2022, 2022, : 1 - 10
[3] Distributed Multi-Cell Multi-User MISO Downlink Beamforming via Deep Reinforcement Learning
JIA Haonan
HE Zhenqing
TAN Wanlong
RUI Hua
LIN Wei
ZTECommunications, 2022, 20 (04) : 69 - 77
[4] SINR Constrained Beamforming for a MIMO Multi-user Downlink System
Shi, Qingjiang
Razaviyayn, Meisam
Hong, Mingyi
Luo, Zhi-Quan
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1991 - 1995
[5] Deep Reinforcement Learning for Multi-User Massive MIMO with Channel Aging
Feng, Zhenyuan
Clerckx, Bruno
IEEE Transactions on Machine Learning in Communications and Networking, 2023, 1 : 360 - 375
[6] Novel Channel Aware Power Control for a Multi-User Downlink NOMA Network
Jee, Anand
Prakriya, Shankar
IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (02) : 392 - 396
[7] Delay-aware Cellular Traffic Scheduling with Deep Reinforcement Learning
Zhang, Ticao
Shen, Shuyi
Mao, Shiwen
Chang, Gee-Kung
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[8] Delay-Aware NFV Resource Allocation with Deep Reinforcement Learning
Yuan, Ningcheng
He, Wenchen
Shen, Jing
Qiu, Xuesong
Guo, Shaoyong
Li, Wenjing
NOMS 2020 - PROCEEDINGS OF THE 2020 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2020: MANAGEMENT IN THE AGE OF SOFTWARIZATION AND ARTIFICIAL INTELLIGENCE, 2020,
[9] Delay-aware BS-DTX Control and User Scheduling for Energy Harvesting Downlink Coordinated MIMO Systems
Cui, Ying
Lau, Vincent K. N.
2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5852 - 5857
[10] Delay-Aware BS Discontinuous Transmission Control and User Scheduling for Energy Harvesting Downlink Coordinated MIMO Systems
Cui, Ying
Lau, Vincent K. N.
Wu, Yueping
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (07) : 3786 - 3795

← 1 2 3 4 5 →