Delay-Aware Power Control for Downlink Multi-User MIMO via Constrained Deep Reinforcement Learning

被引：0

作者：

Tian, Chang ^{[1
]}

Huang, Guan ^{[2
]}

Liu, An ^{[2
]}

Luo, Wu ^{[1
]}

机构：

[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing, Peoples R China

[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China

来源：

2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2021年

基金：

国家重点研发计划;

关键词：

D O I：

10.1109/GLOBECOM46510.2021.9685617

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We investigate the downlink transmission for multi-user multi-input multi-out (MU-MIMO) system, in which the regularized zero forcing (RZF) precoder is adopted and the power allocation and regularization factor are optimized. Our aim is to find a power allocation and regularization factor control policy that can minimize the long-term average power consumption subject to long-term delay constraint for each user. The induced optimization problem is formulated as a constrained Markov decision process (CMDP), which is efficiently solved by the proposed constrained deep reinforcement learning algorithm, called successive convex approximation policy optimization (SCAPO). The SCAPO is based on solving a sequence of convex objective/feasibility optimization problems obtained by replacing the objective and constraint functions in the original problems with convex surrogate functions. At each iteration, the SCAPO merely needs to estimate the first-order information and solve a convex surrogate problem that can be efficiently parallel tackled. Moreover, the SCAPO enables to reuse old experiences from previous updates, thereby significantly reducing the implementation cost. Numerical results have shown that the novel SCAPO can achieve the state-of-the-art performance over advanced baselines.

引用

页数：6

共 50 条

[31] Dynamic Power Allocation Based on SLNR Precoding for Multi-User MIMO Downlink
Wang Jing-jing
Xie Xian-zhong
2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 1567 - 1570
[32] Power Allocation for the Downlink of Nonregenerative Cooperative Multi-User MIMO Communication System
Heliot, Fabien
Hoshyar, Reza
Tafazolli, Rahim
2010 IEEE 21ST INTERNATIONAL SYMPOSIUM ON PERSONAL INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2010, : 905 - 910
[33] Joint transmit beamforming and power control in multi-user MIMO downlink using a game theoretic approach
ZENG Yi
The Journal of China Universities of Posts and Telecommunications, 2007, (02) : 14 - 18
[34] Delay-Aware Uplink User Association and Power Control in Heterogeneous Cellular Networks
Chen, Zheng
Qiu, Ling
Jin, Yinghao
Liang, Xiaowen
IEEE WIRELESS COMMUNICATIONS LETTERS, 2015, 4 (06) : 661 - 664
[35] Smart Power Control for Quality-Driven Multi-User Video Transmissions: A Deep Reinforcement Learning Approach
Zhang, Ticao
Mao, Shiwen
IEEE Access, 2020, 8
[36] Smart Power Control for Quality-Driven Multi-User Video Transmissions: A Deep Reinforcement Learning Approach
Zhang, Ticao
Mao, Shiwen
IEEE ACCESS, 2020, 8 : 611 - 622
[37] PAR-Aware Large-Scale Multi-User MIMO-OFDM Downlink
Studer, Christoph
Larsson, Erik G.
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2013, 31 (02) : 303 - 313
[38] A Deep Learning-Based Channel Aware Single Step Signal Detection in Downlink Multi-User NOMA
Kumar, Sarang
Elnourani, Mohamed
Beferull-Lozano, Baltasar
Redhu, Surender
2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
[39] Deep Reinforcement Learning For Multi-User Access Control in Non-Terrestrial Networks
Cao, Yang
Lien, Shao-Yu
Liang, Ying-Chang
IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (03) : 1605 - 1619
[40] Optimal Power Analysis of Downlink Multi-user SA-MIMO Systems use
Dai, Jianxin
Zhou, Jun
Qi, Jie
Chen, Ming
Yuan, Tao
Zhao, Jun
INTERNATIONAL CONFERENCE MACHINERY, ELECTRONICS AND CONTROL SIMULATION, 2014, 614 : 530 - 534

← 1 2 3 4 5 →