Delay-Aware Power Control for Downlink Multi-User MIMO via Constrained Deep Reinforcement Learning

被引:0
|
作者
Tian, Chang [1 ]
Huang, Guan [2 ]
Liu, An [2 ]
Luo, Wu [1 ]
机构
[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/GLOBECOM46510.2021.9685617
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate the downlink transmission for multi-user multi-input multi-out (MU-MIMO) system, in which the regularized zero forcing (RZF) precoder is adopted and the power allocation and regularization factor are optimized. Our aim is to find a power allocation and regularization factor control policy that can minimize the long-term average power consumption subject to long-term delay constraint for each user. The induced optimization problem is formulated as a constrained Markov decision process (CMDP), which is efficiently solved by the proposed constrained deep reinforcement learning algorithm, called successive convex approximation policy optimization (SCAPO). The SCAPO is based on solving a sequence of convex objective/feasibility optimization problems obtained by replacing the objective and constraint functions in the original problems with convex surrogate functions. At each iteration, the SCAPO merely needs to estimate the first-order information and solve a convex surrogate problem that can be efficiently parallel tackled. Moreover, the SCAPO enables to reuse old experiences from previous updates, thereby significantly reducing the implementation cost. Numerical results have shown that the novel SCAPO can achieve the state-of-the-art performance over advanced baselines.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Delay-aware dynamic access control for mMTC in wireless networks using deep reinforcement learning
    Pacheco-Paramo, Diego
    Tello-Oquendo, Luis
    COMPUTER NETWORKS, 2020, 182 (182)
  • [22] Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches
    Meng, Fan
    Chen, Peng
    Wu, Lenan
    Cheng, Julian
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (10) : 6255 - 6267
  • [23] Delay-aware model-based reinforcement learning for continuous control
    Chen, Baiming
    Xu, Mengdi
    Li, Liang
    Zhao, Ding
    NEUROCOMPUTING, 2021, 450 : 119 - 128
  • [24] Channel Norm-Based Power Control in Downlink Multi-User Distributed MIMO Systems
    Oh, Yonghwi
    Park, Jonghyun
    Sung, Wonjin
    2010 IEEE 72ND VEHICULAR TECHNOLOGY CONFERENCE FALL, 2010,
  • [25] Downlink Power Control for Cell-Free Massive MIMO With Deep Reinforcement Learning
    Luo, Lirui
    Zhang, Jiayi
    Chen, Shuaifei
    Zhang, Xiaodan
    Ai, Bo
    Ng, Derrick Wing Kwan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (06) : 6772 - 6777
  • [26] SINR Constrained Beamforming for a MIMO Multi-User Downlink System: Algorithms and Convergence Analysis
    Shi, Qingjiang
    Razaviyayn, Meisam
    Hong, Mingyi
    Luo, Zhi-Quan
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2016, 64 (11) : 2920 - 2933
  • [27] Robust Joint Transceiver Power Allocation for Multi-User Downlink MIMO Transmissions
    Kotchasarn, Chirawat
    12TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY: ICT FOR GREEN GROWTH AND SUSTAINABLE DEVELOPMENT, VOLS 1 AND 2, 2010, : 1708 - 1712
  • [28] On Throughput Gains by Exploiting Green Interference Power in the Multi-User MIMO Downlink
    Masouros, C.
    2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 3654 - 3658
  • [29] Joint power distribution algorithm for multi-user downlink of JT MIMO system
    Gong, Yi
    Xie, Xian-Zhong
    2007 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS; VOL 2: SIGNAL PROCESSING, COMPUTATIONAL INTELLIGENCE, CIRCUITS AND SYSTEMS, 2007, : 163 - +
  • [30] Secure Simultaneous Information and Power Transfer for Downlink Multi-User Massive MIMO
    Goli, Zahra
    Razavizadeh, S. Mohammad
    Farhadi, Hamed
    Svensson, Tommy
    IEEE ACCESS, 2020, 8 : 150514 - 150526