Delay-Aware Power Control for Downlink Multi-User MIMO via Constrained Deep Reinforcement Learning

被引:0
|
作者
Tian, Chang [1 ]
Huang, Guan [2 ]
Liu, An [2 ]
Luo, Wu [1 ]
机构
[1] Peking Univ, Dept Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/GLOBECOM46510.2021.9685617
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate the downlink transmission for multi-user multi-input multi-out (MU-MIMO) system, in which the regularized zero forcing (RZF) precoder is adopted and the power allocation and regularization factor are optimized. Our aim is to find a power allocation and regularization factor control policy that can minimize the long-term average power consumption subject to long-term delay constraint for each user. The induced optimization problem is formulated as a constrained Markov decision process (CMDP), which is efficiently solved by the proposed constrained deep reinforcement learning algorithm, called successive convex approximation policy optimization (SCAPO). The SCAPO is based on solving a sequence of convex objective/feasibility optimization problems obtained by replacing the objective and constraint functions in the original problems with convex surrogate functions. At each iteration, the SCAPO merely needs to estimate the first-order information and solve a convex surrogate problem that can be efficiently parallel tackled. Moreover, the SCAPO enables to reuse old experiences from previous updates, thereby significantly reducing the implementation cost. Numerical results have shown that the novel SCAPO can achieve the state-of-the-art performance over advanced baselines.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning
    Hu, Pihe
    Chen, Yu
    Pan, Ling
    Fang, Zhixuan
    Xiao, Fu
    Huang, Longbo
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (03) : 2344 - 2359
  • [2] Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning
    Hu, Pihe
    Pan, Ling
    Chen, Yu
    Fang, Zhixuan
    Huang, Longbo
    PROCEEDINGS OF THE 2022 THE TWENTY-THIRD INTERNATIONAL SYMPOSIUM ON THEORY, ALGORITHMIC FOUNDATIONS, AND PROTOCOL DESIGN FOR MOBILE NETWORKS AND MOBILE COMPUTING, MOBIHOC 2022, 2022, : 1 - 10
  • [3] Distributed Multi-Cell Multi-User MISO Downlink Beamforming via Deep Reinforcement Learning
    JIA Haonan
    HE Zhenqing
    TAN Wanlong
    RUI Hua
    LIN Wei
    ZTECommunications, 2022, 20 (04) : 69 - 77
  • [4] SINR Constrained Beamforming for a MIMO Multi-user Downlink System
    Shi, Qingjiang
    Razaviyayn, Meisam
    Hong, Mingyi
    Luo, Zhi-Quan
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1991 - 1995
  • [5] Deep Reinforcement Learning for Multi-User Massive MIMO with Channel Aging
    Feng, Zhenyuan
    Clerckx, Bruno
    IEEE Transactions on Machine Learning in Communications and Networking, 2023, 1 : 360 - 375
  • [6] Novel Channel Aware Power Control for a Multi-User Downlink NOMA Network
    Jee, Anand
    Prakriya, Shankar
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (02) : 392 - 396
  • [7] Delay-aware Cellular Traffic Scheduling with Deep Reinforcement Learning
    Zhang, Ticao
    Shen, Shuyi
    Mao, Shiwen
    Chang, Gee-Kung
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [8] Delay-Aware NFV Resource Allocation with Deep Reinforcement Learning
    Yuan, Ningcheng
    He, Wenchen
    Shen, Jing
    Qiu, Xuesong
    Guo, Shaoyong
    Li, Wenjing
    NOMS 2020 - PROCEEDINGS OF THE 2020 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2020: MANAGEMENT IN THE AGE OF SOFTWARIZATION AND ARTIFICIAL INTELLIGENCE, 2020,
  • [9] Delay-aware BS-DTX Control and User Scheduling for Energy Harvesting Downlink Coordinated MIMO Systems
    Cui, Ying
    Lau, Vincent K. N.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5852 - 5857
  • [10] Delay-Aware BS Discontinuous Transmission Control and User Scheduling for Energy Harvesting Downlink Coordinated MIMO Systems
    Cui, Ying
    Lau, Vincent K. N.
    Wu, Yueping
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (07) : 3786 - 3795