Multi-agent reinforcement learning based transmission scheme for IRS-assisted multi-UAV systems

被引:0
|
作者
Mei, Yumo
Liu, Chen
Song, Yunchao [1 ,2 ]
Wang, Ge
Liang, Huibin
机构
[1] Nanjing Univ Posts & Telecommun, Coll Elect & Opt Engn, Nanjing 210023, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Flexible Elect Future Technol, Nanjing 210023, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-agent systems; MIMO communication; MASSIVE MIMO; OPTIMIZATION; COMMUNICATION;
D O I
10.1049/cmu2.12674
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a transmission scheme based on multi-agent reinforcement learning for intelligent reflecting surface (IRS)-assisted multiple unmanned aerial vehicles (UAVs) systems is proposed. The proposed scheme is based on reinforcement learning and alternating optimization algorithm, which can effectively improve communication quality and ensure fairness. The scheme is divided into two parts. In the first part, the multi-UAV cooperation problem is modeled as a markov decision process. The objective of each UAV is to maximize the minimum user channel gain. To achieve stable strategies for all agents, the Multi-agent Deep Deterministic Policy Gradient (MADDPG) algorithm is applied to train UAVs trajectories to reach the Nash equilibrium. The MADDPG algorithm is centralized trained at the base station and executed in a distributed manner by each UAV, ensuring efficient and effective coordination among agents. In the second part, an alternating optimization algorithm is formulated to optimize active and passive beamforming. Considering the non-convexity of the fairness objective, by using auxiliary variables and semi-definite relaxation method, the problem of maximizing the minimum user achievable rate is transformed into a feasibility problem. Simulation results show that the proposed scheme can effectively train UAVs trajectories and improve the communication performance of all users fairly.
引用
收藏
页码:2019 / 2029
页数:11
相关论文
共 50 条
  • [31] UAV Confrontation and Evolutionary Upgrade Based on Multi-Agent Reinforcement Learning
    Deng, Xin
    Dong, Zhaoqi
    Ding, Jishiyu
    DRONES, 2024, 8 (08)
  • [32] Optimal Frequency Reuse and Power Control in Multi-UAV Wireless Networks: Hierarchical Multi-Agent Reinforcement Learning Perspective
    Lee, Seungmin
    Lim, Suhyeon
    Chae, Seong Ho
    Jung, Bang Chul
    Park, Chan Yi
    Lee, Howon
    IEEE ACCESS, 2022, 10 : 39555 - 39565
  • [33] Optimal formation tracking control based on reinforcement learning for multi-UAV systems
    Wang, Weizhen
    Chen, Xin
    Jia, Jiangbo
    Wu, Kaili
    Xie, Mingyang
    CONTROL ENGINEERING PRACTICE, 2023, 141
  • [34] Multi-agent Computation Offloading in UAV Assisted MEC via Deep Reinforcement Learning
    He, Hang
    Ren, Tao
    Qiu, Yuan
    Hu, Zheyuan
    Li, Yanqi
    SMART COMPUTING AND COMMUNICATION, 2022, 13202 : 416 - 426
  • [35] A reinforcement learning scheme for a multi-agent card game
    Fujita, H
    Matsuno, Y
    Ishii, S
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
  • [36] Study of reinforcement learning based on multi-agent robot systems
    College of Computer and Information Engineering, Harbin University of Commerce, Harbin 150028, China
    J. Comput. Inf. Syst., 2007, 5 (2001-2006): : 2001 - 2006
  • [37] Multi-Agent Deep Reinforcement Learning for Secure UAV Communications
    Zhang, Yu
    Zhuang, Zirui
    Gao, Feifei
    Wang, Jingyu
    Han, Zhu
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [38] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
  • [39] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
  • [40] Individual Reward Assisted Multi-Agent Reinforcement Learning
    Wang, Li
    Zhang, Yupeng
    Hu, Yujing
    Wang, Weixun
    Zhang, Chongjie
    Gao, Yang
    Hao, Jianye
    Lv, Tangjie
    Fan, Changjie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,