Multi-agent reinforcement learning based transmission scheme for IRS-assisted multi-UAV systems

被引：0

作者：

Mei, Yumo

Liu, Chen

Song, Yunchao ^{[1
,2
]}

Wang, Ge

Liang, Huibin

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Elect & Opt Engn, Nanjing 210023, Jiangsu, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll Flexible Elect Future Technol, Nanjing 210023, Jiangsu, Peoples R China

来源：

IET COMMUNICATIONS | 2023年 / 17卷 / 17期

基金：

中国国家自然科学基金;

关键词：

multi-agent systems; MIMO communication; MASSIVE MIMO; OPTIMIZATION; COMMUNICATION;

D O I：

10.1049/cmu2.12674

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a transmission scheme based on multi-agent reinforcement learning for intelligent reflecting surface (IRS)-assisted multiple unmanned aerial vehicles (UAVs) systems is proposed. The proposed scheme is based on reinforcement learning and alternating optimization algorithm, which can effectively improve communication quality and ensure fairness. The scheme is divided into two parts. In the first part, the multi-UAV cooperation problem is modeled as a markov decision process. The objective of each UAV is to maximize the minimum user channel gain. To achieve stable strategies for all agents, the Multi-agent Deep Deterministic Policy Gradient (MADDPG) algorithm is applied to train UAVs trajectories to reach the Nash equilibrium. The MADDPG algorithm is centralized trained at the base station and executed in a distributed manner by each UAV, ensuring efficient and effective coordination among agents. In the second part, an alternating optimization algorithm is formulated to optimize active and passive beamforming. Considering the non-convexity of the fairness objective, by using auxiliary variables and semi-definite relaxation method, the problem of maximizing the minimum user achievable rate is transformed into a feasibility problem. Simulation results show that the proposed scheme can effectively train UAVs trajectories and improve the communication performance of all users fairly.

引用

页码：2019 / 2029

页数：11

共 50 条

[31] UAV Confrontation and Evolutionary Upgrade Based on Multi-Agent Reinforcement Learning
Deng, Xin
Dong, Zhaoqi
Ding, Jishiyu
DRONES, 2024, 8 (08)
[32] Optimal Frequency Reuse and Power Control in Multi-UAV Wireless Networks: Hierarchical Multi-Agent Reinforcement Learning Perspective
Lee, Seungmin
Lim, Suhyeon
Chae, Seong Ho
Jung, Bang Chul
Park, Chan Yi
Lee, Howon
IEEE ACCESS, 2022, 10 : 39555 - 39565
[33] Optimal formation tracking control based on reinforcement learning for multi-UAV systems
Wang, Weizhen
Chen, Xin
Jia, Jiangbo
Wu, Kaili
Xie, Mingyang
CONTROL ENGINEERING PRACTICE, 2023, 141
[34] Multi-agent Computation Offloading in UAV Assisted MEC via Deep Reinforcement Learning
He, Hang
Ren, Tao
Qiu, Yuan
Hu, Zheyuan
Li, Yanqi
SMART COMPUTING AND COMMUNICATION, 2022, 13202 : 416 - 426
[35] A reinforcement learning scheme for a multi-agent card game
Fujita, H
Matsuno, Y
Ishii, S
2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
[36] Study of reinforcement learning based on multi-agent robot systems
College of Computer and Information Engineering, Harbin University of Commerce, Harbin 150028, China
J. Comput. Inf. Syst., 2007, 5 (2001-2006): : 2001 - 2006
[37] Multi-Agent Deep Reinforcement Learning for Secure UAV Communications
Zhang, Yu
Zhuang, Zirui
Gao, Feifei
Wang, Jingyu
Han, Zhu
2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
[38] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
Lv, Zefang
Xiao, Liang
Du, Yousong
Niu, Guohang
Xing, Chengwen
Xu, Wenyuan
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
[39] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
Cui, Jingjing
Liu, Yuanwei
Nallanathan, Arumugam
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
[40] Individual Reward Assisted Multi-Agent Reinforcement Learning
Wang, Li
Zhang, Yupeng
Hu, Yujing
Wang, Weixun
Zhang, Chongjie
Gao, Yang
Hao, Jianye
Lv, Tangjie
Fan, Changjie
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,

← 1 2 3 4 5 →