UAV Confrontation and Evolutionary Upgrade Based on Multi-Agent Reinforcement Learning

Cited by: 0

Authors:

Deng, Xin [1,2]

Dong, Zhaoqi [1]

Ding, Jishiyu [3]

Affiliations:

[1] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China

[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314000, Peoples R China

[3] Intelligent Sci Technol Acad Ltd CASIC, Beijing 100041, Peoples R China

Source:

DRONES | 2024 / Volume 8 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

UAV confrontation; MARL; semi-static training method; evolutionary upgrade;

DOI:

10.3390/drones8080368

Chinese Library Classification (CLC) number:

TP7 [Remote sensing technology];

Discipline classification codes:

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

Abstract:

Unmanned aerial vehicle (UAV) confrontation scenarios play a crucial role in the study of agent behavior selection and decision planning. Multi-agent reinforcement learning (MARL) algorithms serve as a universally effective method for guiding agents toward appropriate action strategies. They determine subsequent actions based on the state of the agents and the environmental information that the agents receive. However, traditional MARL settings often result in one side consistently outperforming the other because of a superior strategy, or in both sides reaching a strategic stalemate with no further improvement. To solve this issue, we propose a semi-static deep deterministic policy gradient algorithm based on MARL. This algorithm employs centralized training with decentralized execution and dynamically adjusts the training intensity based on the comparative strengths and weaknesses of the two sides' strategies. Experimental results show that, during training, the winning team's strategy drives the losing team's strategy to upgrade continuously, and the winning and losing roles keep changing hands, so that the strategies of both teams improve together. Compared with the traditional reinforcement learning algorithm, the semi-static reinforcement learning algorithm improves the win-loss relationship conversion by 8% and reduces the training time by 40%.
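
The abstract does not spell out the rule used to adjust training intensity, so the following is only a minimal illustrative sketch of the general idea, not the authors' algorithm. It assumes a hypothetical `SemiStaticScheduler` that tracks the recent win rate of team A and, once one team is clearly ahead, holds that team's policy fixed while giving the other team extra gradient updates. The class name, the window size, the threshold, and the doubling of updates are all assumptions made for illustration.

```python
from collections import deque


class SemiStaticScheduler:
    """Hypothetical scheduler: decides how many gradient updates each team gets.

    This is an illustrative assumption, not the rule from the paper.
    """

    def __init__(self, window=20, base_updates=4, threshold=0.6):
        # Sliding window of recent episode outcomes: 1 if team A won, else 0.
        self.results = deque(maxlen=window)
        self.base_updates = base_updates
        self.threshold = threshold

    def record(self, team_a_won):
        """Store the outcome of one finished episode."""
        self.results.append(1 if team_a_won else 0)

    def win_rate_a(self):
        """Recent win rate of team A; 0.5 when no episodes have been recorded."""
        if not self.results:
            return 0.5
        return sum(self.results) / len(self.results)

    def updates_per_team(self):
        """Return the number of gradient updates to run for each team."""
        rate = self.win_rate_a()
        if rate >= self.threshold:
            # Team A dominates: keep A static, train B harder.
            return {"A": 0, "B": 2 * self.base_updates}
        if rate <= 1 - self.threshold:
            # Team B dominates: keep B static, train A harder.
            return {"A": 2 * self.base_updates, "B": 0}
        # Roughly balanced: train both teams equally.
        return {"A": self.base_updates, "B": self.base_updates}
```

In a training loop, `record` would be called after each episode and `updates_per_team` would decide how many actor-critic updates each team's learner runs before the next episode. Under centralized training with decentralized execution, only the critics would see joint observations during those updates.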

Pages: 20
