UAV Confrontation and Evolutionary Upgrade Based on Multi-Agent Reinforcement Learning

被引:0
|
作者
Deng, Xin [1 ,2 ]
Dong, Zhaoqi [1 ]
Ding, Jishiyu [3 ]
机构
[1] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314000, Peoples R China
[3] Intelligent Sci Technol Acad Ltd CASIC, Beijing 100041, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV confrontation; MARL; semi-static training method; evolutionary upgrade;
D O I
10.3390/drones8080368
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Unmanned aerial vehicle (UAV) confrontation scenarios play a crucial role in the study of agent behavior selection and decision planning. Multi-agent reinforcement learning (MARL) algorithms serve as a universally effective method guiding agents toward appropriate action strategies. They determine subsequent actions based on the state of the agents and the environmental information that the agents receive. However, traditional MARL settings often result in one party agent consistently outperforming the other party due to superior strategies, or both agents reaching a strategic stalemate with no further improvement. To solve this issue, we propose a semi-static deep deterministic policy gradient algorithm based on MARL. This algorithm employs a centralized training and decentralized execution approach, dynamically adjusting the training intensity based on the comparative strengths and weaknesses of both agents' strategies. Experimental results show that during the training process, the strategy of the winning team drives the losing team's strategy to upgrade continuously, and the relationship between the winning team and the losing team keeps changing, thus achieving mutual improvement of the strategies of both teams. The semi-static reinforcement learning algorithm improves the win-loss relationship conversion by 8% and reduces the training time by 40% compared with the traditional reinforcement learning algorithm.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] UAV Swarm Confrontation Based on Multi-agent Deep Reinforcement Learning
    Wang, Zhi
    Liu, Fan
    Guo, Jing
    Hong, Chen
    Chen, Ming
    Wang, Ershen
    Zhao, Yunbo
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4996 - 5001
  • [2] UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning
    Gong, Zihao
    Xu, Yang
    Luo, Delin
    [J]. UNMANNED SYSTEMS, 2023, 11 (03) : 273 - 286
  • [3] An evolutionary multi-agent reinforcement learning algorithm for multi-UAV air combat
    Wang, Baolai
    Gao, Xianzhong
    Xie, Tao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [4] Multi-Agent Evolutionary Reinforcement Learning Based on Cooperative Games
    Yu, Jin
    Zhang, Ya
    Sun, Changyin
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [5] Evolutionary reinforcement learning algorithm for large-scale multi-agent cooperation and confrontation applications
    Haiying Liu
    ZhiHao Li
    Kuihua Huang
    Rui Wang
    Guangquan Cheng
    Tiexiang Li
    [J]. The Journal of Supercomputing, 2024, 80 : 2319 - 2346
  • [6] Evolutionary reinforcement learning algorithm for large-scale multi-agent cooperation and confrontation applications
    Liu, Haiying
    Li, ZhiHao
    Huang, Kuihua
    Wang, Rui
    Cheng, Guangquan
    Li, Tiexiang
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (02): : 2319 - 2346
  • [7] The Application of Multi-Agent Reinforcement Learning in UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2019,
  • [8] Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning
    Zhao, Xiaoru
    Yang, Rennong
    Zhong, Liangsheng
    Hou, Zhiwei
    [J]. DRONES, 2024, 8 (01)
  • [9] Evolutionary game theory and multi-agent reinforcement learning
    Tuyls, K
    Nowé, A
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2005, 20 (01): : 63 - 90
  • [10] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075