UAV Confrontation and Evolutionary Upgrade Based on Multi-Agent Reinforcement Learning

被引:0
|
作者
Deng, Xin [1 ,2 ]
Dong, Zhaoqi [1 ]
Ding, Jishiyu [3 ]
机构
[1] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314000, Peoples R China
[3] Intelligent Sci Technol Acad Ltd CASIC, Beijing 100041, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV confrontation; MARL; semi-static training method; evolutionary upgrade;
D O I
10.3390/drones8080368
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Unmanned aerial vehicle (UAV) confrontation scenarios play a crucial role in the study of agent behavior selection and decision planning. Multi-agent reinforcement learning (MARL) algorithms serve as a universally effective method guiding agents toward appropriate action strategies. They determine subsequent actions based on the state of the agents and the environmental information that the agents receive. However, traditional MARL settings often result in one party agent consistently outperforming the other party due to superior strategies, or both agents reaching a strategic stalemate with no further improvement. To solve this issue, we propose a semi-static deep deterministic policy gradient algorithm based on MARL. This algorithm employs a centralized training and decentralized execution approach, dynamically adjusting the training intensity based on the comparative strengths and weaknesses of both agents' strategies. Experimental results show that during the training process, the strategy of the winning team drives the losing team's strategy to upgrade continuously, and the relationship between the winning team and the losing team keeps changing, thus achieving mutual improvement of the strategies of both teams. The semi-static reinforcement learning algorithm improves the win-loss relationship conversion by 8% and reduces the training time by 40% compared with the traditional reinforcement learning algorithm.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Graph Convolutional Multi-Agent Reinforcement Learning for UAV Coverage Control
    Dai, Anna
    Li, Rongpeng
    Zhaot, Zhifeng
    Zhang, Honggang
    [J]. 2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1106 - 1111
  • [22] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
    Xu, D.
    Chen, G.
    [J]. AERONAUTICAL JOURNAL, 2022, 126 (1300): : 932 - 951
  • [23] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
    Xu, D.
    Chen, G.
    [J]. Aeronautical Journal, 2022,
  • [24] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    [J]. 2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [25] Multi-UAV Cooperative Searching and Tracking for Moving Targets Based on Multi-Agent Reinforcement Learning
    Su, Kai
    Qian, Feng
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [26] Multi-agent Reinforcement Learning-based Offloading Decision for UAV Cluster Combat Tasks
    Li, Jiajian
    Shi, Yanjun
    Yang, Yu
    Li, Bo
    Zhao, Xijun
    [J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 (11): : 3295 - 3309
  • [27] UAV Frequency-based Crowdsensing Using Grouping Multi-agent Deep Reinforcement Learning
    Cui ZHANG
    En WANG
    Funing YANG
    Yongjian YANG
    Nan JIANG
    [J]. 计算机科学, 2023, 50 (02) : 57 - 68
  • [28] UAV intelligent attack strategy generation model based on multi-agent game reinforcement learning
    Zhao, Zhiruo
    Cao, Lei
    Chen, Xiliang
    Lai, Jun
    Zhang, Legui
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (10): : 3165 - 3171
  • [29] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
    Chen, Binqiang
    Liu, Dong
    Hanzo, Lajos
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
  • [30] A HYBRID APPROACH BASED ON MULTI-AGENT GEOSIMULATION AND REINFORCEMENT LEARNING TO SOLVE A UAV PATROLLING PROBLEM
    Perron, Jimmy
    Hogan, Jimmy
    Moulin, Bernard
    Berger, Jean
    Belanger, Micheline
    [J]. 2008 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2008, : 1259 - +