Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm

被引：6

作者：

Nian, Xiaohong ^{[1
]}

Li, Mengmeng ^{[1
]}

Wang, Haibo ^{[1
]}

Gong, Yalei ^{[1
]}

Xiong, Hongyun ^{[1
]}

机构：

[1] Cent South Univ, Clustered Unmanned Syst Res Inst, Sch Automat, Changsha 410073, Hunan, Peoples R China

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Large-scale UAV swarm confrontation; Hierarchical attention actor-critic; Multi-agent reinforcement learning;

D O I：

10.1007/s10489-024-05293-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In large-scale unmanned aerial vehicle (UAV) swarm confrontation scenarios, the design of decision-making and coordination strategies becomes extremely difficult. Multi-Agent Reinforcement Learning (MARL), as a novel decision-making approach to address this issue, faces challenges such as poor scalability and the curse of dimensionality. To overcome these challenges, the paper proposes a Hierarchical Attention Actor-Critic (HAAC) algorithm. The HAAC algorithm includes a centralized critic network based on a Hierarchical Two-stage Attention Network (H2ANet), along with a hierarchical actor policy network that combines rules and reinforcement learning approaches. H2ANet is specifically designed to model the relationships between UAVs and extract crucial information from neighboring UAVs, enabling the generation of advanced cooperative and competitive strategies. The HAAC algorithm effectively reduces the dimensionality of both action and state spaces. Experimental results conducted demonstrate that the HAAC algorithm outperforms existing methods and is able to extend its learned policies to large-scale scenarios.

引用

页码：3279 / 3294

页数：16

共 50 条

[31] Modeling-Learning-Based Actor-Critic Algorithm with Gaussian Process Approximator
Shan Zhong
Jack Tan
Husheng Dong
Xuemei Chen
Shengrong Gong
Zhenjiang Qian
Journal of Grid Computing, 2020, 18 : 181 - 195
[32] Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Barakat, Anas
Bianchi, Pascal
Lehmann, Julien
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[33] Research on UAV Swarm Confrontation Task Based on MADDPG Algorithm
Xiang, Lei
Xie, Tao
2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1513 - 1518
[34] Mapless Navigation for Mobile Robots Based on Improved Soft Actor-Critic Algorithm
Yang, Binglin
Wang, Hongwei
Xia, Hao
39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 755 - 761
[35] Deployment Algorithm of Service Function Chain Based on Transfer Actor-Critic Learning
Tang Lun
He Xiaoyu
Wang Xiao
Chen Qianbin
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (11) : 2671 - 2679
[36] Optimal scheduling of virtual power plant based on Soft Actor-Critic algorithm
Pan, Pengfei
Song, Minggang
Zou, Nan
Qin, Junhan
Li, Guangdi
Ma, Hongyuan
2024 6TH ASIA ENERGY AND ELECTRICAL ENGINEERING SYMPOSIUM, AEEES 2024, 2024, : 835 - 840
[37] Optimization of Robot Environment Interaction Based on Asynchronous Advantage Actor-Critic Algorithm
Xu, Jitang
Chen, Qiang
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 1350 - 1359
[38] An inertia wheel pendulum control method based on actor-critic learning algorithm
Liu Huanlong
Wang Zhengjie
Jiang Bin
Peng Hongyu
2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1281 - 1285
[39] Network Congestion Control Algorithm Based on Actor-Critic Reinforcement Learning Model
Xu, Tao
Gong, Lina
Zhang, Wei
Li, Xuhong
Wang, Xia
Pan, Wenwen
ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
[40] Modeling-Learning-Based Actor-Critic Algorithm with Gaussian Process Approximator
Zhong, Shan
Tan, Jack
Dong, Husheng
Chen, Xuemei
Gong, Shengrong
Qian, Zhenjiang
JOURNAL OF GRID COMPUTING, 2020, 18 (02) : 181 - 195

← 1 2 3 4 5 →