Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm

被引：6

作者：

Nian, Xiaohong ^{[1
]}

Li, Mengmeng ^{[1
]}

Wang, Haibo ^{[1
]}

Gong, Yalei ^{[1
]}

Xiong, Hongyun ^{[1
]}

机构：

[1] Cent South Univ, Clustered Unmanned Syst Res Inst, Sch Automat, Changsha 410073, Hunan, Peoples R China

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Large-scale UAV swarm confrontation; Hierarchical attention actor-critic; Multi-agent reinforcement learning;

D O I：

10.1007/s10489-024-05293-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In large-scale unmanned aerial vehicle (UAV) swarm confrontation scenarios, the design of decision-making and coordination strategies becomes extremely difficult. Multi-Agent Reinforcement Learning (MARL), as a novel decision-making approach to address this issue, faces challenges such as poor scalability and the curse of dimensionality. To overcome these challenges, the paper proposes a Hierarchical Attention Actor-Critic (HAAC) algorithm. The HAAC algorithm includes a centralized critic network based on a Hierarchical Two-stage Attention Network (H2ANet), along with a hierarchical actor policy network that combines rules and reinforcement learning approaches. H2ANet is specifically designed to model the relationships between UAVs and extract crucial information from neighboring UAVs, enabling the generation of advanced cooperative and competitive strategies. The HAAC algorithm effectively reduces the dimensionality of both action and state spaces. Experimental results conducted demonstrate that the HAAC algorithm outperforms existing methods and is able to extend its learned policies to large-scale scenarios.

引用

页码：3279 / 3294

页数：16

共 50 条

[1] Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm
Xiaohong Nian
Mengmeng Li
Haibo Wang
Yalei Gong
Hongyun Xiong
Applied Intelligence, 2024, 54 : 3279 - 3294
[2] A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning
Hu, Chunyang
Li, Jingchen
Yang, Yusen
Gu, Qiong
Wu, Zhao
Ning, Bin
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2025,
[3] Large-scale Interactive Conversational Recommendation System using Actor-Critic Framework
Montazeralghaem, Ali
Allan, James
Thomas, Philip S.
15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 220 - 229
[4] Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning
Zhong, Shan
Liu, Quan
Fu, QiMing
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
[5] Reduce UAV Coverage Energy Consumption through Actor-Critic Algorithm
Liu, Bo
Zhang, Yue
Fu, Shupo
Liu, Xuan
2019 15TH INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR NETWORKS (MSN 2019), 2019, : 332 - 337
[6] Autonomous Decision-Making Generation of UAV based on Soft Actor-Critic Algorithm
Cheng, Yan
Song, Yong
PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7350 - 7355
[7] Multi-Agent Actor-Critic with Hierarchical Graph Attention Network
Ryu, Heechang
Shin, Hayong
Park, Jinkyoo
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7236 - 7243
[8] Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination
Hu, Yifan
Fu, Junjie
Wen, Guanghui
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 12
[9] Swarm Reinforcement Learning Method Based on an Actor-Critic Method
Iima, Hitoshi
Kuroe, Yasuaki
SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 279 - 288
[10] Weighted mean field reinforcement learning for large-scale UAV swarm confrontation
Baolai Wang
Shengang Li
Xianzhong Gao
Tao Xie
Applied Intelligence, 2023, 53 : 5274 - 5289

← 1 2 3 4 5 →