Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm

被引:6
|
作者
Nian, Xiaohong [1 ]
Li, Mengmeng [1 ]
Wang, Haibo [1 ]
Gong, Yalei [1 ]
Xiong, Hongyun [1 ]
机构
[1] Cent South Univ, Clustered Unmanned Syst Res Inst, Sch Automat, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Large-scale UAV swarm confrontation; Hierarchical attention actor-critic; Multi-agent reinforcement learning;
D O I
10.1007/s10489-024-05293-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In large-scale unmanned aerial vehicle (UAV) swarm confrontation scenarios, the design of decision-making and coordination strategies becomes extremely difficult. Multi-Agent Reinforcement Learning (MARL), as a novel decision-making approach to address this issue, faces challenges such as poor scalability and the curse of dimensionality. To overcome these challenges, the paper proposes a Hierarchical Attention Actor-Critic (HAAC) algorithm. The HAAC algorithm includes a centralized critic network based on a Hierarchical Two-stage Attention Network (H2ANet), along with a hierarchical actor policy network that combines rules and reinforcement learning approaches. H2ANet is specifically designed to model the relationships between UAVs and extract crucial information from neighboring UAVs, enabling the generation of advanced cooperative and competitive strategies. The HAAC algorithm effectively reduces the dimensionality of both action and state spaces. Experimental results conducted demonstrate that the HAAC algorithm outperforms existing methods and is able to extend its learned policies to large-scale scenarios.
引用
收藏
页码:3279 / 3294
页数:16
相关论文
共 50 条
  • [31] Modeling-Learning-Based Actor-Critic Algorithm with Gaussian Process Approximator
    Shan Zhong
    Jack Tan
    Husheng Dong
    Xuemei Chen
    Shengrong Gong
    Zhenjiang Qian
    Journal of Grid Computing, 2020, 18 : 181 - 195
  • [32] Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
    Barakat, Anas
    Bianchi, Pascal
    Lehmann, Julien
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [33] Research on UAV Swarm Confrontation Task Based on MADDPG Algorithm
    Xiang, Lei
    Xie, Tao
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1513 - 1518
  • [34] Mapless Navigation for Mobile Robots Based on Improved Soft Actor-Critic Algorithm
    Yang, Binglin
    Wang, Hongwei
    Xia, Hao
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 755 - 761
  • [35] Deployment Algorithm of Service Function Chain Based on Transfer Actor-Critic Learning
    Tang Lun
    He Xiaoyu
    Wang Xiao
    Chen Qianbin
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (11) : 2671 - 2679
  • [36] Optimal scheduling of virtual power plant based on Soft Actor-Critic algorithm
    Pan, Pengfei
    Song, Minggang
    Zou, Nan
    Qin, Junhan
    Li, Guangdi
    Ma, Hongyuan
    2024 6TH ASIA ENERGY AND ELECTRICAL ENGINEERING SYMPOSIUM, AEEES 2024, 2024, : 835 - 840
  • [37] Optimization of Robot Environment Interaction Based on Asynchronous Advantage Actor-Critic Algorithm
    Xu, Jitang
    Chen, Qiang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 1350 - 1359
  • [38] An inertia wheel pendulum control method based on actor-critic learning algorithm
    Liu Huanlong
    Wang Zhengjie
    Jiang Bin
    Peng Hongyu
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1281 - 1285
  • [39] Network Congestion Control Algorithm Based on Actor-Critic Reinforcement Learning Model
    Xu, Tao
    Gong, Lina
    Zhang, Wei
    Li, Xuhong
    Wang, Xia
    Pan, Wenwen
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [40] Modeling-Learning-Based Actor-Critic Algorithm with Gaussian Process Approximator
    Zhong, Shan
    Tan, Jack
    Dong, Husheng
    Chen, Xuemei
    Gong, Shengrong
    Qian, Zhenjiang
    JOURNAL OF GRID COMPUTING, 2020, 18 (02) : 181 - 195