Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm

被引:6
|
作者
Nian, Xiaohong [1 ]
Li, Mengmeng [1 ]
Wang, Haibo [1 ]
Gong, Yalei [1 ]
Xiong, Hongyun [1 ]
机构
[1] Cent South Univ, Clustered Unmanned Syst Res Inst, Sch Automat, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Large-scale UAV swarm confrontation; Hierarchical attention actor-critic; Multi-agent reinforcement learning;
D O I
10.1007/s10489-024-05293-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In large-scale unmanned aerial vehicle (UAV) swarm confrontation scenarios, the design of decision-making and coordination strategies becomes extremely difficult. Multi-Agent Reinforcement Learning (MARL), as a novel decision-making approach to address this issue, faces challenges such as poor scalability and the curse of dimensionality. To overcome these challenges, the paper proposes a Hierarchical Attention Actor-Critic (HAAC) algorithm. The HAAC algorithm includes a centralized critic network based on a Hierarchical Two-stage Attention Network (H2ANet), along with a hierarchical actor policy network that combines rules and reinforcement learning approaches. H2ANet is specifically designed to model the relationships between UAVs and extract crucial information from neighboring UAVs, enabling the generation of advanced cooperative and competitive strategies. The HAAC algorithm effectively reduces the dimensionality of both action and state spaces. Experimental results conducted demonstrate that the HAAC algorithm outperforms existing methods and is able to extend its learned policies to large-scale scenarios.
引用
收藏
页码:3279 / 3294
页数:16
相关论文
共 50 条
  • [1] Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm
    Xiaohong Nian
    Mengmeng Li
    Haibo Wang
    Yalei Gong
    Hongyun Xiong
    Applied Intelligence, 2024, 54 : 3279 - 3294
  • [2] A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning
    Hu, Chunyang
    Li, Jingchen
    Yang, Yusen
    Gu, Qiong
    Wu, Zhao
    Ning, Bin
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2025,
  • [3] Large-scale Interactive Conversational Recommendation System using Actor-Critic Framework
    Montazeralghaem, Ali
    Allan, James
    Thomas, Philip S.
    15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 220 - 229
  • [4] Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning
    Zhong, Shan
    Liu, Quan
    Fu, QiMing
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [5] Reduce UAV Coverage Energy Consumption through Actor-Critic Algorithm
    Liu, Bo
    Zhang, Yue
    Fu, Shupo
    Liu, Xuan
    2019 15TH INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR NETWORKS (MSN 2019), 2019, : 332 - 337
  • [6] Autonomous Decision-Making Generation of UAV based on Soft Actor-Critic Algorithm
    Cheng, Yan
    Song, Yong
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7350 - 7355
  • [7] Multi-Agent Actor-Critic with Hierarchical Graph Attention Network
    Ryu, Heechang
    Shin, Hayong
    Park, Jinkyoo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7236 - 7243
  • [8] Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination
    Hu, Yifan
    Fu, Junjie
    Wen, Guanghui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 12
  • [9] Swarm Reinforcement Learning Method Based on an Actor-Critic Method
    Iima, Hitoshi
    Kuroe, Yasuaki
    SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 279 - 288
  • [10] Weighted mean field reinforcement learning for large-scale UAV swarm confrontation
    Baolai Wang
    Shengang Li
    Xianzhong Gao
    Tao Xie
    Applied Intelligence, 2023, 53 : 5274 - 5289