Dynamic Attention Network for Multi-UAV Reinforcement Learning

被引：0

作者：

Xu, Dongsheng ^{[1
]}

Wu, Shang ^{[1
]}

机构：

[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Coll Comp, Changsha, Hunan, Peoples R China

来源：

INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021) | 2021年 / 12156卷

关键词：

MADDPG; Transfer learning; Attention; Reinforcement learning; LEVEL;

D O I：

10.1117/12.2626437

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent methods for multi-agent reinforcement learning problems make use of Deep Neural Networks and provide stateof-the-art performance with dedicated neural network architectures and comprehensive training tricks. However, these deep reinforcement learning methods suffer from reproducibility issues, especially in transfer learning. Since the fixed size of the network input, it is difficult for the existing network structure to transfer the strategies learned from a small scale to a large scale. We argue that proper network architecture design is crucial to the cross-scale reinforcement transfer learning. In this paper, we use transfer training with attention network to solve multi-agent combat problems from aerial unmanned aerial vehicle (UAV) combat scenarios, and extend the small-scale learning to large-scale complex scenarios. We combine the attention neural network with the MADDPG algorithm to process the agent observation. It started training from a small-scale multi-UAV combat scenario and gradually increases the number of UAV. The experimental results show that methods for multi-agent UAV combat problems trained by attention transfer learning can achieve the target performance faster and provide better performance than the method without attention transfer learning.

引用

页数：6

共 50 条

[31] Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning
Zhang C.Y.
Liang S.Y.
He C.L.
Wang K.Z.
Journal of Communications and Information Networks, 2022, 7 (02): : 192 - 201
[32] Optimal formation tracking control based on reinforcement learning for multi-UAV systems
Wang, Weizhen
Chen, Xin
Jia, Jiangbo
Wu, Kaili
Xie, Mingyang
CONTROL ENGINEERING PRACTICE, 2023, 141
[33] Multi-UAV Assisted Offloading Optimization: A Game Combined Reinforcement Learning Approach
Gao, Ang
Wang, Qi
Chen, Kaiyue
Liang, Wei
IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2629 - 2633
[34] Reinforcement Learning based Approach for Multi-UAV Cooperative Searching in Unknown Environments
Yue, Wei
Guan, Xianhe
Xi, Yun
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2018 - 2023
[35] Multi-Agent Deep Reinforcement Learning for Full-Duplex Multi-UAV Networks
Dai, Chen
Zhu, Kun
Hossain, Ekram
2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2232 - 2237
[36] Joint Task Offloading and Resource Allocation in Multi-UAV Multi-Server Systems: An Attention-Based Deep Reinforcement Learning Approach
Wu, Guohua
Liu, Zelin
Fan, Mingfeng
Wu, Keyu
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (08) : 11964 - 11978
[37] Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning
Wang, Huan
Wang, Jintao
SCIENTIFIC REPORTS, 2024, 14 (01)
[38] Integrating human experience in deep reinforcement learning for multi-UAV collision detection and avoidance
Wang, Guanzheng
Xu, Yinbo
Liu, Zhihong
Xu, Xin
Wang, Xiangke
Yan, Jiarun
Industrial Robot, 2022, 49 (02): : 256 - 270
[39] Reinforcement-Learning-Assisted Multi-UAV Task Allocation and Path Planning for IIoT
Zhao, Guodong
Wang, Ye
Mu, Tong
Meng, Zhijun
Wang, Zichen
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (16): : 26766 - 26777
[40] Age-of-Information based Multi-UAV Trajectories Using Deep Reinforcement Learning
Kaur, Amanjot
Jha, Shashi Shekhar
IETE TECHNICAL REVIEW, 2024, 41 (06) : 659 - 671

← 1 2 3 4 5 →