Integrating human experience in deep reinforcement learning for multi-UAV collision detection and avoidance

被引:0
|
作者
Wang, Guanzheng [1 ]
Xu, Yinbo [1 ]
Liu, Zhihong [1 ]
Xu, Xin [1 ]
Wang, Xiangke [1 ]
Yan, Jiarun [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Collision detection and avoidance; Fully distributed; HEBA; Integrating human experience; PERCEPTION; ALGORITHM;
D O I
10.1108/IR-06-2021-0116
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Purpose This paper aims to realize a fully distributed multi-UAV collision detection and avoidance based on deep reinforcement learning (DRL). To deal with the problem of low sample efficiency in DRL and speed up the training. To improve the applicability and reliability of the DRL-based approach in multi-UAV control problems. Design/methodology/approach In this paper, a fully distributed collision detection and avoidance approach for multi-UAV based on DRL is proposed. A method that integrates human experience into policy training via a human experience-based adviser is proposed. The authors propose a hybrid control method which combines the learning-based policy with traditional model-based control. Extensive experiments including simulations, real flights and comparative experiments are conducted to evaluate the performance of the approach. Findings A fully distributed multi-UAV collision detection and avoidance method based on DRL is realized. The reward curve shows that the training process when integrating human experience is significantly accelerated and the mean episode reward is higher than the pure DRL method. The experimental results show that the DRL method with human experience integration has a significant improvement than the pure DRL method for multi-UAV collision detection and avoidance. Moreover, the safer flight brought by the hybrid control method has also been validated. Originality/value The fully distributed architecture is suitable for large-scale unmanned aerial vehicle (UAV) swarms and real applications. The DRL method with human experience integration has significantly accelerated the training compared to the pure DRL method. The proposed hybrid control strategy makes up for the shortcomings of two-dimensional light detection and ranging and other puzzles in applications.
引用
收藏
页码:256 / 270
页数:15
相关论文
共 50 条
  • [21] Autonomous target tracking of multi-UAV: A two-stage deep reinforcement learning approach with expert experience
    Wang, Jiahua
    Zhang, Ping
    Wang, Yang
    [J]. APPLIED SOFT COMPUTING, 2023, 145
  • [22] Nonlinear MPC for the multi-UAV system with allocated priority for collision avoidance
    Novak, Dora
    Tebbani, Sihem
    [J]. 2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 7 - 12
  • [23] Multi-Agent Deep Reinforcement Learning for Full-Duplex Multi-UAV Networks
    Dai, Chen
    Zhu, Kun
    Hossain, Ekram
    [J]. 2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2232 - 2237
  • [24] Dynamic Attention Network for Multi-UAV Reinforcement Learning
    Xu, Dongsheng
    Wu, Shang
    [J]. INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
  • [25] Age-of-Information based Multi-UAV Trajectories Using Deep Reinforcement Learning
    Kaur, Amanjot
    Jha, Shashi Shekhar
    [J]. IETE TECHNICAL REVIEW, 2024, 41 (06) : 659 - 671
  • [26] Multi-UAV reconnaissance mission planning via deep reinforcement learning with simulated annealing
    Department of Mechanical Engineering, National University of Singapore, Singapore
    不详
    不详
    不详
    [J]. Swarm Evol. Comput, 2025, 93
  • [27] Minimum Throughput Maximization for Multi-UAV Enabled WPCN: A Deep Reinforcement Learning Method
    Tang, Jie
    Song, Jingru
    Ou, Junhui
    Luo, Jingci
    Zhang, Xiuyin
    Wong, Kai-Kit
    [J]. IEEE ACCESS, 2020, 8 : 9124 - 9132
  • [28] Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches
    Chang, Zheng
    Deng, Hengwei
    You, Li
    Min, Geyong
    Garg, Sahil
    Kaddoum, Georges
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2940 - 2951
  • [29] Minimum Throughput Maximization for Multi-UAV Enabled WPCN: A Deep Reinforcement Learning Method
    Tang, Jie
    Song, Jingru
    Ou, Junhui
    Luo, Jingci
    Zhang, Xiuyin
    Wong, Kai-Kit
    [J]. Ou, Junhui (oujunhui@scut.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc., United States (08): : 9124 - 9132
  • [30] Scalable and Cooperative Deep Reinforcement Learning Approaches for Multi-UAV Systems: A Systematic Review
    Frattolillo, Francesco
    Brunori, Damiano
    Iocchi, Luca
    [J]. DRONES, 2023, 7 (04)