Integrating human experience in deep reinforcement learning for multi-UAV collision detection and avoidance

Authors
Wang, Guanzheng [1]
Xu, Yinbo [1]
Liu, Zhihong [1]
Xu, Xin [1]
Wang, Xiangke [1]
Yan, Jiarun [1]
Affiliations
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep reinforcement learning; Collision detection and avoidance; Fully distributed; HEBA; Integrating human experience; PERCEPTION; ALGORITHM;
DOI
10.1108/IR-06-2021-0116
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Purpose: This paper aims to realize fully distributed multi-UAV collision detection and avoidance based on deep reinforcement learning (DRL), to address the low sample efficiency of DRL and speed up training, and to improve the applicability and reliability of the DRL-based approach in multi-UAV control problems.
Design/methodology/approach: A fully distributed DRL-based collision detection and avoidance approach for multiple unmanned aerial vehicles (UAVs) is proposed. Human experience is integrated into policy training via a human experience-based adviser. The authors also propose a hybrid control method that combines the learning-based policy with traditional model-based control. Extensive experiments, including simulations, real flights and comparative experiments, are conducted to evaluate the performance of the approach.
Findings: A fully distributed multi-UAV collision detection and avoidance method based on DRL is realized. The reward curve shows that integrating human experience significantly accelerates training and yields a higher mean episode reward than the pure DRL method. The experimental results show that the DRL method with human experience integration significantly outperforms the pure DRL method for multi-UAV collision detection and avoidance. The safer flight brought by the hybrid control method has also been validated.
Originality/value: The fully distributed architecture is suitable for large-scale UAV swarms and real applications. Integrating human experience significantly accelerates training compared with the pure DRL method. The proposed hybrid control strategy compensates for the shortcomings of two-dimensional light detection and ranging (LiDAR) and other practical problems in applications.
Pages: 256-270
Page count: 15
Related papers
50 in total
  • [1] Integrating human experience in deep reinforcement learning for multi-UAV collision detection and avoidance
    Wang, Guanzheng
    Xu, Yinbo
    Liu, Zhihong
    Xu, Xin
    Wang, Xiangke
    Yan, Jiarun
    [J]. Industrial Robot, 2022, 49(02): 256-270
  • [2] Collision Detection and Avoidance for Multi-UAV based on Deep Reinforcement Learning
    Wang, Guanzheng
    Liu, Zhihong
    Xiao, Kun
    Xu, Yinbo
    Yang, Lingjie
    Wang, Xiangke
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021: 7783-7789
  • [3] Vision-based Distributed Multi-UAV Collision Avoidance via Deep Reinforcement Learning for Navigation
    Huang, Huaxing
    Zhu, Guijie
    Fan, Zhun
    Zhai, Hao
    Cai, Yuwei
    Shi, Ze
    Dong, Zhaohui
    Hao, Zhifeng
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022: 13745-13752
  • [4] DroneARchery: Human-Drone Interaction through Augmented Reality with Haptic Feedback and Multi-UAV Collision Avoidance Driven by Deep Reinforcement Learning
    Dorzhieva, Ekaterina
    Baza, Ahmed
    Gupta, Ayush
    Fedoseev, Aleksey
    Cabrera, Miguel Altamirano
    Karmanova, Ekaterina
    Tsetserukou, Dzmitry
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2022), 2022: 270-277
  • [5] A Two-Stage Reinforcement Learning Approach for Multi-UAV Collision Avoidance Under Imperfect Sensing
    Wang, Dawei
    Fan, Tingxiang
    Han, Tao
    Pan, Jia
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5(02): 3098-3105
  • [6] Multi-UAV Dynamic Wireless Networking With Deep Reinforcement Learning
    Wang, Qiang
    Zhang, Wenqi
    Liu, Yuanwei
    Liu, Ying
    [J]. IEEE COMMUNICATIONS LETTERS, 2019, 23(12): 2243-2246
  • [7] A Multi-Agent Deep Reinforcement Learning Approach for Practical Decentralized UAV Collision Avoidance
    Thumiger, Nicholas
    Deghat, Mohammad
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2022, 6: 2174-2179
  • [8] Deep-Reinforcement-Learning-Based Collision Avoidance in UAV Environment
    Ouahouah, Sihem
    Bagaa, Miloud
    Prados-Garzon, Jonathan
    Taleb, Tarik
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9(06): 4015-4030
  • [9] Formation Shape Control of Multi-UAV with Collision Avoidance
    Jia, Zhen
    Wan, You-Hong
    Zhou, Ying-Jiang
    Jiang, Guo-Ping
    Zhang, Dou
    [J]. PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018: 305-310
  • [10] Dynamic deployment of multi-UAV base stations with deep reinforcement learning
    Wu, Guanhan
    Jia, Weimin
    Zhao, Jianwei
    [J]. ELECTRONICS LETTERS, 2021, 57(15): 600-602