A Hybrid Human-in-the-Loop Deep Reinforcement Learning Method for UAV Motion Planning for Long Trajectories with Unpredictable Obstacles

被引:7
|
作者
Zhang, Sitong [1 ]
Li, Yibing [1 ]
Ye, Fang [2 ]
Geng, Xiaoyu [1 ]
Zhou, Zitao [1 ]
Shi, Tuo [3 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Key Lab Adv Marine Commun & Informat Technol, Harbin 150001, Peoples R China
[2] Harbin Engn Univ, Coll Informat & Commun Engn, Natl Key Lab Underwater Acoust Technol, Harbin 150001, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, Sch Comp Sci & Technol, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
unmanned aerial vehicles; collision avoidance; global path planning; DRL-based motion planning; RRT; NAVIGATION;
D O I
10.3390/drones7050311
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Unmanned Aerial Vehicles (UAVs) can be an important component in the Internet of Things (IoT) ecosystem due to their ability to collect and transmit data from remote and hard-to-reach areas. Ensuring collision-free navigation for these UAVs is crucial in achieving this goal. However, existing UAV collision-avoidance methods face two challenges: conventional path-planning methods are energy-intensive and computationally demanding, while deep reinforcement learning (DRL)-based motion-planning methods are prone to make UAVs trapped in complex environments-especially for long trajectories with unpredictable obstacles-due to UAVs' limited sensing ability. To address these challenges, we propose a hybrid collision-avoidance method for the real-time navigation of UAVs in complex environments with unpredictable obstacles. We firstly develop a Human-in-the-Loop DRL (HL-DRL) training module for mapless obstacle avoidance and secondly establish a global-planning module that generates a few points as waypoint guidance. Moreover, a novel goal-updating algorithm is proposed to integrate the HL-DRL training module with the global-planning module by adaptively determining the to-be-reached waypoint. The proposed method is evaluated in different simulated environments. Results demonstrate that our approach can rapidly adapt to changes in environments with short replanning time and prevent the UAV from getting stuck in maze-like environments.
引用
收藏
页数:26
相关论文
共 31 条
  • [1] Human-In-The-Loop Task and Motion Planning for Imitation Learning
    Mandlekar, Ajay
    Garrett, Caelan
    Xu, Danfei
    Fox, Dieter
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [2] HEX: Human-in-the-loop explainability via deep reinforcement learning
    Lash, Michael T.
    [J]. Decision Support Systems, 2024, 187
  • [3] An End-to-End Deep Reinforcement Learning Method for UAV Autonomous Motion Planning
    Cui, Yangjie
    Dong, Xin
    Li, Daochun
    Tu, Zhan
    [J]. 2022 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING, ICRAE, 2022, : 100 - 104
  • [4] Personalization of Hearing Aid Compression by Human-in-the-Loop Deep Reinforcement Learning
    Alamdari, Nasim
    Lobarinas, Edward
    Kehtarnavaz, Nasser
    [J]. IEEE ACCESS, 2020, 8 : 203503 - 203515
  • [5] Thermal comfort management leveraging deep reinforcement learning and human-in-the-loop
    Cicirelli, Franco
    Guerrieri, Antonio
    Mastroianni, Carlo
    Spezzano, Giandomenico
    Vinci, Andrea
    [J]. PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 160 - 165
  • [6] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    [J]. 2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [7] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Hu, Zijian
    Gao, Xiaoguang
    Wan, Kaifang
    Zhai, Yiwei
    Wang, Qianglong
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2021, 34 (12) : 187 - 204
  • [8] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Zijian HU
    Xiaoguang GAO
    Kaifang WAN
    Yiwei ZHAI
    Qianglong WANG
    [J]. Chinese Journal of Aeronautics, 2021, 34 (12) : 187 - 204
  • [9] Deep Reinforcement Active Learning for Human-In-The-Loop Person Re-Identification
    Liu, Zimo
    Wang, Jingya
    Gong, Shaogang
    Lu, Huchuan
    Tao, Dacheng
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6121 - 6130
  • [10] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Zijian HU
    Xiaoguang GAO
    Kaifang WAN
    Yiwei ZHAI
    Qianglong WANG
    [J]. Chinese Journal of Aeronautics, 2021, (12) : 187 - 204