AUTONOMOUS NAVIGATION OF UAV IN LARGE-SCALE UNKNOWN COMPLEX ENVIRONMENT WITH DEEP REINFORCEMENT LEARNING

被引:0
|
作者
Wang, Chao [1 ]
Wang, Jian [1 ]
Zhang, Xudong [1 ]
Zhang, Xiao [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
关键词
large-scale autonomous navigation; UAV delivery; deep reinforcement learning; partially observable Markov decision process;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned Aerial Vehicles (UAVs) based delivery is thriving. In this paper, we model autonomous navigation of UAV in large-scale unknown complex environment as a discrete-time continuous control problem and solve it using deep reinforcement learning. Without path planning or map construction, our method enables UAVs to navigate from arbitrary departure places to destinations using only sensory information of local environment and GPS signal. We argue the navigation task is a partially observable Markov decision process (POMDP) and extant recurrent deterministic policy gradient algorithm is less efficient. Consequently, we derive a faster policy learning algorithm for POMDP based on actor-critic architecture. To validate our ideas, we simulate five virtual environments and a virtual UAV flying at a fixed altitude with constant speed. Cognition of local environment is achieved by measuring distances from UAV to obstacles in multiple directions. Simulation results demonstrate the effectiveness of our method.
引用
收藏
页码:858 / 862
页数:5
相关论文
共 50 条
  • [11] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Zijian HU
    Xiaoguang GAO
    Kaifang WAN
    Yiwei ZHAI
    Qianglong WANG
    [J]. Chinese Journal of Aeronautics., 2021, 34 (12) - 204
  • [12] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Hu, Zijian
    Gao, Xiaoguang
    Wan, Kaifang
    Zhai, Yiwei
    Wang, Qianglong
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2021, 34 (12) : 187 - 204
  • [13] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Zijian HU
    Xiaoguang GAO
    Kaifang WAN
    Yiwei ZHAI
    Qianglong WANG
    [J]. Chinese Journal of Aeronautics, 2021, (12) : 187 - 204
  • [14] An Autonomous UAV Navigation System for Unknown Flight Environment
    Huang, Haitao
    Gu, Jingling
    Wang, Qiuhong
    Zhuang, Yi
    [J]. 2019 15TH INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR NETWORKS (MSN 2019), 2019, : 63 - 68
  • [15] Autonomous Navigation of the UAV through Deep Reinforcement Learning with Sensor Perception Enhancement
    Zhao, Senyan
    Wang, Wei
    Li, Jun
    Huang, Subin
    Liu, Sanmin
    Lolli, Francesco
    [J]. Mathematical Problems in Engineering, 2023, 2023
  • [16] Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning Approach
    Bouhamed, Omar
    Ghazzai, Hakim
    Besbes, Hichem
    Massoud, Yehia
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [17] Deep-Reinforcement-Learning-Based Autonomous UAV Navigation With Sparse Rewards
    Wang, Chao
    Wang, Jian
    Wang, Jingjing
    Zhang, Xudong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6180 - 6190
  • [18] Tractable large-scale deep reinforcement learning
    Sarang, Nima
    Poullis, Charalambos
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 232
  • [19] Crowd Navigation in an Unknown and Dynamic Environment Based on Deep Reinforcement Learning
    Sun, Libo
    Zhai, Jinfeng
    Qin, Wenhu
    [J]. IEEE ACCESS, 2019, 7 : 109544 - 109554
  • [20] Large-scale traffic control using autonomous vehicles and decentralized deep reinforcement learning
    Maske, Harshal
    Chu, Tianshu
    Kalabic, Uros
    [J]. 2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3816 - 3821