Real-time Obstacle Avoidance for AUV Based on Reinforcement Learning and Dynamic Window Approach

被引:0
|
作者
Shen, Yue [1 ]
Xu, Han [1 ]
Wang, Dianrui [1 ]
Zhang, Yixiao [1 ]
Yan, Tianhong [2 ]
He, Bo [1 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao, Peoples R China
[2] China Jiliang Univ, Sch Mech Elect Engn, Hangzhou, Peoples R China
关键词
autonomous underwater vehicle; obstacle avoidance; dynamic window approach; Q-learning;
D O I
10.1109/IEEECONF38699.2020.9389357
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
As an important tool for exploring the ocean, autonomous underwater vehicle (AUV) plays an irreplaceable role in various marine activities. Due to the complexity and uncertainty of the marine environment, AUV is required to develop in a more intelligent direction. How to ensure that AUV avoids obstacles and reaches the target point smoothly is a key research issue of the AUV. The dynamic window approach (DWA) is adopted to AUV in this paper to achieve AUV's autonomous obstacle avoidance for static obstacles. The DWA is used to search for the optimal velocity command in its admissible velocity space by maximizing the objective function, however, the weights of its objective function are constant, which makes AUV lack flexibility in complex environments, and even unable to avoid obstacles. To address the above problem, reinforcement learning is introduced to optimize DWA. Q-learning, a reinforcement learning algorithm, is used to learn the weights of the DWA's objective function, which enables appropriate weights can be selected in different environments and improves the applicability of DWA in the complex environment. Compared with the original DWA, the DWA combined with Q-learning is effective and suitable for complex obstacle environments.
引用
下载
收藏
页数:4
相关论文
共 50 条
  • [1] Real-time Obstacle Avoidance of Hovercraft Based on Follow the Gap with Dynamic Window Approach
    Wang, Yuanhui
    She, Wenchao
    Fu, Mingyu
    Ding, Fuguang
    Dai, Shaoshi
    OCEANS 2018 MTS/IEEE CHARLESTON, 2018,
  • [2] AUV Real-time Dynamic Obstacle Avoidance Strategy Based on Relative Motion
    Lv, Chongyang
    Yu, Fei
    Zhu, Minghong
    Xiao, Shu
    ENGINEERING LETTERS, 2019, 27 (01) : 234 - 240
  • [3] A deep reinforcement learning based method for real-time path planning and dynamic obstacle avoidance
    Chen, Pengzhan
    Pei, Jiean
    Lu, Weiqing
    Li, Mingzhen
    NEUROCOMPUTING, 2022, 497 : 64 - 75
  • [4] AUV path tracking with real-time obstacle avoidance via reinforcement learning under adaptive constraints
    Zhang, Chenming
    Cheng, Peng
    Du, Bin
    Dong, Botao
    Zhang, Weidong
    OCEAN ENGINEERING, 2022, 256
  • [5] Real-time Obstacle Avoidance and Person Following Based on Adaptive Window Approach
    Cen, Minfeng
    Huang, Yonglong
    Zhong, Xunyu
    Peng, Xiafu
    Zou, Chaosheng
    2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 64 - 69
  • [6] Real-time obstacle avoidance for polygonal robots with a reduced dynamic window
    Arras, KO
    Persson, J
    Tomatis, N
    Siegwart, R
    2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 3050 - 3055
  • [7] AUV Obstacle Avoidance Planning Based on Deep Reinforcement Learning
    Yuan, Jianya
    Wang, Hongjian
    Zhang, Honghan
    Lin, Changjian
    Yu, Dan
    Li, Chengfeng
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2021, 9 (11)
  • [8] Real-time obstacle avoidance with deep reinforcement learning * Three-Dimensional Autonomous Obstacle Avoidance for UAV
    Yang, Songyue
    Meng, Zhijun
    Chen, Xuzhi
    Xie, Ronglei
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT CONTROL AND ARTIFICIAL INTELLIGENCE (RICAI 2019), 2019, : 324 - 329
  • [9] A dynamic approach to real-time obstacle avoidance control of redundant manipulators
    Ma, S
    Hirose, S
    JSME INTERNATIONAL JOURNAL SERIES C-MECHANICAL SYSTEMS MACHINE ELEMENTS AND MANUFACTURING, 1996, 39 (02) : 317 - 322
  • [10] Dynamic approach to real-time obstacle avoidance control of redundant manipulators
    Ma, Shugen
    Hirose, Shigeo
    JSME International Journal, Series C: Dynamics, Control, Robotics, Design and Manufacturing, 1996, 39 (02): : 317 - 322