Real-time Obstacle Avoidance for AUV Based on Reinforcement Learning and Dynamic Window Approach

被引:0
|
作者
Shen, Yue [1 ]
Xu, Han [1 ]
Wang, Dianrui [1 ]
Zhang, Yixiao [1 ]
Yan, Tianhong [2 ]
He, Bo [1 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao, Peoples R China
[2] China Jiliang Univ, Sch Mech Elect Engn, Hangzhou, Peoples R China
关键词
autonomous underwater vehicle; obstacle avoidance; dynamic window approach; Q-learning;
D O I
10.1109/IEEECONF38699.2020.9389357
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
As an important tool for exploring the ocean, autonomous underwater vehicle (AUV) plays an irreplaceable role in various marine activities. Due to the complexity and uncertainty of the marine environment, AUV is required to develop in a more intelligent direction. How to ensure that AUV avoids obstacles and reaches the target point smoothly is a key research issue of the AUV. The dynamic window approach (DWA) is adopted to AUV in this paper to achieve AUV's autonomous obstacle avoidance for static obstacles. The DWA is used to search for the optimal velocity command in its admissible velocity space by maximizing the objective function, however, the weights of its objective function are constant, which makes AUV lack flexibility in complex environments, and even unable to avoid obstacles. To address the above problem, reinforcement learning is introduced to optimize DWA. Q-learning, a reinforcement learning algorithm, is used to learn the weights of the DWA's objective function, which enables appropriate weights can be selected in different environments and improves the applicability of DWA in the complex environment. Compared with the original DWA, the DWA combined with Q-learning is effective and suitable for complex obstacle environments.
引用
下载
收藏
页数:4
相关论文
共 50 条
  • [21] Real-Time Robot Path Planning for Dynamic Obstacle Avoidance
    Charalampous, Konstantinos
    Kostavelis, Ioannis
    Amanatiadis, Angelos
    Gasteratos, Antonios
    JOURNAL OF CELLULAR AUTOMATA, 2014, 9 (2-3) : 195 - 208
  • [22] Real-time robot path planning for dynamic obstacle avoidance
    Charalampous, Konstantinos, 1600, Old City Publishing (09): : 2 - 3
  • [23] Extended dynamic system modulation for real-time obstacle avoidance
    Zhang, Zhide
    Wang, Zhengjie
    Yu, Jin
    CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (12) : 212 - 225
  • [24] Extended dynamic system modulation for real-time obstacle avoidance
    Zhide ZHANG
    Zhengjie WANG
    Jin YU
    Chinese Journal of Aeronautics, 2022, 35 (12) : 212 - 225
  • [25] IVFH*: Real-time Dynamic Obstacle Avoidance for Mobile Robots
    Dong Jie
    Ma Xueming
    Peng Kaixiang
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 844 - 847
  • [26] Fast Obstacle Avoidance Based on Real-Time Sensing
    Huber, Lukas
    Slotine, Jean-Jacques
    Billard, Aude
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (03) : 1375 - 1382
  • [27] Obstacle avoidance for mobile robot based on improved dynamic window approach
    Li, Xiuyun
    Liu, Fei
    Liu, Juan
    Liang, Shan
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (02) : 666 - 676
  • [28] Application of Reinforcement Learning Based on Neural Network to Dynamic Obstacle Avoidance
    Qiao, Junfei
    Hou, Zhanjun
    Ruan, Xiaogang
    2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 784 - 788
  • [29] A tractable convergent dynamic window approach to obstacle avoidance
    Ögren, P
    Leonard, NE
    2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, 2002, : 595 - 600
  • [30] Reinforcement Learning with Dynamic Movement Primitives for Obstacle Avoidance
    Li, Ang
    Liu, Zhenze
    Wang, Wenrui
    Zhu, Mingchao
    Li, Yanhui
    Huo, Qi
    Dai, Ming
    APPLIED SCIENCES-BASEL, 2021, 11 (23):