Path planning for unmanned surface vehicle based on improved Q-Learning algorithm

被引:4
|
作者
Wang, Yuanhui [1 ,2 ]
Lu, Changzhou [1 ]
Wu, Peng [1 ,2 ]
Zhang, Xiaoyue [1 ,2 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Nantong St 145, Harbin 150001, Peoples R China
[2] Harbin Engn Univ, Sanya Nanhai Innovat & Dev Base, Harbin, Peoples R China
关键词
Q-learning; Unmanned surface vehicle; Path planning; Reinforcement learning; RBF neural network;
D O I
10.1016/j.oceaneng.2023.116510
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Path planning is a key factor for the unmanned surface vehicle (USV) to achieve efficient navigation. In this paper, to solve the global path planning and obstacle avoidance problems for the USV, an improved Q-Learning algorithm called neural network smoothing and fast Q-Learning (NSFQ) is proposed. Three main improvement parts are composed of the proposed algorithm. Firstly, the radial basis function (RBF) neural network is combined with the Q-Learning algorithm to approximate the action value function Q, which improves the convergence speed of the Q-Learning algorithm. Secondly, to ensure that the planned path conforms to the maneuvering characteristics of the USV, the heading angle, motion characteristics, ship length, and safety of the USV are taken into account by the proposed algorithm. Based on these factors, the action space and reward function are optimized, the state space is reconstructed, and the safety threshold is proposed. Finally, a third-order Bezier curve is used to smooth the initial path, so that the USV can maintain its heading stability during navigation. Based on simulation results, the proposed NSFQ algorithm outperforms the A* and RRT algorithms in terms of evaluation indicators such as heading angle, angular velocity, path length, sailing time, and path smoothness.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] An improved ant colony algorithm based on Q-Learning for route planning of autonomous vehicle
    Zhao, Liping
    Li, Feng
    Sun, Dongye
    Zhao, Zihan
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2024, 19 (03)
  • [22] Mobile robot path planning based on Q-learning algorithm
    Li, Shaochuan
    Wang, Xuiqing
    Hu, Liwei
    Liu, Ying
    [J]. 2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 160 - 165
  • [23] A Path Planning Algorithm for Space Manipulator Based on Q-Learning
    Li, Taiguo
    Li, Quanhong
    Li, Wenxi
    Xia, Jiagao
    Tang, Wenhua
    Wang, Weiwen
    [J]. PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 1566 - 1571
  • [24] Coverage Path Planning Optimization Based on Q-Learning Algorithm
    Piardi, Luis
    Lima, Jose
    Pereira, Ana, I
    Costa, Paulo
    [J]. INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116
  • [25] An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning
    Xing, Bowen
    Wang, Xiao
    Yang, Liu
    Liu, Zhenchong
    Wu, Qingyun
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
  • [26] Research on path planning of autonomous vehicle based on RRT algorithm of Q-learning and obstacle distribution
    Shang, Yuze
    Liu, Fei
    Qin, Ping
    Guo, Zhizhong
    Li, Zhe
    [J]. ENGINEERING COMPUTATIONS, 2023, 40 (05) : 1266 - 1286
  • [27] An immune plasma algorithm with Q-learning based pandemic management for path planning of unmanned aerial vehicles
    Aslan, Selcuk
    Demirci, Sercan
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2024, 26
  • [28] Energy Efficient Path Planning Scheme for Unmanned Aerial Vehicle Using Hybrid Generic Algorithm-Based Q-Learning Optimization
    Saeed, Rashid A.
    Ali, Elmustafa Sayed
    Abdelhaq, Maha
    Alsaqour, Raed
    Ahmed, Fatima Rayan Awad
    Saad, Asma Mohammed Elbashir
    [J]. IEEE ACCESS, 2024, 12 : 13400 - 13417
  • [29] Unmanned Ground Vehicle Path Planning Based on Improved DRL Algorithm
    Liu, Lisang
    Chen, Jionghui
    Zhang, Youyuan
    Chen, Jiayu
    Liang, Jingrun
    He, Dongwei
    [J]. ELECTRONICS, 2024, 13 (13)
  • [30] Heuristic Q-learning based on experience replay for three-dimensional path planning of the unmanned aerial vehicle
    Xie, Ronglei
    Meng, Zhijun
    Zhou, Yaoming
    Ma, Yunpeng
    Wu, Zhe
    [J]. SCIENCE PROGRESS, 2020, 103 (01)