Path planning for unmanned surface vehicle based on improved Q-Learning algorithm

被引：4

作者：

Wang, Yuanhui ^{[1
,2
]}

Lu, Changzhou ^{[1
]}

Wu, Peng ^{[1
,2
]}

Zhang, Xiaoyue ^{[1
,2
]}

机构：

[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Nantong St 145, Harbin 150001, Peoples R China

[2] Harbin Engn Univ, Sanya Nanhai Innovat & Dev Base, Harbin, Peoples R China

来源：

OCEAN ENGINEERING | 2024年 / 292卷

关键词：

Q-learning; Unmanned surface vehicle; Path planning; Reinforcement learning; RBF neural network;

D O I：

10.1016/j.oceaneng.2023.116510

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Path planning is a key factor for the unmanned surface vehicle (USV) to achieve efficient navigation. In this paper, to solve the global path planning and obstacle avoidance problems for the USV, an improved Q-Learning algorithm called neural network smoothing and fast Q-Learning (NSFQ) is proposed. Three main improvement parts are composed of the proposed algorithm. Firstly, the radial basis function (RBF) neural network is combined with the Q-Learning algorithm to approximate the action value function Q, which improves the convergence speed of the Q-Learning algorithm. Secondly, to ensure that the planned path conforms to the maneuvering characteristics of the USV, the heading angle, motion characteristics, ship length, and safety of the USV are taken into account by the proposed algorithm. Based on these factors, the action space and reward function are optimized, the state space is reconstructed, and the safety threshold is proposed. Finally, a third-order Bezier curve is used to smooth the initial path, so that the USV can maintain its heading stability during navigation. Based on simulation results, the proposed NSFQ algorithm outperforms the A* and RRT algorithms in terms of evaluation indicators such as heading angle, angular velocity, path length, sailing time, and path smoothness.

引用

页数：15

共 50 条

[21] An improved ant colony algorithm based on Q-Learning for route planning of autonomous vehicle
Zhao, Liping
Li, Feng
Sun, Dongye
Zhao, Zihan
[J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2024, 19 (03)
[22] Mobile robot path planning based on Q-learning algorithm
Li, Shaochuan
Wang, Xuiqing
Hu, Liwei
Liu, Ying
[J]. 2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 160 - 165
[23] A Path Planning Algorithm for Space Manipulator Based on Q-Learning
Li, Taiguo
Li, Quanhong
Li, Wenxi
Xia, Jiagao
Tang, Wenhua
Wang, Weiwen
[J]. PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 1566 - 1571
[24] Coverage Path Planning Optimization Based on Q-Learning Algorithm
Piardi, Luis
Lima, Jose
Pereira, Ana, I
Costa, Paulo
[J]. INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116
[25] An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning
Xing, Bowen
Wang, Xiao
Yang, Liu
Liu, Zhenchong
Wu, Qingyun
[J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
[26] Research on path planning of autonomous vehicle based on RRT algorithm of Q-learning and obstacle distribution
Shang, Yuze
Liu, Fei
Qin, Ping
Guo, Zhizhong
Li, Zhe
[J]. ENGINEERING COMPUTATIONS, 2023, 40 (05) : 1266 - 1286
[27] An immune plasma algorithm with Q-learning based pandemic management for path planning of unmanned aerial vehicles
Aslan, Selcuk
Demirci, Sercan
[J]. EGYPTIAN INFORMATICS JOURNAL, 2024, 26
[28] Energy Efficient Path Planning Scheme for Unmanned Aerial Vehicle Using Hybrid Generic Algorithm-Based Q-Learning Optimization
Saeed, Rashid A.
Ali, Elmustafa Sayed
Abdelhaq, Maha
Alsaqour, Raed
Ahmed, Fatima Rayan Awad
Saad, Asma Mohammed Elbashir
[J]. IEEE ACCESS, 2024, 12 : 13400 - 13417
[29] Unmanned Ground Vehicle Path Planning Based on Improved DRL Algorithm
Liu, Lisang
Chen, Jionghui
Zhang, Youyuan
Chen, Jiayu
Liang, Jingrun
He, Dongwei
[J]. ELECTRONICS, 2024, 13 (13)
[30] Heuristic Q-learning based on experience replay for three-dimensional path planning of the unmanned aerial vehicle
Xie, Ronglei
Meng, Zhijun
Zhou, Yaoming
Ma, Yunpeng
Wu, Zhe
[J]. SCIENCE PROGRESS, 2020, 103 (01)

← 1 2 3 4 5 →