Optimal path planning method based on epsilon-greedy Q-learning algorithm

被引：11

作者：

Bulut, Vahide ^{[1
]}

机构：

[1] Izmir Katip Celebi Univ, Dept Engn Sci, TR-35620 Izmir, Turkey

来源：

JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING | 2022年 / 44卷 / 03期

关键词：

Path planning; Mobile robot; Improved epsilon-greedy Q-learning; Reward function; Quintic trigonometric Bezier curve; MOBILE ROBOT;

D O I：

10.1007/s40430-022-03399-w

中图分类号：

TH [机械、仪表工业];

学科分类号：

0802 ;

摘要：

Path planning in an environment with obstacles is an ongoing problem for mobile robots. Q-learning algorithm increases its importance due to its utility in interacting with the environment. However, the size of state space and computational cost are the main parts to be improved. Hence, this paper proposes an improved epsilon-greedy Q-learning (IEGQL) algorithm to enhance efficiency and productivity regarding path length and computational cost. It is important to determine an effective reward function and adjust the agent's next action to ensure exploitation and exploration. We present a new reward function to ensure the environment's knowledge in advance for a mobile robot. Additionally, novel mathematical modeling is proposed to provide the optimal selection besides ensuring a rapid convergence. Since a mobile robot has difficulty moving through the path with sharp corners, the smooth path is formed after obtaining the optimal skeleton path. Furthermore, a real-world experiment is given based on the multi-objective function. The benchmark of the proposed IEGQL algorithm with the classical EGQL and A-star algorithms is presented. The experimental results and performance analysis indicate that the IEGQL algorithm generates the optimal path based on path length, computation time, low jerk, and staying closer to the optimal skeleton path.

引用

页数：14

共 50 条

[41] Research on path planning of autonomous vehicle based on RRT algorithm of Q-learning and obstacle distribution
Shang, Yuze
Liu, Fei
Qin, Ping
Guo, Zhizhong
Li, Zhe
[J]. ENGINEERING COMPUTATIONS, 2023, 40 (05) : 1266 - 1286
[42] Path Planning of Intelligent Radar Anti-jamming Matrix based on Q-Learning Algorithm
Shi, Shasha
Zhou, Qingsong
Qian, Jialong
Shi, Shujie
[J]. Proceedings of SPIE - The International Society for Optical Engineering, 2024, 13107
[43] Solving the optimal path planning of a mobile robot using improved Q-learning
Low, Ee Soong
Ong, Pauline
Cheah, Kah Chun
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 115 : 143 - 161
[44] ε-QLMR : ε-greedy based Q-Learning algorithm for Multipath Routing in SDN networks
Hassen, Houda
Meherzi, Soumaya
Ben Jemaa, Zouhair
[J]. 2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 234 - 239
[45] Application of Improved Q-Learning Algorithm in Dynamic Path Planning for Aircraft at Airports
Xiang, Zheng
Sun, Heyang
Zhang, Jiahao
[J]. IEEE ACCESS, 2023, 11 : 107892 - 107905
[46] Q-LEARNING ALGORITHM FOR PATH-PLANNING TO MANEUVER THROUGH A SATELLITE CLUSTER
Chu, Xiaoyu
Alfriend, Kyle T.
Zhang, Jingrui
Zhang, Yao
[J]. ASTRODYNAMICS 2018, PTS I-IV, 2019, 167 : 2063 - 2082
[47] A multi-objective hyper-heuristic algorithm based on adaptive epsilon-greedy selection
Tailong Yang
Shuyan Zhang
Cuixia Li
[J]. Complex & Intelligent Systems, 2021, 7 : 765 - 780
[48] A Dynamic Planning Algorithm based on Q-Learning Routing in SDON
Shang, Jingkun
Li, Hui
Man, Xiangkun
Wu, Fang
Zhao, Jia Wei
Ma, Xiaomei
[J]. 2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,
[49] Weighted Rendezvous Planning on Q-Learning Based Adaptive Zone Partition with PSO Based Optimal Path Selection
Senthil Kumar, V.
Prasanth, K.
[J]. WIRELESS PERSONAL COMMUNICATIONS, 2020, 110 (01) : 153 - 167
[50] Weighted Rendezvous Planning on Q-Learning Based Adaptive Zone Partition with PSO Based Optimal Path Selection
V. Senthil kumar
K. Prasanth
[J]. Wireless Personal Communications, 2020, 110 : 153 - 167

← 1 2 3 4 5 →