Optimal path planning method based on epsilon-greedy Q-learning algorithm

被引:11
|
作者
Bulut, Vahide [1 ]
机构
[1] Izmir Katip Celebi Univ, Dept Engn Sci, TR-35620 Izmir, Turkey
关键词
Path planning; Mobile robot; Improved epsilon-greedy Q-learning; Reward function; Quintic trigonometric Bezier curve; MOBILE ROBOT;
D O I
10.1007/s40430-022-03399-w
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Path planning in an environment with obstacles is an ongoing problem for mobile robots. Q-learning algorithm increases its importance due to its utility in interacting with the environment. However, the size of state space and computational cost are the main parts to be improved. Hence, this paper proposes an improved epsilon-greedy Q-learning (IEGQL) algorithm to enhance efficiency and productivity regarding path length and computational cost. It is important to determine an effective reward function and adjust the agent's next action to ensure exploitation and exploration. We present a new reward function to ensure the environment's knowledge in advance for a mobile robot. Additionally, novel mathematical modeling is proposed to provide the optimal selection besides ensuring a rapid convergence. Since a mobile robot has difficulty moving through the path with sharp corners, the smooth path is formed after obtaining the optimal skeleton path. Furthermore, a real-world experiment is given based on the multi-objective function. The benchmark of the proposed IEGQL algorithm with the classical EGQL and A-star algorithms is presented. The experimental results and performance analysis indicate that the IEGQL algorithm generates the optimal path based on path length, computation time, low jerk, and staying closer to the optimal skeleton path.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Research on path planning of autonomous vehicle based on RRT algorithm of Q-learning and obstacle distribution
    Shang, Yuze
    Liu, Fei
    Qin, Ping
    Guo, Zhizhong
    Li, Zhe
    [J]. ENGINEERING COMPUTATIONS, 2023, 40 (05) : 1266 - 1286
  • [42] Path Planning of Intelligent Radar Anti-jamming Matrix based on Q-Learning Algorithm
    Shi, Shasha
    Zhou, Qingsong
    Qian, Jialong
    Shi, Shujie
    [J]. Proceedings of SPIE - The International Society for Optical Engineering, 2024, 13107
  • [43] Solving the optimal path planning of a mobile robot using improved Q-learning
    Low, Ee Soong
    Ong, Pauline
    Cheah, Kah Chun
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 115 : 143 - 161
  • [44] ε-QLMR : ε-greedy based Q-Learning algorithm for Multipath Routing in SDN networks
    Hassen, Houda
    Meherzi, Soumaya
    Ben Jemaa, Zouhair
    [J]. 2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 234 - 239
  • [45] Application of Improved Q-Learning Algorithm in Dynamic Path Planning for Aircraft at Airports
    Xiang, Zheng
    Sun, Heyang
    Zhang, Jiahao
    [J]. IEEE ACCESS, 2023, 11 : 107892 - 107905
  • [46] Q-LEARNING ALGORITHM FOR PATH-PLANNING TO MANEUVER THROUGH A SATELLITE CLUSTER
    Chu, Xiaoyu
    Alfriend, Kyle T.
    Zhang, Jingrui
    Zhang, Yao
    [J]. ASTRODYNAMICS 2018, PTS I-IV, 2019, 167 : 2063 - 2082
  • [47] A multi-objective hyper-heuristic algorithm based on adaptive epsilon-greedy selection
    Tailong Yang
    Shuyan Zhang
    Cuixia Li
    [J]. Complex & Intelligent Systems, 2021, 7 : 765 - 780
  • [48] A Dynamic Planning Algorithm based on Q-Learning Routing in SDON
    Shang, Jingkun
    Li, Hui
    Man, Xiangkun
    Wu, Fang
    Zhao, Jia Wei
    Ma, Xiaomei
    [J]. 2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,
  • [49] Weighted Rendezvous Planning on Q-Learning Based Adaptive Zone Partition with PSO Based Optimal Path Selection
    Senthil Kumar, V.
    Prasanth, K.
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2020, 110 (01) : 153 - 167
  • [50] Weighted Rendezvous Planning on Q-Learning Based Adaptive Zone Partition with PSO Based Optimal Path Selection
    V. Senthil kumar
    K. Prasanth
    [J]. Wireless Personal Communications, 2020, 110 : 153 - 167