Agent Maze Path Planning Based on Simulated Annealing Q-Learning Algorithm

被引：0

作者：

Mao, Zhongtian ^{[1
]}

Wu, Zipeng ^{[1
]}

Fang, Xiaohan ^{[1
]}

Cheng, Songsong ^{[1
]}

Fan, Yuan ^{[1
]}

机构：

[1] Anhui Univ, Anhui Engn Lab Human Robot Integrat Syst & Intell, Sch Elect Engn & Automat, Hefei 230601, Peoples R China

来源：

2022 41ST CHINESE CONTROL CONFERENCE (CCC) | 2022年

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Q-Learning; Simulated annealing algorithm; Maze path planning;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The problem of path exploration and planning of agents in unknown environments is a popular application problem in the field of reinforcement learning. In this paper, we propose an improved reinforcement learning algorithm called the QLearning algorithm for adaptive exploration based on simulated annealing (AE-SAQL). We apply the algorithm to the agent path planning problem, improve the setting of the reward function and add the feedback information of the environment. By simulating the Metropolis criterion in the annealing algorithm and adding an adaptive adjustment mechanism, the agent fully explores the environment and makes full use of the environmental information, solving the exploration-utilization dilemma during the algorithm and finally enabling the agent to reach the target location safely. Compared with the standard Q-Learning algorithm and SARSA algorithm, AE-SAQL achieves better.

引用

页码：2272 / 2276

页数：5

共 50 条

[1] Hybrid Path Planning Algorithm of the Mobile Agent Based on Q-Learning
Gao, Tengteng
Li, Caihong
Liu, Guoming
Guo, Na
Wang, Di
Li, Yongdi
[J]. AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (02) : 130 - 142
[2] Hybrid Path Planning Algorithm of the Mobile Agent Based on Q-Learning
Caihong Tengteng Gao
Guoming Li
Na Liu
Di Guo
Yongdi Wang
[J]. Automatic Control and Computer Sciences, 2022, 56 : 130 - 142
[3] A Path Planning Algorithm for UAV Based on Improved Q-Learning
Yan, Chao
Xiang, Xiaojia
[J]. 2018 2ND INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS), 2018, : 46 - 50
[4] A Path Planning Algorithm for Space Manipulator Based on Q-Learning
Li, Taiguo
Li, Quanhong
Li, Wenxi
Xia, Jiagao
Tang, Wenhua
Wang, Weiwen
[J]. PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 1566 - 1571
[5] Mobile robot path planning based on Q-learning algorithm
Li, Shaochuan
Wang, Xuiqing
Hu, Liwei
Liu, Ying
[J]. 2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 160 - 165
[6] Coverage Path Planning Optimization Based on Q-Learning Algorithm
Piardi, Luis
Lima, Jose
Pereira, Ana, I
Costa, Paulo
[J]. INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116
[7] Indoor Emergency Path Planning Based on the Q-Learning Optimization Algorithm
Xu, Shenghua
Gu, Yang
Li, Xiaoyan
Chen, Cai
Hu, Yingyi
Sang, Yu
Jiang, Wenxing
[J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (01)
[8] PATH PLANNING OF MOBILE ROBOT BASED ON THE IMPROVED Q-LEARNING ALGORITHM
Chen, Chaorui
Wang, Dongshu
[J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (03): : 687 - 702
[9] Hybrid Path Planning of A Quadrotor UAV Based on Q-Learning Algorithm
Zhang, Tianze
Huo, Xin
Chen, Songlin
Yang, Baoqing
Zhang, Guojiang
[J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 5415 - 5419
[10] A novel Q-learning algorithm based on improved whale optimization algorithm for path planning
Li, Ying
Wang, Hanyu
Fan, Jiahao
Geng, Yanyu
[J]. PLOS ONE, 2022, 17 (12):

← 1 2 3 4 5 →