Agent Maze Path Planning Based on Simulated Annealing Q-Learning Algorithm

被引：0

作者：

Mao, Zhongtian ^{[1
]}

Wu, Zipeng ^{[1
]}

Fang, Xiaohan ^{[1
]}

Cheng, Songsong ^{[1
]}

Fan, Yuan ^{[1
]}

机构：

[1] Anhui Univ, Anhui Engn Lab Human Robot Integrat Syst & Intell, Sch Elect Engn & Automat, Hefei 230601, Peoples R China

来源：

2022 41ST CHINESE CONTROL CONFERENCE (CCC) | 2022年

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Q-Learning; Simulated annealing algorithm; Maze path planning;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The problem of path exploration and planning of agents in unknown environments is a popular application problem in the field of reinforcement learning. In this paper, we propose an improved reinforcement learning algorithm called the QLearning algorithm for adaptive exploration based on simulated annealing (AE-SAQL). We apply the algorithm to the agent path planning problem, improve the setting of the reward function and add the feedback information of the environment. By simulating the Metropolis criterion in the annealing algorithm and adding an adaptive adjustment mechanism, the agent fully explores the environment and makes full use of the environmental information, solving the exploration-utilization dilemma during the algorithm and finally enabling the agent to reach the target location safely. Compared with the standard Q-Learning algorithm and SARSA algorithm, AE-SAQL achieves better.

引用

页码：2272 / 2276

页数：5

共 50 条

[41] A novel reinforcement learning framework for disassembly sequence planning using Q-learning technique optimized using an enhanced simulated annealing algorithm
Chand, Mirothali
Ravi, Chandrasekar
[J]. AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2024, 38
[42] Automatic Path Planning for Spraying Drones Based on Deep Q-Learning
Huang, Ya-Yu
Li, Zi-Wen
Yang, Chun-Hao
Huang, Yueh-Min
[J]. JOURNAL OF INTERNET TECHNOLOGY, 2023, 24 (03): : 565 - 575
[43] Q-Learning based Local Path Planning for UAVs with Different Priorities
de Carvalho, Kevin B.
Batista, Hiago O. B.
Fagundes-Junior, Leonardo A.
Brandao, Alexandre S.
[J]. 2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, : 89 - 94
[44] Q-learning based method of adaptive path planning for mobile robot
Li, Yibin
Li, Caihong
Zhang, Zijian
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 983 - 987
[45] Ground Robot Path Planning based on Simulated Annealing Genetic Algorithm
Wang, Lanfei
Guo, Jun
Wang, Qu
Kan, Jiangming
[J]. 2018 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC 2018), 2018, : 417 - 424
[46] Omnidirectional AGV path planning based on simulated annealing genetic algorithm
Niu, Qinyu
Li, Bo
[J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (10): : 3730 - 3741
[47] Research on path planning of stacker based on improved simulated annealing algorithm
[J]. Bian, Heying (bhy9639@163.com), 2018, SHPMedia Sdn Bhd (COMPENDIUM VOL. 1):
[48] An immune plasma algorithm with Q-learning based pandemic management for path planning of unmanned aerial vehicles
Aslan, Selcuk
Demirci, Sercan
[J]. EGYPTIAN INFORMATICS JOURNAL, 2024, 26
[49] Improved Q-Learning Algorithm Based on Flower Pollination Algorithm and Tabulation Method for Unmanned Aerial Vehicle Path Planning
Bo, Lan
Zhang, Tiezhu
Zhang, Hongxin
Yang, Jian
Zhang, Zhen
Zhang, Caihong
Liu, Mingjie
[J]. IEEE ACCESS, 2024, 12 : 104429 - 104444
[50] IMAP-QL: an improved multi-agent pursuit path-planning based on Q-learning
El Habib Souidi, Mohammed
Ledmi, Makhlouf
Maarouk, Toufik Messaoud
Ledmi, Abdeldjalil
Laassami, Ferial
[J]. International Journal of Systems, Control and Communications, 2024, 15 (02) : 159 - 178

← 1 2 3 4 5 →