Agent Maze Path Planning Based on Simulated Annealing Q-Learning Algorithm

Authors
Mao, Zhongtian [1]
Wu, Zipeng [1]
Fang, Xiaohan [1]
Cheng, Songsong [1]
Fan, Yuan [1]
Affiliations
[1] Anhui Univ, Anhui Engn Lab Human Robot Integrat Syst & Intell, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Reinforcement learning; Q-Learning; Simulated annealing algorithm; Maze path planning;
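DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract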
Path exploration and planning for agents in unknown environments is a popular application problem in reinforcement learning. In this paper, we propose an improved reinforcement learning algorithm, the Q-Learning algorithm for adaptive exploration based on simulated annealing (AE-SAQL). We apply the algorithm to the agent path planning problem, improve the design of the reward function, and incorporate feedback information from the environment. By applying the Metropolis criterion from the simulated annealing algorithm and adding an adaptive adjustment mechanism, the agent fully explores the environment and makes full use of the environmental information, which resolves the exploration-exploitation dilemma during learning and ultimately enables the agent to reach the target location safely. Compared with the standard Q-Learning and SARSA algorithms, AE-SAQL performs better.
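The abstract does not spell out the update rules, so the following is only a minimal sketch of how a Metropolis-style acceptance test is commonly combined with tabular Q-learning. The function names (`metropolis_action`, `q_update`, `cool`), the geometric cooling schedule, and all parameter values are assumptions made for illustration; the paper's adaptive adjustment mechanism and its reward function are not described here and are not modeled.

```python
import math
import random


def metropolis_action(q_row, temperature, rng=random):
    """Choose an action from one Q-table row with a Metropolis-style test.

    A random candidate action replaces the greedy action with probability
    exp((Q[candidate] - Q[greedy]) / temperature). High temperatures favor
    exploration; low temperatures favor the greedy action.
    """
    greedy = max(range(len(q_row)), key=lambda a: q_row[a])
    candidate = rng.randrange(len(q_row))
    # Q[candidate] <= Q[greedy], so the exponent is never positive.
    accept_prob = math.exp((q_row[candidate] - q_row[greedy]) / temperature)
    return candidate if rng.random() < accept_prob else greedy


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on a list-of-lists Q-table."""
    target = reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])


def cool(temperature, rate=0.95, floor=1e-3):
    """Geometric cooling schedule (assumed), bounded below to avoid T = 0."""
    return max(floor, temperature * rate)
```

In a training loop, `metropolis_action` would be called once per step and `cool` once per episode, so that exploration shrinks gradually as the Q-table becomes more reliable.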
Pages: 2272-2276
Number of pages: 5
Related papers
50 in total
  • [41] A novel reinforcement learning framework for disassembly sequence planning using Q-learning technique optimized using an enhanced simulated annealing algorithm
    Chand, Mirothali
    Ravi, Chandrasekar
    [J]. AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2024, 38
  • [42] Automatic Path Planning for Spraying Drones Based on Deep Q-Learning
    Huang, Ya-Yu
    Li, Zi-Wen
    Yang, Chun-Hao
    Huang, Yueh-Min
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2023, 24 (03): : 565 - 575
  • [43] Q-Learning based Local Path Planning for UAVs with Different Priorities
    de Carvalho, Kevin B.
    Batista, Hiago O. B.
    Fagundes-Junior, Leonardo A.
    Brandao, Alexandre S.
    [J]. 2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, : 89 - 94
  • [44] Q-learning based method of adaptive path planning for mobile robot
    Li, Yibin
    Li, Caihong
    Zhang, Zijian
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 983 - 987
  • [45] Ground Robot Path Planning based on Simulated Annealing Genetic Algorithm
    Wang, Lanfei
    Guo, Jun
    Wang, Qu
    Kan, Jiangming
    [J]. 2018 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC 2018), 2018, : 417 - 424
  • [46] Omnidirectional AGV path planning based on simulated annealing genetic algorithm
    Niu, Qinyu
    Li, Bo
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (10): : 3730 - 3741
  • [47] Research on path planning of stacker based on improved simulated annealing algorithm
    [J]. Bian, Heying (bhy9639@163.com), 2018, SHPMedia Sdn Bhd (COMPENDIUM VOL. 1):
  • [48] An immune plasma algorithm with Q-learning based pandemic management for path planning of unmanned aerial vehicles
    Aslan, Selcuk
    Demirci, Sercan
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2024, 26
  • [49] Improved Q-Learning Algorithm Based on Flower Pollination Algorithm and Tabulation Method for Unmanned Aerial Vehicle Path Planning
    Bo, Lan
    Zhang, Tiezhu
    Zhang, Hongxin
    Yang, Jian
    Zhang, Zhen
    Zhang, Caihong
    Liu, Mingjie
    [J]. IEEE ACCESS, 2024, 12 : 104429 - 104444
  • [50] IMAP-QL: an improved multi-agent pursuit path-planning based on Q-learning
    El Habib Souidi, Mohammed
    Ledmi, Makhlouf
    Maarouk, Toufik Messaoud
    Ledmi, Abdeldjalil
    Laassami, Ferial
    [J]. International Journal of Systems, Control and Communications, 2024, 15 (02) : 159 - 178