Improved SARSA and DQN algorithms for reinforcement learning

Cited by: 0
Authors
Yao, Guangyu [1, 2]
Zhang, Nan [1, 2]
Duan, Zhenhua [1, 2]
Tian, Cong [1, 2]
Affiliations
[1] Xidian Univ, Inst Comp Theory & Technol, Xian 710071, Peoples R China
[2] Xidian Univ, ISN Lab, Xian 710071, Peoples R China
Keywords
Machine learning; Reinforcement learning; Deep Q-network; epsilon-greedy policy; Value iteration
DOI
10.1016/j.tcs.2024.115025
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Reinforcement learning is a branch of machine learning in which an agent interacts with an environment to learn optimal actions that maximize cumulative rewards. This paper aims to enhance the SARSA and DQN algorithms in four key aspects: the epsilon-greedy policy, reward function, value iteration approach, and sampling probability. The experiments are conducted in three scenarios: path planning, CartPole, and MountainCar. The results show that, in these environments, the improved algorithms exhibit better convergence, higher rewards, and more stable training processes.
Pages: 15
Related papers
50 in total
  • [1] Least-Squares SARSA(λ) Algorithms for Reinforcement Learning
    Chen, Sheng-Lei
    Wei, Yan-Mei
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 632 - +
  • [2] Swarm Reinforcement Learning Algorithms Based on Sarsa Method
    Iima, Hitoshi
    Kuroe, Yasuaki
    2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1963 - 1967
  • [3] Reinforcement Learning Algorithms for Autonomous Mission Accomplishment by Unmanned Aerial Vehicles: A Comparative View with DQN, SARSA and A2C
    Jimenez, Gonzalo Aguilar
    Hueso, Arturo de la Escalera
    Gomez-Silva, Maria J.
    SENSORS, 2023, 23 (21)
  • [4] An Improved Sarsa(λ) Reinforcement Learning Algorithm for Wireless Communication Systems
    Jiang, Hao
    Gui, Renjie
    Chen, Zhen
    Wu, Liang
    Dang, Jian
    Zhou, Jie
    IEEE ACCESS, 2019, 7 : 115418 - 115427
  • [5] Factored SARSA(λ) algorithm of reinforcement learning
    Chen, H.W.
    Xie, J.P.
    Xie, L.J.
    2001, Science Press (38):
  • [6] Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks
    Ramachandran, Deepak
    Gupta, Rakesh
    ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3327 - +
  • [7] Asymmetric DQN for Partially Observable Reinforcement Learning
    Baisero, Andrea
    Daley, Brett
    Amato, Christopher
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 107 - 117
  • [8] Reinforcement Learning-based Control of a Buck Converter: A Comparative Study of DQN and DDPG Algorithms
    Shahnooshi, Shima
    Ranjbaran, Parisa
    Ebrahimi, Javad
    Bakhshai, Alireza
    Jain, Praveen
    2023 25TH EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS, EPE'23 ECCE EUROPE, 2023,
  • [9] Deep Reinforcement Learning with Experience Replay Based on SARSA
    Zhao, Dongbin
    Wang, Haitao
    Shao, Kun
    Zhu, Yuanheng
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [10] A Reinforcement Learning Approach to the Shepherding Task Using SARSA
    Go, Clark Kendrick
    Lao, Bryan
    Yoshimoto, Junichiro
    Ikeda, Kazushi
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3833 - 3836