Improved SARSA and DQN algorithms for reinforcement learning

被引：0

作者：

Yao, Guangyu ^{[1
,2
]}

Zhang, Nan ^{[1
,2
]}

Duan, Zhenhua ^{[1
,2
]}

Tian, Cong ^{[1
,2
]}

机构：

[1] Xidian Univ, Inst Comp Theory & Technol, Xian 710071, Peoples R China

[2] Xidian Univ, ISN Lab, Xian 710071, Peoples R China

来源：

THEORETICAL COMPUTER SCIENCE | 2025年 / 1027卷

关键词：

Machine learning; Reinforcement learning; Deep Q-network; epsilon-greedy policy; Value iteration;

D O I：

10.1016/j.tcs.2024.115025

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Reinforcement learning is a branch of machine learning in which an agent interacts with an environment to learn optimal actions that maximize cumulative rewards. This paper aims to enhance the SARSA and DQN algorithms in four key aspects: the epsilon-greedy policy, reward function, value iteration approach, and sampling probability. The experiments are conducted in three scenarios: path planning, CartPole, and MountainCar. The results show that, in these environments, the improved algorithms exhibit better convergence, higher rewards, and more stable training processes.

引用

页数：15

共 50 条

[1] Least-Squares SARSA(λ) Algorithms for Reinforcement Learning
Chen, Sheng-Lei
Wei, Yan-Mei
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 632 - +
[2] Swarm Reinforcement Learning Algorithms Based on Sarsa Method
Iima, Hitoshi
Kuroe, Yasuaki
2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1963 - 1967
[3] Reinforcement Learning Algorithms for Autonomous Mission Accomplishment by Unmanned Aerial Vehicles: A Comparative View with DQN, SARSA and A2C
Jimenez, Gonzalo Aguilar
Hueso, Arturo de la Escalera
Gomez-Silva, Maria J.
SENSORS, 2023, 23 (21)
[4] An Improved Sarsa(λ) Reinforcement Learning Algorithm for Wireless Communication Systems
Jiang, Hao
Gui, Renjie
Chen, Zhen
Wu, Liang
Dang, Jian
Zhou, Jie
IEEE ACCESS, 2019, 7 : 115418 - 115427
[5] Factored SARSA(λ) algorithm of reinforcement learning
Chen, H.W.
Xie, J.P.
Xie, L.J.
2001, Science Press (38):
[6] Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks
Ramachandran, Deepak
Gupta, Rakesh
ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3327 - +
[7] Asymmetric DQN for Partially Observable Reinforcement Learning
Baisero, Andrea
Daley, Brett
Amato, Christopher
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 107 - 117
[8] Reinforcement Learning-based Control of a Buck Converter: A Comparative Study of DQN and DDPG Algorithms
Shahnooshi, Shima
Ranjbaran, Parisa
Ebrahimi, Javad
Bakhshai, Alireza
Jain, Praveen
2023 25TH EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS, EPE'23 ECCE EUROPE, 2023,
[9] Deep Reinforcement Learning with Experience Replay Based on SARSA
Zhao, Dongbin
Wang, Haitao
Shao, Kun
Zhu, Yuanheng
PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
[10] A Reinforcement Learning Approach to the Shepherding Task Using SARSA
Go, Clark Kendrick
Lao, Bryan
Yoshimoto, Junichiro
Ikeda, Kazushi
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3833 - 3836

← 1 2 3 4 5 →