Improved SARSA and DQN algorithms for reinforcement learning

被引：0

作者：

Yao, Guangyu ^{[1
,2
]}

Zhang, Nan ^{[1
,2
]}

Duan, Zhenhua ^{[1
,2
]}

Tian, Cong ^{[1
,2
]}

机构：

[1] Xidian Univ, Inst Comp Theory & Technol, Xian 710071, Peoples R China

[2] Xidian Univ, ISN Lab, Xian 710071, Peoples R China

来源：

THEORETICAL COMPUTER SCIENCE | 2025年 / 1027卷

关键词：

Machine learning; Reinforcement learning; Deep Q-network; epsilon-greedy policy; Value iteration;

D O I：

10.1016/j.tcs.2024.115025

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Reinforcement learning is a branch of machine learning in which an agent interacts with an environment to learn optimal actions that maximize cumulative rewards. This paper aims to enhance the SARSA and DQN algorithms in four key aspects: the epsilon-greedy policy, reward function, value iteration approach, and sampling probability. The experiments are conducted in three scenarios: path planning, CartPole, and MountainCar. The results show that, in these environments, the improved algorithms exhibit better convergence, higher rewards, and more stable training processes.

引用

页数：15

共 50 条

[31] Evolutionary algorithms for reinforcement learning
Moriarty, DE
Schultz, AC
Grefenstette, JJ
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 : 241 - 276
[32] Evolutionary Algorithms for Reinforcement Learning
Moriarty, David E.
Schultz, Alan C.
Grefenstette, John J.
Journal of Artificial Intelligence Research, 1999, 11 (00): : 241 - 276
[33] Ensemble algorithms in reinforcement learning
Wiering, Marco A.
van Hasselt, Hado
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 930 - 936
[34] REINFORCEMENT LEARNING ALGORITHMS IN ROBOTICS
Bocsi, Botond
Csato, Lehel
KEPT 2011: KNOWLEDGE ENGINEERING PRINCIPLES AND TECHNIQUES, 2011, : 131 - 142
[35] Enhanced Pub/Sub Communications for Massive IoT Traffic with SARSA Reinforcement Learning
Arruda, Carlos E.
Moraes, Pedro F.
Agoulmine, Nazim
Martins, Joberto S. B.
MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 204 - 225
[36] REINFORCEMENT LEARNING - ARCHITECTURES AND ALGORITHMS
KOKAR, MM
REVELIOTIS, SA
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1993, 8 (08) : 875 - 894
[37] Aggregation of reinforcement learning algorithms
Jiang, Ju
Kamel, Mohamed S.
2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 68 - +
[38] Evolving the Behavior of Autonomous Agents in Strategic Combat Scenarios via SARSA Reinforcement Learning
Siebra, Clauirton A.
Botelho Neto, Gutenberg P.
2014 BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES 2014), 2014, : 115 - 122
[39] Enhancing Reinforcement Learning Performance in Delayed Reward System Using DQN and Heuristics
Kim, Keecheon
IEEE Access, 2022, 10 : 50641 - 50650
[40] A-SARSA: A Predictive Container Auto-Scaling Algorithm Based on Reinforcement Learning
Zhang, Shubo
Wu, Tianyang
Pan, Maolin
Zhang, Chaomeng
Yu, Yang
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020), 2020, : 489 - 497

← 1 2 3 4 5 →