Improved SARSA and DQN algorithms for reinforcement learning

Cited by: 0
Authors
Yao, Guangyu [1, 2]
Zhang, Nan [1, 2]
Duan, Zhenhua [1, 2]
Tian, Cong [1, 2]
Affiliations
[1] Xidian Univ, Inst Comp Theory & Technol, Xian 710071, Peoples R China
[2] Xidian Univ, ISN Lab, Xian 710071, Peoples R China
Keywords
Machine learning; Reinforcement learning; Deep Q-network; epsilon-greedy policy; Value iteration
DOI
10.1016/j.tcs.2024.115025
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Reinforcement learning is a branch of machine learning in which an agent interacts with an environment to learn optimal actions that maximize cumulative rewards. This paper aims to enhance the SARSA and DQN algorithms in four key aspects: the epsilon-greedy policy, reward function, value iteration approach, and sampling probability. The experiments are conducted in three scenarios: path planning, CartPole, and MountainCar. The results show that, in these environments, the improved algorithms exhibit better convergence, higher rewards, and more stable training processes.
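For orientation, the two baseline ingredients named in the abstract, the epsilon-greedy policy and the SARSA value update, can be sketched as follows. This is a minimal textbook version under stated assumptions, not the improved variants proposed in the paper, which this record does not describe. The function names, the list-of-lists Q-table, and the parameters `alpha`, `gamma`, and `epsilon` are illustrative choices.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Return a random action index with probability epsilon, else a greedy one.

    q_row is the list of Q-values for the current state (assumed layout).
    Ties are broken in favour of the lowest action index.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    return q_row.index(max(q_row))


def sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma, done):
    """Apply one standard on-policy SARSA step in place.

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * Q(s', a') - Q(s, a)),
    with the bootstrap term dropped when the episode has ended.
    """
    target = r if done else r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])
    return q[s][a]


# Tiny usage example on a hypothetical 2-state, 2-action table.
q_table = [[0.0, 0.0], [0.0, 1.0]]
rng = random.Random(0)
action = epsilon_greedy(q_table[1], epsilon=0.0, rng=rng)  # purely greedy choice
updated = sarsa_update(q_table, s=0, a=1, r=0.0, s_next=1, a_next=action,
                       alpha=0.5, gamma=0.9, done=False)
```

Because SARSA bootstraps from the action actually selected in the next state, the exploration rate `epsilon` directly shapes the learned values, which is presumably one reason the epsilon-greedy policy appears among the aspects the paper revisits.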
Pages: 15