Deep Reinforcement Learning with Double Q-Learning

Cited by: 0
Authors
van Hasselt, Hado [1]
Guez, Arthur [1]
Silver, David [1]
Affiliations
[1] Google DeepMind, London, England
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are common, whether they harm performance, and whether they can generally be prevented. In this paper, we answer all these questions affirmatively. In particular, we first show that the recent DQN algorithm, which combines Q-learning with a deep neural network, suffers from substantial overestimations in some games in the Atari 2600 domain. We then show that the idea behind the Double Q-learning algorithm, which was introduced in a tabular setting, can be generalized to work with large-scale function approximation. We propose a specific adaptation to the DQN algorithm and show that the resulting algorithm not only reduces the observed overestimations, as hypothesized, but that this also leads to much better performance on several games.
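The abstract does not spell out the proposed adaptation. As a minimal sketch, assuming the usual reading of Double Q-learning applied to DQN (the online network selects the greedy next action, and the target network evaluates it), the two bootstrap targets can be compared as follows. The function names and the toy action values are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: function names and toy numbers are assumptions,
# not taken from the record above.

def dqn_target(reward, gamma, q_target_next, done):
    """DQN-style target: the target network both selects and evaluates the next action."""
    if done:
        return reward
    return reward + gamma * max(q_target_next)


def double_dqn_target(reward, gamma, q_online_next, q_target_next, done):
    """Double Q-learning style target: the online network selects the next action,
    and the target network evaluates that action."""
    if done:
        return reward
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best_action]


# Hypothetical next-state action values from the two networks.
q_online_next = [1.0, 2.0, 1.5]
q_target_next = [1.2, 1.1, 3.0]

y_dqn = dqn_target(0.0, 0.5, q_target_next, done=False)
y_double = double_dqn_target(0.0, 0.5, q_online_next, q_target_next, done=False)
```

In this toy case the max-based target takes the largest target-network value, while the decoupled target uses the value of the action the online network prefers, which is smaller here. That gap is the kind of overestimation the abstract refers to, shown under the stated assumptions and not as the authors' exact implementation.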
Pages: 2094-2100
Number of pages: 7