Deep Reinforcement Learning with Double Q-Learning

Cited by: 0

Authors:

van Hasselt, Hado [1]

Guez, Arthur [1]

Silver, David [1]

Affiliation:

[1] Google DeepMind, London, England

Source:

THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2016

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are common, whether they harm performance, and whether they can generally be prevented. In this paper, we answer all these questions affirmatively. In particular, we first show that the recent DQN algorithm, which combines Q-learning with a deep neural network, suffers from substantial overestimations in some games in the Atari 2600 domain. We then show that the idea behind the Double Q-learning algorithm, which was introduced in a tabular setting, can be generalized to work with large-scale function approximation. We propose a specific adaptation to the DQN algorithm and show that the resulting algorithm not only reduces the observed overestimations, as hypothesized, but that this also leads to much better performance on several games.
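As a rough illustration of the adaptation the abstract describes, the sketch below contrasts the standard DQN bootstrap target with a Double DQN style target. In the first, a single set of value estimates both selects and evaluates the next action. In the second, the online network's estimates select the action and the target network's estimates evaluate it. The function names, argument names, and plain-list representation of Q-values are assumptions made for this example and are not taken from the record above.

```python
def dqn_target(reward, gamma, q_target_next, done):
    """Standard DQN target: the target network's values both select and evaluate."""
    if done:
        return reward
    return reward + gamma * max(q_target_next)


def double_dqn_target(reward, gamma, q_online_next, q_target_next, done):
    """Double DQN style target: online values select, target values evaluate."""
    if done:
        return reward
    # Greedy action according to the online network's estimates (hypothetical input).
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    # Value of that action according to the target network's estimates.
    return reward + gamma * q_target_next[best_action]


# Hypothetical next-state Q-values for three actions.
q_online_next = [1.0, 2.0, 0.5]
q_target_next = [1.5, 1.0, 3.0]

y_dqn = dqn_target(1.0, 0.5, q_target_next, done=False)
y_double = double_dqn_target(1.0, 0.5, q_online_next, q_target_next, done=False)
print(y_dqn, y_double)
```

For the same inputs, the second target can never exceed the first, because the evaluated value of any single action is at most the maximum over all actions. This is one way to see why decoupling selection from evaluation tends to reduce overestimation.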


Pages: 2094-2100

Page count: 7
