Faster Deep Q-learning using Neural Episodic Control

Cited by: 4
Authors
Nishio, Daichi [1]
Yamane, Satoshi [1]
Affiliations
[1] Kanazawa Univ, Inst Sci & Engn, Kanazawa, Ishikawa, Japan
Keywords
Deep reinforcement learning; DQN; Neural Episodic Control; Sample efficiency
DOI
10.1109/COMPSAC.2018.00075
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Deep reinforcement learning, which estimates Q-values using deep learning, has recently attracted the interest of researchers. In deep reinforcement learning, it is important to learn efficiently from the experiences that an agent collects by exploring its environment. We propose NEC2DQN, which improves the learning speed of an algorithm with poor sample efficiency, such as DQN, by using a sample-efficient algorithm, such as NEC, at the beginning of learning. We show in experiments on Pong that it learns faster than Double DQN or N-step DQN.
Pages: 486 - 491
Page count: 6
Related papers
50 in total
  • [1] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
    Tan, Fuxiao
    Yan, Pengfei
    Guan, Xinping
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
  • [2] QLP: Deep Q-Learning for Pruning Deep Neural Networks
    Camci, Efe
    Gupta, Manas
    Wu, Min
    Lin, Jie
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6488 - 6501
  • [3] Deep Q-learning: A robust control approach
    Varga, Balazs
    Kulcsar, Balazs
    Chehreghani, Morteza Haghir
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (01) : 526 - 544
  • [4] An Online Home Energy Management System using Q-Learning and Deep Q-Learning
    İzmitligil, Hasan
    Karamancıoğlu, Abdurrahman
    [J]. Sustainable Computing: Informatics and Systems, 2024, 43
  • [5] Neural Q-learning
    ten Hagen, S
    Kröse, B
    [J]. NEURAL COMPUTING & APPLICATIONS, 2003, 12 (02) : 81 - 88
  • [6] Neural Q-learning
    Stephan ten Hagen
    Ben Kröse
    [J]. Neural Computing & Applications, 2003, 12 : 81 - 88
  • [7] Deep Spatial Q-Learning for Infectious Disease Control
    Zhishuai Liu
    Jesse Clifton
    Eric B. Laber
    John Drake
    Ethan X. Fang
    [J]. Journal of Agricultural, Biological and Environmental Statistics, 2023, 28 : 749 - 773
  • [8] Deep Spatial Q-Learning for Infectious Disease Control
    Liu, Zhishuai
    Clifton, Jesse
    Laber, Eric B.
    Drake, John
    Fang, Ethan X.
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2023, 28 (04) : 749 - 773
  • [9] Fuzzy neural control of systems with unknown dynamic using Q-learning strategies
    Kwok, DP
    Deng, ZD
    Li, CK
    Leung, TP
    Sun, ZQ
    Wong, JCK
    [J]. PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 482 - 487
  • [10] Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning
    Ohnishi, Shota
    Uchibe, Eiji
    Yamaguchi, Yotaro
    Nakanishi, Kosuke
    Yasui, Yuji
    Ishii, Shin
    [J]. FRONTIERS IN NEUROROBOTICS, 2019, 13