Deep Reinforcement Learning With Quantum-Inspired Experience Replay

被引:43
|
作者
Wei, Qing [1 ]
Ma, Hailan [1 ,2 ]
Chen, Chunlin [1 ]
Dong, Daoyi [2 ]
机构
[1] Nanjing Univ, Sch Management & Engn, Dept Control & Syst Engn, Nanjing 210093, Peoples R China
[2] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT 2600, Australia
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Training; Reinforcement learning; Logic gates; Task analysis; Qubit; Neural networks; Transforms; Deep reinforcement learning (DRL); quantum computation; quantum-inspired experience replay (QER); quantum reinforcement learning; ALGORITHMS;
D O I
10.1109/TCYB.2021.3053414
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a novel training paradigm inspired by quantum computation is proposed for deep reinforcement learning (DRL) with experience replay. In contrast to the traditional experience replay mechanism in DRL, the proposed DRL with quantum-inspired experience replay (DRL-QER) adaptively chooses experiences from the replay buffer according to the complexity and the replayed times of each experience (also called transition), to achieve a balance between exploration and exploitation. In DRL-QER, transitions are first formulated in quantum representations and then the preparation operation and depreciation operation are performed on the transitions. In this process, the preparation operation reflects the relationship between the temporal-difference errors (TD-errors) and the importance of the experiences, while the depreciation operation is taken into account to ensure the diversity of the transitions. The experimental results on Atari 2600 games show that DRL-QER outperforms state-of-the-art algorithms, such as DRL-PER and DCRL on most of these games with improved training efficiency and is also applicable to such memory-based DRL approaches as double network and dueling network.
引用
收藏
页码:9326 / 9338
页数:13
相关论文
共 50 条
  • [1] Quantum-Inspired Reinforcement Learning for Quantum Control
    Yu, Haixu
    Zhao, Xudong
    Chen, Chunlin
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024,
  • [2] Robust Quantum-Inspired Reinforcement Learning for Robot Navigation
    Dong, Daoyi
    Chen, Chunlin
    Chu, Jian
    Tarn, Tzyh-Jong
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2012, 17 (01) : 86 - 97
  • [3] Deep Reinforcement Learning with Experience Replay Based on SARSA
    Zhao, Dongbin
    Wang, Haitao
    Shao, Kun
    Zhu, Yuanheng
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [4] Associative Memory Based Experience Replay for Deep Reinforcement Learning
    Li, Mengyuan
    Kazemi, Arman
    Laguna, Ann Franchesca
    Hu, X. Sharon
    [J]. 2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [5] Trial and Error Experience Replay Based Deep Reinforcement Learning
    Zhang, Cheng
    Ma, Liang
    [J]. 4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 221 - 226
  • [6] Knowledge Transfer for Deep Reinforcement Learning with Hierarchical Experience Replay
    Yin, Haiyan
    Pan, Sinno Jialin
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1640 - 1646
  • [7] A Quantum-Inspired Deep Learning Models for Skin Lesion Classification
    Mehdi, Mohamed Ait
    Belattar, Khadidja
    Souami, Feryel
    [J]. QUANTUM COMPUTING: APPLICATIONS AND CHALLENGES, QSAC 2023, 2024, 2 : 194 - 207
  • [8] Path Planning for Cellular-Connected UAV: A DRL Solution With Quantum-Inspired Experience Replay
    Li, Yuanjian
    Aghvami, A. Hamid
    Dong, Daoyi
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (10) : 7897 - 7912
  • [9] Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
    Foerster, Jakob
    Nardelli, Nantas
    Farquhar, Gregory
    Afouras, Triantafyllos
    Torr, Philip H. S.
    Kohli, Pushmeet
    Whiteson, Shimon
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [10] Experience Replay Optimization via ESMM for Stable Deep Reinforcement Learning
    Osei, Richard Sakyi
    Lopez, Daphne
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 715 - 723