Deep Reinforcement Learning With Quantum-Inspired Experience Replay

Cited by: 43

Authors:

Wei, Qing [1]

Ma, Hailan [1,2]

Chen, Chunlin [1]

Dong, Daoyi [2]

Affiliations:

[1] Nanjing Univ, Sch Management & Engn, Dept Control & Syst Engn, Nanjing 210093, Peoples R China

[2] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT 2600, Australia

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2022, Vol. 52, No. 9

Funding:

National Natural Science Foundation of China; Australian Research Council

Keywords:

Training; Reinforcement learning; Logic gates; Task analysis; Qubit; Neural networks; Transforms; Deep reinforcement learning (DRL); quantum computation; quantum-inspired experience replay (QER); quantum reinforcement learning; ALGORITHMS;

DOI:

10.1109/TCYB.2021.3053414

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

In this article, a novel training paradigm inspired by quantum computation is proposed for deep reinforcement learning (DRL) with experience replay. In contrast to the traditional experience replay mechanism in DRL, the proposed DRL with quantum-inspired experience replay (DRL-QER) adaptively chooses experiences from the replay buffer according to the complexity and the replayed times of each experience (also called transition), to achieve a balance between exploration and exploitation. In DRL-QER, transitions are first formulated in quantum representations and then the preparation operation and depreciation operation are performed on the transitions. In this process, the preparation operation reflects the relationship between the temporal-difference errors (TD-errors) and the importance of the experiences, while the depreciation operation is taken into account to ensure the diversity of the transitions. The experimental results on Atari 2600 games show that DRL-QER outperforms state-of-the-art algorithms, such as DRL-PER and DCRL on most of these games with improved training efficiency and is also applicable to such memory-based DRL approaches as double network and dueling network.
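The abstract describes the sampling idea only in words, so the following is a minimal sketch of how such a scheme could look, not the authors' implementation. It assumes each transition is modelled as a single-qubit state cos(θ)|0⟩ + sin(θ)|1⟩, where sin²(θ) is the chance of being selected for replay. The functions `prepare` and `depreciate`, the linear TD-error mapping, the `step` size, and the `eps` floor are all hypothetical choices made for illustration; the paper's actual operators and parameters are not given in this record.

```python
import math
import random


def prepare(td_error, max_td_error, max_angle=math.pi / 2):
    """Preparation (assumed form): map |TD-error| linearly to an angle in [0, max_angle]."""
    if max_td_error <= 0:
        return 0.0
    ratio = min(abs(td_error) / max_td_error, 1.0)
    return ratio * max_angle


def depreciate(angle, replay_count, step=0.05):
    """Depreciation (assumed form): reduce the angle by a fixed step per past replay."""
    return max(angle - step * replay_count, 0.0)


def sampling_probabilities(td_errors, replay_counts, eps=1e-3):
    """Turn per-transition angles into normalized replay probabilities.

    td_errors must be non-empty. eps keeps every transition selectable.
    """
    max_td = max(abs(e) for e in td_errors)
    weights = []
    for td_error, count in zip(td_errors, replay_counts):
        theta = depreciate(prepare(td_error, max_td), count)
        # sin^2(theta) is the probability of observing the "selected" basis state.
        weights.append(math.sin(theta) ** 2 + eps)
    total = sum(weights)
    return [w / total for w in weights]


def sample_batch(td_errors, replay_counts, batch_size, rng=random):
    """Draw transition indices (with replacement) according to the probabilities."""
    probs = sampling_probabilities(td_errors, replay_counts)
    return rng.choices(range(len(probs)), weights=probs, k=batch_size)
```

Under these assumptions, a transition with a larger TD-error is more likely to be drawn, and a transition that has already been replayed many times becomes less likely, which mirrors the balance between importance and diversity that the abstract describes.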


Pages: 9326-9338

Number of pages: 13
