Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcement learning

被引：0

作者：

Hafez, Muhammad Burhan ^{[1
]}

Immisch, Tilman ^{[1
]}

Weber, Tom ^{[1
]}

Wermter, Stefan ^{[1
]}

机构：

[1] Univ Hamburg, Dept Informat, Knowledge Technol Res Grp, Hamburg, Germany

来源：

FRONTIERS IN NEUROROBOTICS | 2023年 / 17卷

关键词：

continual learning; reinforcement learning; cognitive robotics; catastrophic forgetting; experience replay; growing self-organizing maps; GO; SHOGI; LEVEL; CHESS;

D O I：

10.3389/fnbot.2023.1127642

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep reinforcement learning (RL) agents often suffer from catastrophic forgetting, forgetting previously found solutions in parts of the input space when training new data. Replay memories are a common solution to the problem by decorrelating and shuffling old and new training samples. They naively store state transitions as they arrive, without regard for redundancy. We introduce a novel cognitive-inspired replay memory approach based on the Grow-When-Required (GWR) self-organizing network, which resembles a map-based mental model of the world. Our approach organizes stored transitions into a concise environment-model-like network of state nodes and transition edges, merging similar samples to reduce the memory size and increase pair-wise distance among samples, which increases the relevancy of each sample. Overall, our study shows that map-based experience replay allows for significant memory reduction with only small decreases in performance.

引用

页数：13

共 50 条

[1] Complementary Learning for Overcoming Catastrophic Forgetting Using Experience Replay
Rostami, Mohammad
Kolouri, Soheil
Pilly, Praveen K.
[J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3339 - 3345
[2] Associative Memory Based Experience Replay for Deep Reinforcement Learning
Li, Mengyuan
Kazemi, Arman
Laguna, Ann Franchesca
Hu, X. Sharon
[J]. 2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
[3] Memory Efficient Experience Replay for Streaming Learning
Hayes, Tyler L.
Cahill, Nathan D.
Kanan, Christopher
[J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 9769 - 9776
[4] Efficient experience replay architecture for offline reinforcement learning
Zhang, Longfei
Feng, Yanghe
Wang, Rongxiao
Xu, Yue
Xu, Naifu
Liu, Zeyi
Du, Hang
[J]. ROBOTIC INTELLIGENCE AND AUTOMATION, 2023, 43 (01): : 35 - 43
[5] A Dual Memory Structure for Efficient Use of Replay Memory in Deep Reinforcement Learning
Ko, Wonshick
Chang, Dong Eui
[J]. 2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 1483 - 1486
[6] Deep Reinforcement Learning with Experience Replay Based on SARSA
Zhao, Dongbin
Wang, Haitao
Shao, Kun
Zhu, Yuanheng
[J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
[7] Efficient Policy Learning for General Robotic Tasks with Adaptive Dual-memory Hindsight Experience Replay Based on Deep Reinforcement Learning
Dong, Menghua
Ying, Fengkang
Li, Xiangjian
Liu, Huashan
[J]. 2023 7TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA, 2023, : 62 - 66
[8] Memory Reduction through Experience Classification for Deep Reinforcement Learning with Prioritized Experience Replay
Shen, Kai-Huan
Tsai, Pei-Yun
[J]. PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2019), 2019, : 166 - 171
[9] A New Reinforcement Learning Algorithm Based on Counterfactual Experience Replay
Li Menglin
Chen Jing
Chen Shaofei
Gao Wei
[J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 1994 - 2001
[10] An Experience Replay Method Based on Tree Structure for Reinforcement Learning
Jiang, Wei-Cheng
Hwang, Kao-Shing
Lin, Jin-Ling
[J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (02) : 972 - 982

← 1 2 3 4 5 →