Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction

被引：0

作者：

Kang Y. ^{[1
,2
]}

Zhao E. ^{[1
,2
]}

Zang Y. ^{[1
,2
]}

Li L. ^{[2
]}

Li K. ^{[2
]}

Tao P. ^{[3
]}

Xing J. ^{[3
]}

机构：

[1] School of Artificial Intelligence, University of Chinese, Academy of Sciences, Beijing

[2] Institute of Automation, Chinese Academy of Sciences, Beijing

[3] Department of Computer Science and Technology, Tsinghua University, Beijing

来源：

IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Experience replay (ER); graph model; memory reconstruction; reinforcement learning (RL); sample efficiency;

D O I：

10.1109/TAI.2023.3268612

中图分类号：

学科分类号：

摘要：

Reinforcement learning (RL) algorithms typically require orders of magnitude more interactions than humans to learn effective policies. Research on memory in neuroscience suggests that humans' learning efficiency benefits from associating their experiences and reconstructing potential events. Inspired by this finding, we introduce a human brainlike memory structure for agents and build a general learning framework based on this structure to improve the RL sampling efficiency. Since this framework is similar to the memory reconstruction process in psychology, we name the newly proposed RL framework as graph-based memory reconstruction (GBMR). In particular, GBMR first maintains an attribute graph on the agent's memory and then retrieves its critical nodes to build and update potential paths among these nodes. This novel pipeline drives the RL agent to learn faster with its memory-enhanced value functions and reduces interactions with the environment by reconstructing its valuable paths. Extensive experimental analyses and evaluations in the grid maze and some challenging Atari environments demonstrate GBMRs superiority over traditional RL methods. We will release the source code and trained models to facilitate further studies in this research direction. © 2023 IEEE.

引用

页码：751 / 762

页数：11

共 50 条

[41] Graph-based Relational Learning
NEC Laboratories Europe GmbH, Germany
不详
不详
不详
NEC Tech. J., 1 (101-105):
[42] A Graph-Based Deep Reinforcement Learning Approach to Grasping Fully Occluded Objects
Guoyu Zuo
Jiayuan Tong
Zihao Wang
Daoxiong Gong
Cognitive Computation, 2023, 15 : 36 - 49
[43] Collaborative Information Dissemination with Graph-Based Multi-Agent Reinforcement Learning
Galliera, Raffaele
Venable, Kristen Brent
Bassani, Matteo
Suri, Niranjan
ALGORITHMIC DECISION THEORY, ADT 2024, 2025, 15248 : 160 - 173
[44] Graph-based semisupervised learning
Culp, Mark
Michailidis, George
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (01) : 174 - 179
[45] Sample-efficient multi-agent reinforcement learning with masked reconstruction
Kim, Jung In
Lee, Young Jae
Heo, Jongkook
Park, Jinhyeok
Kim, Jaehoon
Lim, Sae Rin
Jeong, Jinyong
Kim, Seoung Bum
PLOS ONE, 2023, 18 (09):
[46] Efficient Dynamic IC Design Analysis Using Graph-Based Semi-Supervised Learning
Obert, James
Hamlet, Jason
Turner, Sean
INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2024, 18 (03) : 437 - 464
[47] GSBRL : Efficient RDF graph storage based on reinforcement learning
Zheng, Lei
Shen, Ziming
Wang, Hongzhi
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (02): : 763 - 784
[48] GSBRL : Efficient RDF graph storage based on reinforcement learning
Lei Zheng
Ziming Shen
Hongzhi Wang
World Wide Web, 2022, 25 : 763 - 784
[49] Memory-efficient semantic segmentation of large microscopy images using graph-based neural networks
Jain, Atishay
Laidlaw, David H.
Bajcsy, Peter
Singh, Ritambhara
MICROSCOPY, 2023, 73 (03) : 275 - 286
[50] An Efficient Mining of Transactional Data Using Graph-based Technique
AlZoubi, Wael Ahmad
Omar, Khairuddin
Abu Bakar, Azuraliza
2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, : 74 - 81

← 1 2 3 4 5 →