Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction

被引：0

作者：

Kang Y. ^{[1
,2
]}

Zhao E. ^{[1
,2
]}

Zang Y. ^{[1
,2
]}

Li L. ^{[2
]}

Li K. ^{[2
]}

Tao P. ^{[3
]}

Xing J. ^{[3
]}

机构：

[1] School of Artificial Intelligence, University of Chinese, Academy of Sciences, Beijing

[2] Institute of Automation, Chinese Academy of Sciences, Beijing

[3] Department of Computer Science and Technology, Tsinghua University, Beijing

来源：

IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Experience replay (ER); graph model; memory reconstruction; reinforcement learning (RL); sample efficiency;

D O I：

10.1109/TAI.2023.3268612

中图分类号：

学科分类号：

摘要：

Reinforcement learning (RL) algorithms typically require orders of magnitude more interactions than humans to learn effective policies. Research on memory in neuroscience suggests that humans' learning efficiency benefits from associating their experiences and reconstructing potential events. Inspired by this finding, we introduce a human brainlike memory structure for agents and build a general learning framework based on this structure to improve the RL sampling efficiency. Since this framework is similar to the memory reconstruction process in psychology, we name the newly proposed RL framework as graph-based memory reconstruction (GBMR). In particular, GBMR first maintains an attribute graph on the agent's memory and then retrieves its critical nodes to build and update potential paths among these nodes. This novel pipeline drives the RL agent to learn faster with its memory-enhanced value functions and reduces interactions with the environment by reconstructing its valuable paths. Extensive experimental analyses and evaluations in the grid maze and some challenging Atari environments demonstrate GBMRs superiority over traditional RL methods. We will release the source code and trained models to facilitate further studies in this research direction. © 2023 IEEE.

引用

页码：751 / 762

页数：11

共 50 条

[21] A Graph-Based Reinforcement Learning Method with Converged State Exploration and Exploitation
Li, Han
Chen, Tianding
Teng, Hualiang
Jiang, Yingtao
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2019, 118 (02): : 253 - +
[22] Poisoning attacks against knowledge graph-based recommendation systems using deep reinforcement learning
Zih-Wun Wu
Chiao-Ting Chen
Szu-Hao Huang
Neural Computing and Applications, 2022, 34 : 3097 - 3115
[23] Poisoning attacks against knowledge graph-based recommendation systems using deep reinforcement learning
Wu, Zih-Wun
Chen, Chiao-Ting
Huang, Szu-Hao
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3097 - 3115
[24] Efficient learning of supervised kernels with a graph-based loss function
Pan, Binbin
Chen, Wen-Sheng
Chen, Bo
Xu, Chen
INFORMATION SCIENCES, 2016, 370 : 50 - 62
[25] Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning
Li, Yang
Luo, Xiangfeng
Xie, Shaorong
2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 709 - 713
[26] Graph-based Efficient WiFi Fingerprint Training Using Un-supervised Learning
Zhao, Bo
Pei, Ling
Xu, Changqing
Gu, Li
PROCEEDINGS OF THE 28TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2015), 2015, : 2301 - 2310
[27] Enhancing Federated Learning Performance Fairness via Collaboration Graph-Based Reinforcement Learning
Xia, Yuexuan
Ma, Benteng
Dou, Qi
Xia, Yong
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X, 2024, 15010 : 263 - 272
[28] Efficient Join Order Selection Learning with Graph-based Representation
Chen, Jin
Ye, Guanyu
Zhao, Yan
Liu, Shuncheng
Deng, Liwei
Chen, Xu
Zhou, Rui
Zheng, Kai
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 97 - 107
[29] Efficient locality weighted sparse representation for graph-based learning
Feng, Xiaodong
Wu, Sen
Zhou, Wenjun
Quan, Min
KNOWLEDGE-BASED SYSTEMS, 2017, 121 : 129 - 141
[30] Energy Efficient UAV-Assisted IoT Data Collection: A Graph-Based Deep Reinforcement Learning Approach
Wu, Qianqian
Liu, Qiang
Zhu, Wenliang
Wu, Zefan
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (06): : 6082 - 6094

← 1 2 3 4 5 →