Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction

被引:0
|
作者
Kang Y. [1 ,2 ]
Zhao E. [1 ,2 ]
Zang Y. [1 ,2 ]
Li L. [2 ]
Li K. [2 ]
Tao P. [3 ]
Xing J. [3 ]
机构
[1] School of Artificial Intelligence, University of Chinese, Academy of Sciences, Beijing
[2] Institute of Automation, Chinese Academy of Sciences, Beijing
[3] Department of Computer Science and Technology, Tsinghua University, Beijing
来源
基金
中国国家自然科学基金;
关键词
Experience replay (ER); graph model; memory reconstruction; reinforcement learning (RL); sample efficiency;
D O I
10.1109/TAI.2023.3268612
中图分类号
学科分类号
摘要
Reinforcement learning (RL) algorithms typically require orders of magnitude more interactions than humans to learn effective policies. Research on memory in neuroscience suggests that humans' learning efficiency benefits from associating their experiences and reconstructing potential events. Inspired by this finding, we introduce a human brainlike memory structure for agents and build a general learning framework based on this structure to improve the RL sampling efficiency. Since this framework is similar to the memory reconstruction process in psychology, we name the newly proposed RL framework as graph-based memory reconstruction (GBMR). In particular, GBMR first maintains an attribute graph on the agent's memory and then retrieves its critical nodes to build and update potential paths among these nodes. This novel pipeline drives the RL agent to learn faster with its memory-enhanced value functions and reduces interactions with the environment by reconstructing its valuable paths. Extensive experimental analyses and evaluations in the grid maze and some challenging Atari environments demonstrate GBMRs superiority over traditional RL methods. We will release the source code and trained models to facilitate further studies in this research direction. © 2023 IEEE.
引用
收藏
页码:751 / 762
页数:11
相关论文
共 50 条
  • [21] A Graph-Based Reinforcement Learning Method with Converged State Exploration and Exploitation
    Li, Han
    Chen, Tianding
    Teng, Hualiang
    Jiang, Yingtao
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2019, 118 (02): : 253 - +
  • [22] Poisoning attacks against knowledge graph-based recommendation systems using deep reinforcement learning
    Zih-Wun Wu
    Chiao-Ting Chen
    Szu-Hao Huang
    Neural Computing and Applications, 2022, 34 : 3097 - 3115
  • [23] Poisoning attacks against knowledge graph-based recommendation systems using deep reinforcement learning
    Wu, Zih-Wun
    Chen, Chiao-Ting
    Huang, Szu-Hao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3097 - 3115
  • [24] Efficient learning of supervised kernels with a graph-based loss function
    Pan, Binbin
    Chen, Wen-Sheng
    Chen, Bo
    Xu, Chen
    INFORMATION SCIENCES, 2016, 370 : 50 - 62
  • [25] Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning
    Li, Yang
    Luo, Xiangfeng
    Xie, Shaorong
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 709 - 713
  • [26] Graph-based Efficient WiFi Fingerprint Training Using Un-supervised Learning
    Zhao, Bo
    Pei, Ling
    Xu, Changqing
    Gu, Li
    PROCEEDINGS OF THE 28TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2015), 2015, : 2301 - 2310
  • [27] Enhancing Federated Learning Performance Fairness via Collaboration Graph-Based Reinforcement Learning
    Xia, Yuexuan
    Ma, Benteng
    Dou, Qi
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X, 2024, 15010 : 263 - 272
  • [28] Efficient Join Order Selection Learning with Graph-based Representation
    Chen, Jin
    Ye, Guanyu
    Zhao, Yan
    Liu, Shuncheng
    Deng, Liwei
    Chen, Xu
    Zhou, Rui
    Zheng, Kai
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 97 - 107
  • [29] Efficient locality weighted sparse representation for graph-based learning
    Feng, Xiaodong
    Wu, Sen
    Zhou, Wenjun
    Quan, Min
    KNOWLEDGE-BASED SYSTEMS, 2017, 121 : 129 - 141
  • [30] Energy Efficient UAV-Assisted IoT Data Collection: A Graph-Based Deep Reinforcement Learning Approach
    Wu, Qianqian
    Liu, Qiang
    Zhu, Wenliang
    Wu, Zefan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (06): : 6082 - 6094