Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction

被引:0
|
作者
Kang Y. [1 ,2 ]
Zhao E. [1 ,2 ]
Zang Y. [1 ,2 ]
Li L. [2 ]
Li K. [2 ]
Tao P. [3 ]
Xing J. [3 ]
机构
[1] School of Artificial Intelligence, University of Chinese, Academy of Sciences, Beijing
[2] Institute of Automation, Chinese Academy of Sciences, Beijing
[3] Department of Computer Science and Technology, Tsinghua University, Beijing
来源
基金
中国国家自然科学基金;
关键词
Experience replay (ER); graph model; memory reconstruction; reinforcement learning (RL); sample efficiency;
D O I
10.1109/TAI.2023.3268612
中图分类号
学科分类号
摘要
Reinforcement learning (RL) algorithms typically require orders of magnitude more interactions than humans to learn effective policies. Research on memory in neuroscience suggests that humans' learning efficiency benefits from associating their experiences and reconstructing potential events. Inspired by this finding, we introduce a human brainlike memory structure for agents and build a general learning framework based on this structure to improve the RL sampling efficiency. Since this framework is similar to the memory reconstruction process in psychology, we name the newly proposed RL framework as graph-based memory reconstruction (GBMR). In particular, GBMR first maintains an attribute graph on the agent's memory and then retrieves its critical nodes to build and update potential paths among these nodes. This novel pipeline drives the RL agent to learn faster with its memory-enhanced value functions and reduces interactions with the environment by reconstructing its valuable paths. Extensive experimental analyses and evaluations in the grid maze and some challenging Atari environments demonstrate GBMRs superiority over traditional RL methods. We will release the source code and trained models to facilitate further studies in this research direction. © 2023 IEEE.
引用
收藏
页码:751 / 762
页数:11
相关论文
共 50 条
  • [41] Graph-based Relational Learning
    NEC Laboratories Europe GmbH, Germany
    不详
    不详
    不详
    NEC Tech. J., 1 (101-105):
  • [42] A Graph-Based Deep Reinforcement Learning Approach to Grasping Fully Occluded Objects
    Guoyu Zuo
    Jiayuan Tong
    Zihao Wang
    Daoxiong Gong
    Cognitive Computation, 2023, 15 : 36 - 49
  • [43] Collaborative Information Dissemination with Graph-Based Multi-Agent Reinforcement Learning
    Galliera, Raffaele
    Venable, Kristen Brent
    Bassani, Matteo
    Suri, Niranjan
    ALGORITHMIC DECISION THEORY, ADT 2024, 2025, 15248 : 160 - 173
  • [44] Graph-based semisupervised learning
    Culp, Mark
    Michailidis, George
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (01) : 174 - 179
  • [45] Sample-efficient multi-agent reinforcement learning with masked reconstruction
    Kim, Jung In
    Lee, Young Jae
    Heo, Jongkook
    Park, Jinhyeok
    Kim, Jaehoon
    Lim, Sae Rin
    Jeong, Jinyong
    Kim, Seoung Bum
    PLOS ONE, 2023, 18 (09):
  • [46] Efficient Dynamic IC Design Analysis Using Graph-Based Semi-Supervised Learning
    Obert, James
    Hamlet, Jason
    Turner, Sean
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2024, 18 (03) : 437 - 464
  • [47] GSBRL : Efficient RDF graph storage based on reinforcement learning
    Zheng, Lei
    Shen, Ziming
    Wang, Hongzhi
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (02): : 763 - 784
  • [48] GSBRL : Efficient RDF graph storage based on reinforcement learning
    Lei Zheng
    Ziming Shen
    Hongzhi Wang
    World Wide Web, 2022, 25 : 763 - 784
  • [49] Memory-efficient semantic segmentation of large microscopy images using graph-based neural networks
    Jain, Atishay
    Laidlaw, David H.
    Bajcsy, Peter
    Singh, Ritambhara
    MICROSCOPY, 2023, 73 (03) : 275 - 286
  • [50] An Efficient Mining of Transactional Data Using Graph-based Technique
    AlZoubi, Wael Ahmad
    Omar, Khairuddin
    Abu Bakar, Azuraliza
    2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, : 74 - 81