Learning by reusing previous advice: a memory-based teacher–student framework

被引:0
|
作者
Changxi Zhu
Yi Cai
Shuyue Hu
Ho-fung Leung
Dickson K. W. Chiu
机构
[1] South China University of Technology,School of Software Engineering
[2] Shanghai Artificial Intelligence Laboratory,Department of Computer Science and Engineering
[3] The Chinese University of Hong Kong,Faculty of Education
[4] The University of Hong Kong,undefined
关键词
Reinforcement learning; Multi-agent learning; Action advising; Teacher–student;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement Learning (RL) has been widely used to solve sequential decision-making problems. However, it often suffers from slow learning speed in complex scenarios. Teacher–student frameworks address this issue by enabling agents to ask for and give advice so that a student agent can leverage the knowledge of a teacher agent to facilitate its learning. In this paper, we consider the effect of reusing previous advice, and propose a novel memory-based teacher–student framework such that student agents can memorize and reuse the previous advice from teacher agents. In particular, we propose two methods to decide whether previous advice should be reused: Q-Change per Step that reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability that reuses the advice with a decaying probability. The experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that our proposed framework significantly outperforms the existing frameworks in which previous advice is not reused.
引用
收藏
相关论文
共 50 条
  • [1] Learning by reusing previous advice: a memory-based teacher-student framework
    Zhu, Changxi
    Cai, Yi
    Hu, Shuyue
    Leung, Ho-fung
    Chiu, Dickson K. W.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2023, 37 (01)
  • [2] A GNN-based teacher-student framework with multi-advice
    Lei, Yunjiao
    Ye, Dayong
    Zhu, Congcong
    Shen, Sheng
    Zhou, Wanlei
    Zhu, Tianqing
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [3] A THEORY FOR MEMORY-BASED LEARNING
    LIN, JH
    VITTER, JS
    MACHINE LEARNING, 1994, 17 (2-3) : 143 - 167
  • [4] A probabilistic framework for memory-based reasoning
    Kasif, S
    Salzberg, S
    Waltz, D
    Rachlin, J
    Aha, DW
    ARTIFICIAL INTELLIGENCE, 1998, 104 (1-2) : 287 - 311
  • [5] Memory-Based Explainable Reinforcement Learning
    Cruz, Francisco
    Dazeley, Richard
    Vamplew, Peter
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 66 - 77
  • [6] A tabular approach memory-based learning
    Lin, C.-S. (linc@missouri.edu), 1600, Taylor and Francis Inc. (05):
  • [7] Memory-based learning for visual odometry
    Roberts, Richard
    Nguyen, Hai
    Krishnamurthi, Niyant
    Balch, Tucker
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 47 - 52
  • [8] Hierarchical memory-based reinforcement learning
    Hernandez-Gardiol, N
    Mahadevan, S
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 1047 - 1053
  • [9] Advice Replay Approach for Richer Knowledge Transfer in Teacher Student Framework
    Gupta, Vaibhav
    Anand, Daksh
    Paruchuri, Praveen
    Ravindran, Balaraman
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1997 - 1999
  • [10] Memory-based in situ learning for unmanned vehicles
    McDowell, Patrick
    Bourgeois, Brian S.
    Sofge, Donald A.
    Iyengar, S. S.
    COMPUTER, 2006, 39 (12) : 62 - +