Learning by reusing previous advice: a memory-based teacher–student framework

Cited by: 0

Authors:

Changxi Zhu

Yi Cai

Shuyue Hu

Ho-fung Leung

Dickson K. W. Chiu

Affiliations:

[1] South China University of Technology, School of Software Engineering

[2] Shanghai Artificial Intelligence Laboratory, Department of Computer Science and Engineering

[3] The Chinese University of Hong Kong, Faculty of Education

[4] The University of Hong Kong

Source:

Autonomous Agents and Multi-Agent Systems | 2023 / Vol. 37

Keywords:

Reinforcement learning; Multi-agent learning; Action advising; Teacher–student

DOI:

Not available

Abstract:

Reinforcement Learning (RL) has been widely used to solve sequential decision-making problems. However, it often learns slowly in complex scenarios. Teacher–student frameworks address this issue by enabling agents to ask for and give advice, so that a student agent can leverage the knowledge of a teacher agent to speed up its learning. In this paper, we consider the effect of reusing previous advice and propose a novel memory-based teacher–student framework in which student agents memorize and reuse previous advice from teacher agents. In particular, we propose two methods to decide whether previous advice should be reused: Q-Change per Step, which reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability, which reuses the advice with a decaying probability. Experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that the proposed framework significantly outperforms existing frameworks in which previous advice is not reused.
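
As a rough illustration of the two reuse rules named in the abstract, the Python sketch below shows one way they could be written. All names (`AdviceMemory`, `should_reuse_q_change`, `should_reuse_decay`), the per-state dictionary layout, the initial reuse probability of 1.0 and the multiplicative decay factor of 0.9 are assumptions made for illustration. The abstract does not specify these details, and this is not the authors' implementation.

```python
import random
from dataclasses import dataclass, field


@dataclass
class AdviceMemory:
    """Hypothetical per-state store of the most recent teacher advice."""
    advice: dict = field(default_factory=dict)      # state -> advised action
    reuse_prob: dict = field(default_factory=dict)  # state -> reuse probability

    def store(self, state, action, initial_prob=1.0):
        # Remember the advised action and reset its reuse probability.
        self.advice[state] = action
        self.reuse_prob[state] = initial_prob


def should_reuse_q_change(q_before, q_after):
    """Q-Change per Step (sketch): reuse only if the Q-value increased
    after the advised action was last followed."""
    return q_after > q_before


def should_reuse_decay(memory, state, decay=0.9, rng=random):
    """Decay Reusing Probability (sketch): reuse with the stored
    probability, then shrink that probability by an assumed factor."""
    p = memory.reuse_prob.get(state, 0.0)
    reuse = rng.random() < p
    memory.reuse_prob[state] = p * decay
    return reuse
```

In this sketch a student would call `store` whenever the teacher gives advice, then consult one of the two predicates on later visits to the same state before deciding whether to replay the remembered action or fall back to its own policy.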
