Learning by reusing previous advice: a memory-based teacher–student framework

被引：0

作者：

Changxi Zhu

Yi Cai

Shuyue Hu

Ho-fung Leung

Dickson K. W. Chiu

机构：

[1] South China University of Technology,School of Software Engineering

[2] Shanghai Artificial Intelligence Laboratory,Department of Computer Science and Engineering

[3] The Chinese University of Hong Kong,Faculty of Education

[4] The University of Hong Kong,undefined

来源：

Autonomous Agents and Multi-Agent Systems | 2023年 / 37卷

关键词：

Reinforcement learning; Multi-agent learning; Action advising; Teacher–student;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Reinforcement Learning (RL) has been widely used to solve sequential decision-making problems. However, it often suffers from slow learning speed in complex scenarios. Teacher–student frameworks address this issue by enabling agents to ask for and give advice so that a student agent can leverage the knowledge of a teacher agent to facilitate its learning. In this paper, we consider the effect of reusing previous advice, and propose a novel memory-based teacher–student framework such that student agents can memorize and reuse the previous advice from teacher agents. In particular, we propose two methods to decide whether previous advice should be reused: Q-Change per Step that reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability that reuses the advice with a decaying probability. The experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that our proposed framework significantly outperforms the existing frameworks in which previous advice is not reused.

引用

共 50 条

[21] Dynamic Memory-Based Continual Learning with Generating and Screening
Tao, Siying
Huang, Jinyang
Zhang, Xiang
Sun, Xiao
Gu, Yu
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 365 - 376
[22] Text Chunker for Malayalam using Memory-Based Learning
Raj, Rekha C. T.
Raj, Reghu P. C.
2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 595 - 599
[23] Memory-based Statistical Learning for The Travelling Salesman Problem
Xia, Yong
Li, Changhe
2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 2935 - 2941
[24] Hydraulic system modeling through memory-based learning
Krishna, M
Bares, J
1998 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - PROCEEDINGS, VOLS 1-3: INNOVATIONS IN THEORY, PRACTICE AND APPLICATIONS, 1998, : 1733 - 1738
[25] Memory-based interference effects in implicit contextual learning
Zellin, M.
Conci, M.
Von Muehlenen, A.
Mueller, H. J.
PERCEPTION, 2011, 40 : 85 - 85
[26] Improved Schemes for Episodic Memory-based Lifelong Learning
Guo, Yunhui
Liu, Mingrui
Yang, Tianbao
Rosing, Tajana
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[27] A memory-based learning model of dutch plural inflection
Keuleers, E
Sandra, D
PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON COGNITIVE MODELING, 2004, : 358 - 359
[28] Memory-Based Dual Gaussian Processes for Sequential Learning
Chang, Paul E.
Verma, Prakhar
John, S. T.
Solin, Arno
Khan, Mohammad Emtiyaz
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[29] Traffic signal control using reinforcement learning based on the teacher-student framework
Liu, Junxiu
Qin, Sheng
Su, Min
Luo, Yuling
Zhang, Shunsheng
Wang, Yanhu
Yang, Su
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
[30] Memory-based active learning for French broadcast news
Tantini, Frederic
Cerisara, Christophe
Gardent, Claire
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1377 - 1380

← 1 2 3 4 5 →