Learning by reusing previous advice: a memory-based teacher–student framework

Cited by: 0

Authors:

Changxi Zhu

Yi Cai

Shuyue Hu

Ho-fung Leung

Dickson K. W. Chiu

Affiliations:

[1] South China University of Technology, School of Software Engineering

[2] Shanghai Artificial Intelligence Laboratory, Department of Computer Science and Engineering

[3] The Chinese University of Hong Kong, Faculty of Education

[4] The University of Hong Kong

Source:

Autonomous Agents and Multi-Agent Systems | 2023 / Vol. 37

Keywords:

Reinforcement learning; Multi-agent learning; Action advising; Teacher–student

DOI:

Not available

Abstract:

Reinforcement learning (RL) has been widely used to solve sequential decision-making problems, but it often learns slowly in complex scenarios. Teacher–student frameworks address this issue by letting agents ask for and give advice, so that a student agent can leverage a teacher agent's knowledge to speed up its own learning. In this paper, we consider the effect of reusing previous advice and propose a novel memory-based teacher–student framework in which student agents memorize and reuse previous advice from teacher agents. In particular, we propose two methods for deciding whether previous advice should be reused: Q-Change per Step, which reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability, which reuses the advice with a decaying probability. Experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that the proposed framework significantly outperforms existing frameworks in which previous advice is not reused.
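
The two reuse rules named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no formulas, so the class and function names (`AdviceMemory`, `reuse_by_q_change`, `DecayingReuse`), the per-state memory layout, the "last Q-value change was positive" test, and the multiplicative decay schedule are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass, field


@dataclass
class AdviceMemory:
    """Hypothetical store of the teacher's last advised action per state."""
    advice: dict = field(default_factory=dict)         # state -> advised action
    last_q_change: dict = field(default_factory=dict)  # state -> latest Q delta

    def store(self, state, action):
        self.advice[state] = action
        # Assumption: fresh advice counts as beneficial until proven otherwise.
        self.last_q_change[state] = float("inf")

    def record_q_change(self, state, q_before, q_after):
        # Called after the student updates Q for the advised action.
        if state in self.advice:
            self.last_q_change[state] = q_after - q_before


def reuse_by_q_change(memory, state):
    """Q-change style rule: reuse only if the last update raised the Q-value."""
    if state not in memory.advice:
        return None
    return memory.advice[state] if memory.last_q_change[state] > 0 else None


class DecayingReuse:
    """Decaying-probability rule: reuse with a probability that shrinks."""

    def __init__(self, initial_prob=1.0, decay=0.9, rng=None):
        self.prob = initial_prob
        self.decay = decay  # assumed multiplicative schedule
        self.rng = rng or random.Random(0)

    def reuse(self, memory, state):
        if state not in memory.advice:
            return None
        take = self.rng.random() < self.prob
        self.prob *= self.decay  # decay after every reuse decision
        return memory.advice[state] if take else None
```

In this sketch both rules return the remembered action, or `None` when the student should fall back to its own policy or ask the teacher again; how the paper handles that fallback is not stated in the abstract.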
