Learning by reusing previous advice: a memory-based teacher–student framework

被引：0

作者：

Changxi Zhu

Yi Cai

Shuyue Hu

Ho-fung Leung

Dickson K. W. Chiu

机构：

[1] South China University of Technology,School of Software Engineering

[2] Shanghai Artificial Intelligence Laboratory,Department of Computer Science and Engineering

[3] The Chinese University of Hong Kong,Faculty of Education

[4] The University of Hong Kong,undefined

来源：

Autonomous Agents and Multi-Agent Systems | 2023年 / 37卷

关键词：

Reinforcement learning; Multi-agent learning; Action advising; Teacher–student;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Reinforcement Learning (RL) has been widely used to solve sequential decision-making problems. However, it often suffers from slow learning speed in complex scenarios. Teacher–student frameworks address this issue by enabling agents to ask for and give advice so that a student agent can leverage the knowledge of a teacher agent to facilitate its learning. In this paper, we consider the effect of reusing previous advice, and propose a novel memory-based teacher–student framework such that student agents can memorize and reuse the previous advice from teacher agents. In particular, we propose two methods to decide whether previous advice should be reused: Q-Change per Step that reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability that reuses the advice with a decaying probability. The experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that our proposed framework significantly outperforms the existing frameworks in which previous advice is not reused.

引用

共 50 条

[31] Personality- and Memory-based framework for Emotionally Intelligent agents
Nardelli, Alice
Maccagni, Giacomo
Minutoli, Federico
Sgorbissa, Antonio
Recchiuto, Carmine Tommaso
2024 33RD IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, ROMAN 2024, 2024, : 769 - 776
[32] LEARNING MODES, FEATURE CORRELATIONS, AND MEMORY-BASED CATEGORIZATION
WATTENMAKER, WD
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1991, 17 (05) : 908 - 923
[33] A Scalable Memory-Based Reconfigurable Computing Framework for Nanoscale Crossbar
Paul, Somnath
Bhunia, Swarup
IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2012, 11 (03) : 451 - 462
[34] MEMORY-BASED PEDESTRIAN DETECTION THROUGH SEQUENCE LEARNING
Li, Xudong
Ye, Mao
Liu, Yiguang
Zhu, Ce
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1129 - 1134
[35] MHDFS: A Memory-Based Hadoop Framework for Large Data Storage
Song, Aibo
Zhao, Maoxian
Xue, Yingying
Luo, Junzhou
SCIENTIFIC PROGRAMMING, 2016, 2016
[36] Reinforcement Learning with Teacher-student Framework In Future Market
Chen, Sihang
Luo, Weiqi
Yu, Chao
INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
[37] Reinforcement Learning Using a Stochastic Gradient Method with Memory-Based Learning
Yamada, Takafumi
Yamaguchi, Satoshi
ELECTRICAL ENGINEERING IN JAPAN, 2010, 173 (01) : 32 - 40
[38] Tagging by Combining Rules- Based Method and Memory-Based Learning
Yamina, Tlili-Guiassa
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 6, 2005, : 110 - 114
[39] Study on LSTM and ConvLSTM Memory-Based Deep Reinforcement Learning
Duarte, Fernando Fradique
Lau, Nuno
Pereira, Artur
Reis, Luis Paulo
AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2023, 2024, 14546 : 223 - 243
[40] Incremental Model Enhancement via Memory-based Contrastive Learning
Xuan, Shiyu
Yang, Ming
Zhang, Shiliang
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (01) : 65 - 83

← 1 2 3 4 5 →