Learning by reusing previous advice: a memory-based teacher–student framework

被引:0
|
作者
Changxi Zhu
Yi Cai
Shuyue Hu
Ho-fung Leung
Dickson K. W. Chiu
机构
[1] South China University of Technology,School of Software Engineering
[2] Shanghai Artificial Intelligence Laboratory,Department of Computer Science and Engineering
[3] The Chinese University of Hong Kong,Faculty of Education
[4] The University of Hong Kong,undefined
关键词
Reinforcement learning; Multi-agent learning; Action advising; Teacher–student;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement Learning (RL) has been widely used to solve sequential decision-making problems. However, it often suffers from slow learning speed in complex scenarios. Teacher–student frameworks address this issue by enabling agents to ask for and give advice so that a student agent can leverage the knowledge of a teacher agent to facilitate its learning. In this paper, we consider the effect of reusing previous advice, and propose a novel memory-based teacher–student framework such that student agents can memorize and reuse the previous advice from teacher agents. In particular, we propose two methods to decide whether previous advice should be reused: Q-Change per Step that reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability that reuses the advice with a decaying probability. The experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that our proposed framework significantly outperforms the existing frameworks in which previous advice is not reused.
引用
收藏
相关论文
共 50 条
  • [31] Personality- and Memory-based framework for Emotionally Intelligent agents
    Nardelli, Alice
    Maccagni, Giacomo
    Minutoli, Federico
    Sgorbissa, Antonio
    Recchiuto, Carmine Tommaso
    2024 33RD IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, ROMAN 2024, 2024, : 769 - 776
  • [32] LEARNING MODES, FEATURE CORRELATIONS, AND MEMORY-BASED CATEGORIZATION
    WATTENMAKER, WD
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1991, 17 (05) : 908 - 923
  • [33] A Scalable Memory-Based Reconfigurable Computing Framework for Nanoscale Crossbar
    Paul, Somnath
    Bhunia, Swarup
    IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2012, 11 (03) : 451 - 462
  • [34] MEMORY-BASED PEDESTRIAN DETECTION THROUGH SEQUENCE LEARNING
    Li, Xudong
    Ye, Mao
    Liu, Yiguang
    Zhu, Ce
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1129 - 1134
  • [35] MHDFS: A Memory-Based Hadoop Framework for Large Data Storage
    Song, Aibo
    Zhao, Maoxian
    Xue, Yingying
    Luo, Junzhou
    SCIENTIFIC PROGRAMMING, 2016, 2016
  • [36] Reinforcement Learning with Teacher-student Framework In Future Market
    Chen, Sihang
    Luo, Weiqi
    Yu, Chao
    INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
  • [37] Reinforcement Learning Using a Stochastic Gradient Method with Memory-Based Learning
    Yamada, Takafumi
    Yamaguchi, Satoshi
    ELECTRICAL ENGINEERING IN JAPAN, 2010, 173 (01) : 32 - 40
  • [38] Tagging by Combining Rules- Based Method and Memory-Based Learning
    Yamina, Tlili-Guiassa
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 6, 2005, : 110 - 114
  • [39] Study on LSTM and ConvLSTM Memory-Based Deep Reinforcement Learning
    Duarte, Fernando Fradique
    Lau, Nuno
    Pereira, Artur
    Reis, Luis Paulo
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2023, 2024, 14546 : 223 - 243
  • [40] Incremental Model Enhancement via Memory-based Contrastive Learning
    Xuan, Shiyu
    Yang, Ming
    Zhang, Shiliang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (01) : 65 - 83