Learning by reusing previous advice: a memory-based teacher–student framework

被引:0
|
作者
Changxi Zhu
Yi Cai
Shuyue Hu
Ho-fung Leung
Dickson K. W. Chiu
机构
[1] South China University of Technology,School of Software Engineering
[2] Shanghai Artificial Intelligence Laboratory,Department of Computer Science and Engineering
[3] The Chinese University of Hong Kong,Faculty of Education
[4] The University of Hong Kong,undefined
关键词
Reinforcement learning; Multi-agent learning; Action advising; Teacher–student;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement Learning (RL) has been widely used to solve sequential decision-making problems. However, it often suffers from slow learning speed in complex scenarios. Teacher–student frameworks address this issue by enabling agents to ask for and give advice so that a student agent can leverage the knowledge of a teacher agent to facilitate its learning. In this paper, we consider the effect of reusing previous advice, and propose a novel memory-based teacher–student framework such that student agents can memorize and reuse the previous advice from teacher agents. In particular, we propose two methods to decide whether previous advice should be reused: Q-Change per Step that reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability that reuses the advice with a decaying probability. The experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that our proposed framework significantly outperforms the existing frameworks in which previous advice is not reused.
引用
收藏
相关论文
共 50 条
  • [21] Dynamic Memory-Based Continual Learning with Generating and Screening
    Tao, Siying
    Huang, Jinyang
    Zhang, Xiang
    Sun, Xiao
    Gu, Yu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 365 - 376
  • [22] Text Chunker for Malayalam using Memory-Based Learning
    Raj, Rekha C. T.
    Raj, Reghu P. C.
    2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 595 - 599
  • [23] Memory-based Statistical Learning for The Travelling Salesman Problem
    Xia, Yong
    Li, Changhe
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 2935 - 2941
  • [24] Hydraulic system modeling through memory-based learning
    Krishna, M
    Bares, J
    1998 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - PROCEEDINGS, VOLS 1-3: INNOVATIONS IN THEORY, PRACTICE AND APPLICATIONS, 1998, : 1733 - 1738
  • [25] Memory-based interference effects in implicit contextual learning
    Zellin, M.
    Conci, M.
    Von Muehlenen, A.
    Mueller, H. J.
    PERCEPTION, 2011, 40 : 85 - 85
  • [26] Improved Schemes for Episodic Memory-based Lifelong Learning
    Guo, Yunhui
    Liu, Mingrui
    Yang, Tianbao
    Rosing, Tajana
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [27] A memory-based learning model of dutch plural inflection
    Keuleers, E
    Sandra, D
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON COGNITIVE MODELING, 2004, : 358 - 359
  • [28] Memory-Based Dual Gaussian Processes for Sequential Learning
    Chang, Paul E.
    Verma, Prakhar
    John, S. T.
    Solin, Arno
    Khan, Mohammad Emtiyaz
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [29] Traffic signal control using reinforcement learning based on the teacher-student framework
    Liu, Junxiu
    Qin, Sheng
    Su, Min
    Luo, Yuling
    Zhang, Shunsheng
    Wang, Yanhu
    Yang, Su
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [30] Memory-based active learning for French broadcast news
    Tantini, Frederic
    Cerisara, Christophe
    Gardent, Claire
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1377 - 1380