Learning by reusing previous advice: a memory-based teacher–student framework

被引:0
|
作者
Changxi Zhu
Yi Cai
Shuyue Hu
Ho-fung Leung
Dickson K. W. Chiu
机构
[1] South China University of Technology,School of Software Engineering
[2] Shanghai Artificial Intelligence Laboratory,Department of Computer Science and Engineering
[3] The Chinese University of Hong Kong,Faculty of Education
[4] The University of Hong Kong,undefined
关键词
Reinforcement learning; Multi-agent learning; Action advising; Teacher–student;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement Learning (RL) has been widely used to solve sequential decision-making problems. However, it often suffers from slow learning speed in complex scenarios. Teacher–student frameworks address this issue by enabling agents to ask for and give advice so that a student agent can leverage the knowledge of a teacher agent to facilitate its learning. In this paper, we consider the effect of reusing previous advice, and propose a novel memory-based teacher–student framework such that student agents can memorize and reuse the previous advice from teacher agents. In particular, we propose two methods to decide whether previous advice should be reused: Q-Change per Step that reuses the advice if it leads to an increase in Q-values, and Decay Reusing Probability that reuses the advice with a decaying probability. The experiments on diverse RL tasks (Mario, Predator–Prey and Half Field Offense) confirm that our proposed framework significantly outperforms the existing frameworks in which previous advice is not reused.
引用
收藏
相关论文
共 50 条
  • [41] A Teacher-Student Markov Decision Process-based Framework for Online Correctional Learning
    Lourenco, Ines
    Winqvist, Rebecka
    Rojas, Cristian R.
    Wahlberg, Bo
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3456 - 3461
  • [42] Recent progress in analog memory-based accelerators for deep learning
    Tsai, Hsinyu
    Ambrogio, Stefano
    Narayanan, Pritish
    Shelby, Robert M.
    Burr, Geoffrey W.
    JOURNAL OF PHYSICS D-APPLIED PHYSICS, 2018, 51 (28)
  • [43] MEMORY-BASED PARAMETERIZED SKILLS LEARNING FOR MAPLESS VISUAL NAVIGATION
    Liu, Yuyang
    Cong, Yang
    Sun, Gan
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1890 - 1894
  • [44] Development of Amharic Morphological Analyzer Using Memory-Based Learning
    Abate, Mesfin
    Assabie, Yaregal
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2014, 8686 : 1 - 13
  • [45] A Memory-based Multiagent Framework for Adaptive Decision Making Extended Abstract
    Khadka, Shauharda
    Yates, Connor
    Tumer, Kagan
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1977 - 1979
  • [46] A memory-based approach to learning shallow natural language patterns
    Argamon-Engelson, S
    Dagan, I
    Krymolowski, Y
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1999, 11 (03) : 369 - 390
  • [47] Instance-family abstraction in memory-based language learning
    van den Bosch, A
    MACHINE LEARNING, PROCEEDINGS, 1999, : 39 - 48
  • [48] Constructing hydraulic robot models using memory-based learning
    Krishna, M
    Bares, J
    JOURNAL OF AEROSPACE ENGINEERING, 1999, 12 (02) : 34 - 42
  • [49] Pacing Electrocardiogram Detection With Memory-Based Autoencoder and Metric Learning
    Ge, Zhaoyang
    Cheng, Huiqing
    Tong, Zhuang
    Yang, Lihong
    Zhou, Bing
    Wang, Zongmin
    FRONTIERS IN PHYSIOLOGY, 2021, 12