DECAF: Deep Case-based Policy Inference for knowledge transfer in Reinforcement Learning

被引:19
|
作者
Glatt, Ruben [1 ]
Da Silva, Felipe Leno [1 ]
da Costa Bianchi, Reinaldo Augusto [2 ]
Reali Costa, Anna Helena [1 ]
机构
[1] Univ Sao Paulo, Av Prof Luciano Gualberto 158, BR-05508010 Sao Paulo, Brazil
[2] FEIs Univ Ctr, Av Humberto Alencar Castelo Branco 3972, BR-09850901 Sao Bernardo Do Campo, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Deep Reinforcement Learning; Case-based Reasoning; Transfer Learning; Knowledge discovery; Knowledge management; Neural networks; SYSTEM; CBR;
D O I
10.1016/j.eswa.2020.113420
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Having the ability to solve increasingly complex problems using Reinforcement Learning (RL) has prompted researchers to start developing a greater interest in systematic approaches to retain and reuse knowledge over a variety of tasks. With Case-based Reasoning (CBR) there exists a general methodology that provides a framework for knowledge transfer which has been underrepresented in the RL literature so far. We formulate a terminology for the CBR framework targeted towards RL researchers with the goal of facilitating communication between the respective research communities. Based on this framework, we propose the Deep Case-based Policy Inference (DECAF) algorithm to accelerate learning by building a library of cases and reusing them if they are similar to a new task when training a new policy. DECAF guides the training by dynamically selecting and blending policies according to their usefulness for the current target task, reusing previously learned policies for a more effective exploration but still enabling the adaptation to particularities of the new task. We show an empirical evaluation in the Atari game playing domain depicting the benefits of our algorithm with regards to sample efficiency, robustness against negative transfer, and performance increase when compared to state-of-the-art methods. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Transferring knowledge as heuristics in reinforcement learning: A case-based approach
    Bianchi, Reinaldo A. C.
    Celiberto, Luiz A., Jr.
    Santos, Paulo E.
    Matsuura, Jackson P.
    Lopez de Mantaras, Ramon
    [J]. ARTIFICIAL INTELLIGENCE, 2015, 226 : 102 - 121
  • [2] Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning
    Bianchi, Reinaldo A. C.
    Santos, Paulo E.
    da Silva, Isaac J.
    Celiberto, Luiz A., Jr.
    de Mantaras, Ramon Lopez
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2018, 91 (02) : 301 - 312
  • [3] Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning
    Reinaldo A. C. Bianchi
    Paulo E. Santos
    Isaac J. da Silva
    Luiz A. Celiberto
    Ramon Lopez de Mantaras
    [J]. Journal of Intelligent & Robotic Systems, 2018, 91 : 301 - 312
  • [4] Towards Knowledge Transfer in Deep Reinforcement Learning
    Glatt, Ruben
    da Silva, Felipe Leno
    Reali Costa, Anna Helena
    [J]. PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), 2016, : 91 - 96
  • [5] Improving Deep Reinforcement Learning with Knowledge Transfer
    Glatt, Ruben
    Reali Costa, Anna Helena
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5036 - 5037
  • [6] REPAINT: Knowledge Transfer in Deep Reinforcement Learning
    Tao, Yunzhe
    Genc, Sahika
    Chung, Jonathan
    Sun, Tao
    Mallya, Sunil
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7145 - 7155
  • [7] The Case for Case-Based Transfer Learning
    Klenk, Matthew
    Aha, David W.
    Molineaux, Matt
    [J]. AI MAGAZINE, 2011, 32 (01) : 54 - 69
  • [8] Knowledge Reasoning Method Based on Deep Transfer Reinforcement Learning: DTRLpath
    Lin, Shiming
    Ye, Ling
    Zhuang, Yijie
    Lu, Lingyun
    Zheng, Shaoqiu
    Huang, Chenxi
    Kwee, Ng Yin
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 299 - 317
  • [9] Knowledge Transfer using Model-Based Deep Reinforcement Learning
    Boloka, Tlou
    Makondo, Ndivhuwo
    Rosman, Benjamin
    [J]. 2021 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2021,
  • [10] A case-based knowledge graph with reinforcement learning for intelligent design approach of complex product
    Huang, Yu
    Wang, Guoxin
    Wang, Ru
    Peng, Tao
    Li, Haokun
    Yan, Yan
    [J]. JOURNAL OF ENGINEERING DESIGN, 2024,