Transformed Successor Features for Transfer Reinforcement Learning

被引:0
|
作者
Garces, Kiyoshige [1 ]
Xuan, Junyu [1 ]
Zuo, Hua [1 ]
机构
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst AAII, 15 Broadway, Sydney, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
reinforcement learning; successor features; transfer reinforcement learning;
D O I
10.1007/978-981-99-8391-9_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning algorithms require an extensive number of samples to perform a specific task. To achieve the same performance on a new task, the agent must learn from scratch. Transfer reinforcement learning is an emerging solution that aims to improve sample efficiency by reusing previously learnt knowledge in new tasks. Successor feature is a technique aiming to reuse representations to leverage that knowledge in unseen tasks. Successor feature has achieved outstanding results on the assumption that the transition dynamics must remain across tasks. Initial Successor feature approach omits settings with different environment dynamics, common among real-life tasks in reinforcement learning problems. Our approach transformed successor feature projects a set of diverse dynamics into a common dynamic distribution. Hence, it is an initial solution to relax the restriction of transference across fixed environment dynamics. Experimental results indicate that the transformed successor feature improves the transfer of knowledge in environments with fixed and diverse dynamics under the control of a simulated robotic arm, a robotic leg, and the cartpole environment.
引用
收藏
页码:298 / 309
页数:12
相关论文
共 50 条
  • [1] Successor Features for Transfer in Reinforcement Learning
    Barreto, Andre
    Dabney, Will
    Munos, Remi
    Hunt, Jonathan J.
    Schaul, Tom
    van Hasselt, Hado
    Silver, David
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [2] Risk-Aware Transfer in Reinforcement Learning using Successor Features
    Gimelfarb, Michael
    Barreto, Andre
    Sanner, Scott
    Lee, Chi-Guhn
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
    Barreto, Andre
    Borsa, Diana
    Quan, John
    Schaul, Tom
    Silver, David
    Hessel, Matteo
    Mankowitz, Daniel
    Zidek, Augustin
    Munos, Remi
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [4] Dynamic Successor Features for transfer learning and guided exploration
    Tasfi, Norman
    Santana, Eder
    Liboni, Luisa
    Capretz, Miriam
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 267
  • [5] Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
    Zhang, Jingwei
    Springenberg, Jost Tobias
    Boedecker, Joschka
    Burgard, Wolfram
    [J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 2371 - 2378
  • [6] PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
    Filos, Angelos
    Lyle, Clare
    Gal, Yarin
    Levine, Sergey
    Jaques, Natasha
    Farquhar, Gregory
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [7] The successor representation in human reinforcement learning
    I. Momennejad
    E. M. Russek
    J. H. Cheong
    M. M. Botvinick
    N. D. Daw
    S. J. Gershman
    [J]. Nature Human Behaviour, 2017, 1 : 680 - 692
  • [8] The successor representation in human reinforcement learning
    Momennejad, I.
    Russek, E. M.
    Cheong, J. H.
    Botvinick, M. M.
    Daw, N. D.
    Gershman, S. J.
    [J]. NATURE HUMAN BEHAVIOUR, 2017, 1 (09): : 680 - 692
  • [9] Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features
    Wenzhang Liu
    Lu Dong
    Dan Niu
    Changyin Sun
    [J]. IEEE/CAA Journal of Automatica Sinica, 2022, 9 (09) : 1673 - 1686
  • [10] Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features
    Liu, Wenzhang
    Dong, Lu
    Niu, Dan
    Sun, Changyin
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (09) : 1673 - 1686