DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

被引:0
|
作者
Higgins, Irina [1 ]
Pal, Arka [1 ]
Rusu, Andrei [1 ]
Matthey, Loic [1 ]
Burgess, Christopher [1 ]
Pritzel, Alexander [1 ]
Botyinick, Matthew [1 ]
Blundell, Charles [1 ]
Lerchner, Alexander [1 ]
机构
[1] DeepMind, 6 Pancras Sq, London N1C 4AG, England
关键词
HIPPOCAMPAL; SYSTEMS; MEMORY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain adaptation is an important open problem in deep reinforcement learning (RL). In many scenarios of interest data is hard to obtain, so agents may learn a source policy in a setting where data is readily available, with the hope that it generalises well to the target domain We propose a new multi-stage RL agent, DARLA (DisentAngled Representation Learning Agent), which learns to see before learning to act. DARLA's vision is based on learning a disentangled representation of the observed environment. Once DARLA can see, it is able to acquire source policies that are robust to many domain shifts - even with no access to the target domain. DARLA significantly outperforms conventional baselines in zero-shot domain adaptation scenarios, an effect that holds across a variety of RL environments (Jaco arm, DeepMind Lab) and base RL algorithms (DQN, A3C and EC).
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Hypernetworks for Zero-Shot Transfer in Reinforcement Learning
    Rezaei-Shoshtari, Sahand
    Morissette, Charlotte
    Hogan, Francois R.
    Dudek, Gregory
    Meger, David
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9579 - 9587
  • [2] Zero-Shot Policy Transfer in Autonomous Racing: Reinforcement Learning vs Imitation Learning
    Hamilton, Nathaniel
    Musau, Patrick
    Lopez, Diego Manzanas
    Johnson, Taylor T.
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ASSURED AUTONOMY (ICAA 2022), 2022, : 11 - 20
  • [3] Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning
    Marom, Ofir
    Rosman, Benjamin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [4] A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
    Kirk, Robert
    Zhang, Amy
    Grefenstette, Edward
    Rocktäschel, Tim
    [J]. Journal of Artificial Intelligence Research, 2023, 76 : 201 - 264
  • [5] Combining ontology and reinforcement learning for zero-shot classification
    Liu, Bin
    Yao, Li
    Ding, Zheyuan
    Xu, Junyi
    Wu, Junfeng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 144 : 42 - 50
  • [6] Environment Generation for Zero-Shot Compositional Reinforcement Learning
    Gur, Izzeddin
    Jaques, Natasha
    Miao, Yingjie
    Choi, Jongwook
    Tiwari, Manoj
    Lee, Honglak
    Faust, Aleksandra
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
    Mazoure, Bogdan
    Kostrikov, Ilya
    Nachum, Ofir
    Tompson, Jonathan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [8] A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
    Kirk, Robert
    Zhang, Amy
    Grefenstette, Edward
    Rocktaeschel, Tim
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 76 : 201 - 264
  • [9] Relational Knowledge Transfer for Zero-Shot Learning
    Wang, Donghui
    Li, Yanan
    Lin, Yuetan
    Zhuang, Yueting
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2145 - 2151
  • [10] Zero-Shot Transfer Learning for Event Extraction
    Huang, Lifu
    Ji, Heng
    Cho, Kyunghyun
    Dagan, Ido
    Riedel, Sebastian
    Voss, Clare R.
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2160 - 2170