Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Cited by: 0
Authors
Oh, Junhyuk [1]
Singh, Satinder [1]
Lee, Honglak [1, 2]
Kohli, Pushmeet [3]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Google Brain, Mountain View, CA USA
[3] Microsoft Res, Mountain View, CA USA
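Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405
Abstract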
As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generalization over unseen instructions, we propose a new objective which encourages learning correspondences between similar subtasks by making analogies. For generalization over sequential instructions, we present a hierarchical architecture where a meta controller learns to use the acquired skills for executing the instructions. To deal with delayed reward, we propose a new neural architecture in the meta controller that learns when to update the subtask, which makes learning more efficient. Experimental results on a stochastic 3D domain show that the proposed ideas are crucial for generalization to longer instructions as well as unseen instructions.
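The control flow described in the abstract, a meta controller that walks through a list of instructions and decides at each step whether to keep or update the current subtask, can be illustrated with a minimal sketch. This is not the authors' architecture: the class `MetaController`, the function `run_episode`, the string observations, and the hand-written `should_update` gate are all hypothetical stand-ins, and in the paper the update decision is learned by a neural network rather than hard-coded.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class MetaController:
    """Toy meta controller (hypothetical): tracks which instruction is active."""
    instructions: List[str]
    # Hypothetical gate: returns True when the subtask should be updated.
    # In the paper this decision is learned; here it is supplied by the caller.
    should_update: Callable[[str, str], bool]
    pointer: int = 0

    def current_subtask(self) -> str:
        return self.instructions[self.pointer]

    def step(self, observation: str) -> str:
        # Advance to the next instruction only when the gate fires and
        # there is a next instruction; otherwise keep the current subtask.
        if (self.should_update(observation, self.current_subtask())
                and self.pointer < len(self.instructions) - 1):
            self.pointer += 1
        return self.current_subtask()


def run_episode(instructions: List[str], observations: List[str]) -> List[str]:
    """Return the subtask selected at each time step (assumed toy setup)."""
    # Assumption: an observation of the form "done:<subtask>" signals completion.
    gate = lambda obs, subtask: obs == f"done:{subtask}"
    controller = MetaController(instructions, gate)
    return [controller.step(obs) for obs in observations]
```

In this sketch the subtask changes only at the step where the gate fires, which mirrors the idea of updating the subtask occasionally rather than re-deciding it at every time step.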
Pages: 10