Generalization to New Actions in Reinforcement Learning

被引:0
|
作者
Jain, Ayush [1 ]
Szot, Andrew [1 ]
Lim, Joseph J. [1 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental trait of intelligence is the ability to achieve goals in the face of novel circumstances, such as making decisions from new action choices. However, standard reinforcement learning assumes a fixed set of actions and requires expensive retraining when given a new action set. To make learning agents more adaptable, we introduce the problem of zero-shot generalization to new actions. We propose a two-stage framework where the agent first infers action representations from action information acquired separately from the task. A policy flexible to varying action sets is then trained with generalization objectives. We benchmark generalization on sequential tasks, such as selecting from an unseen tool-set to solve physical reasoning puzzles and stacking towers with novel 3D shapes. Videos and code are available at https://sites.google.com/view/action-generalization.
引用
下载
收藏
页数:12
相关论文
共 50 条
  • [21] Offline reinforcement learning with representations for actions
    Lou, Xingzhou
    Yin, Qiyue
    Zhang, Junge
    Yu, Chao
    He, Zhaofeng
    Cheng, Nengjie
    Huang, Kaiqi
    INFORMATION SCIENCES, 2022, 610 : 746 - 758
  • [22] Abstraction and Generalization in Reinforcement Learning: A Summary and Framework
    Ponsen, Marc
    Taylor, Matthew E.
    Tuyls, Karl
    ADAPTIVE AND LEARNING AGENTS, 2010, 5924 : 1 - +
  • [23] Towards Min Max Generalization in Reinforcement Learning
    Fonteneau, Raphael
    Murphy, Susan A.
    Wehenkel, Louis
    Ernst, Damien
    AGENTS AND ARTIFICIAL INTELLIGENCE, 2011, 129 : 61 - +
  • [24] Decoupling Value and Policy for Generalization in Reinforcement Learning
    Raileanu, Roberta
    Fergus, Rob
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [25] Generalization in Reinforcement Learning by Soft Data Augmentation
    Hansen, Nicklas
    Wang, Xiaolong
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13611 - 13617
  • [26] Automatic Data Augmentation for Generalization in Reinforcement Learning
    Raileanu, Roberta
    Goldstein, Max
    Yarats, Denis
    Kostrikov, Ilya
    Fergus, Rob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] Instance-based Generalization in Reinforcement Learning
    Bertran, Martin
    Martinez, Natalia
    Phielipp, Mariano
    Sapiro, Guillermo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [28] Improving Generalization in Reinforcement Learning with Mixture Regularization
    Wang, Kaixin
    Kang, Bingyi
    Shao, Jie
    Feng, Jiashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [29] Novelty and Inductive Generalization in Human Reinforcement Learning
    Gershman, Samuel J.
    Niv, Yael
    TOPICS IN COGNITIVE SCIENCE, 2015, 7 (03) : 391 - 415
  • [30] Algebraic Reinforcement Learning Hypothesis Induction for Relational Reinforcement Learning Using Term Generalization
    Neubert, Stefanie
    Belzner, Lenz
    Wirsing, Martin
    LOGIC, REWRITING, AND CONCURRENCY, 2015, 9200 : 562 - 579