COMPOSUITE: A COMPOSITIONAL REINFORCEMENT LEARNING BENCHMARK

Cited by: 0
Authors
Mendez, Jorge A. [1]
Hussing, Marcel [1]
Gummadi, Meghna [1]
Eaton, Eric [1]
Affiliations
[1] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
Keywords
ABSTRACTION
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
We present CompoSuite, an open-source simulated robotic manipulation benchmark for compositional multi-task reinforcement learning (RL). Each CompoSuite task requires a particular robot arm to manipulate one individual object to achieve a task objective while avoiding an obstacle. This compositional definition of the tasks endows CompoSuite with two remarkable properties. First, varying the robot/object/objective/obstacle elements leads to hundreds of RL tasks, each of which requires a meaningfully different behavior. Second, RL approaches can be evaluated specifically for their ability to learn the compositional structure of the tasks. This latter capability to functionally decompose problems would enable intelligent agents to identify and exploit commonalities between learning tasks to handle large varieties of highly diverse problems. We benchmark existing single-task, multi-task, and compositional learning algorithms on various training settings, and assess their capability to compositionally generalize to unseen tasks. Our evaluation exposes the shortcomings of existing RL approaches with respect to compositionality and opens new avenues for investigation.
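The compositional task definition in the abstract can be illustrated with a minimal sketch: each task is one combination of a robot, an object, an objective, and an obstacle, so the task set is the Cartesian product of the four axes. The element names and the choice of four options per axis are placeholder assumptions for illustration; the abstract does not enumerate them. The helper names (`enumerate_tasks`, `split_tasks`, `covers_all_elements`) are likewise hypothetical and are not part of the CompoSuite API.

```python
from itertools import product
import random

# Placeholder element names (assumptions): the abstract names the four axes
# but does not list the concrete options on each axis.
ROBOTS = ["robot_a", "robot_b", "robot_c", "robot_d"]
OBJECTS = ["object_a", "object_b", "object_c", "object_d"]
OBJECTIVES = ["objective_a", "objective_b", "objective_c", "objective_d"]
OBSTACLES = ["obstacle_a", "obstacle_b", "obstacle_c", "obstacle_d"]
AXES = (ROBOTS, OBJECTS, OBJECTIVES, OBSTACLES)


def enumerate_tasks():
    """Return every (robot, object, objective, obstacle) combination."""
    return list(product(*AXES))


def split_tasks(tasks, n_train, seed=0):
    """Shuffle the tasks and split them into training and held-out sets."""
    rng = random.Random(seed)
    shuffled = list(tasks)
    rng.shuffle(shuffled)
    return shuffled[:n_train], shuffled[n_train:]


def covers_all_elements(task_subset):
    """Check that every element of every axis appears in the subset.

    A learner can only recombine elements it has seen, so a training set
    used to test compositional generalization should pass this check.
    """
    return all(
        {task[i] for task in task_subset} == set(axis)
        for i, axis in enumerate(AXES)
    )


tasks = enumerate_tasks()
train, held_out = split_tasks(tasks, n_train=56)
print(len(tasks), len(train), len(held_out))
```

Held-out tasks in such a split consist only of unseen combinations, which is one way to probe the compositional generalization the abstract describes. The split size of 56 is arbitrary.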
Pages: 22