Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

被引:0
|
作者
Ghasemipour, Seyed Kamyar Seyed [1 ]
Freeman, Daniel [1 ]
David, Byron [1 ]
Gu, Shixiang Shane [1 ]
Kataoka, Satoshi [1 ]
Mordatch, Igor [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Assembly of multi-part physical structures is both a valuable end product for autonomous robotics, as well as a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Despite the simplicity of this objective, the compositional nature of building diverse blueprints from a set of blocks leads to an explosion of complexity in structures that agents encounter. Furthermore, assembly stresses agents' multi-step planning, physical reasoning, and bimanual coordination. We find that the combination of large-scale reinforcement learning and graph-based policies - surprisingly without any additional complexity - is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so. Through extensive experiments, we highlight the importance of largescale training, structured representations, contributions of multi-task vs. single-task learning, as well as the effects of curriculums, and discuss qualitative behaviors of trained agents. Our accompanying project webpage can be found at: sites.google.com/view/learning-direct-assembly
引用
收藏
页数:35
相关论文
共 50 条
  • [21] Planning Large-scale Object Rearrangement Using Deep Reinforcement Learning
    Ghosh, Sourav
    Das, Dipanjan
    Chakraborty, Abhishek
    Agarwal, Marichi
    Bhowmick, Brojeshwar
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [22] Deep reinforcement learning for scheduling in large-scale networked control systems
    Redder, Adrian
    Ramaswamy, Arunselvan
    Quevedo, Daniel E.
    IFAC PAPERSONLINE, 2019, 52 (20): : 333 - 338
  • [23] A Reinforcement Learning Based Large-Scale Refinery Production Scheduling Algorithm
    Chen, Yuandong
    Ding, Jinliang
    Chen, Qingda
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6041 - 6055
  • [24] Large-scale and adaptive service composition based on deep reinforcement learning
    Liu, Jiang-Wen
    Hu, Li-Qiang
    Cai, Zhao-Quan
    Xing, Li-Ning
    Tan, Xu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
  • [25] Learning brain regions via large-scale online structured sparse dictionary-learning
    Dohmatob, Elvis
    Mensch, Arthur
    Varoquaux, Gael
    Thirion, Bertrand
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [26] Deep Reinforcement Learning for Network Service Recovery in Large-scale Failures
    Akashi, Kazuaki
    Fukuda, Nobukazu
    Kanai, Shunsuke
    Tayama, Kenichi
    2023 19TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM, 2023,
  • [27] NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching
    Zhang, Hongbo
    Wang, Guang
    Wang, Xu
    Zhou, Zhengyang
    Zhang, Chen
    Dong, Zheng
    Wang, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 401 - 409
  • [28] Large-Scale Traffic Grid Signal Control with Regional Reinforcement Learning
    Chu, Tianshu
    Qu, Shuhui
    Wang, Jie
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 815 - 820
  • [29] Reinforcement Learning for Sustainability: Adapting in large-scale heterogeneous dynamic environments
    Dusparic, Ivana
    2022 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS COMPANION (ACSOS-C 2022), 2022, : 49 - 50
  • [30] Automatic Hierarchical Reinforcement Learning for Efficient Large-scale Service Composition
    Wang, Hongbing
    Huang, Guicheng
    Yu, Qi
    2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 57 - 64