Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

被引:0
|
作者
Ghasemipour, Seyed Kamyar Seyed [1 ]
Freeman, Daniel [1 ]
David, Byron [1 ]
Gu, Shixiang Shane [1 ]
Kataoka, Satoshi [1 ]
Mordatch, Igor [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Assembly of multi-part physical structures is both a valuable end product for autonomous robotics, as well as a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Despite the simplicity of this objective, the compositional nature of building diverse blueprints from a set of blocks leads to an explosion of complexity in structures that agents encounter. Furthermore, assembly stresses agents' multi-step planning, physical reasoning, and bimanual coordination. We find that the combination of large-scale reinforcement learning and graph-based policies - surprisingly without any additional complexity - is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so. Through extensive experiments, we highlight the importance of largescale training, structured representations, contributions of multi-task vs. single-task learning, as well as the effects of curriculums, and discuss qualitative behaviors of trained agents. Our accompanying project webpage can be found at: sites.google.com/view/learning-direct-assembly
引用
收藏
页数:35
相关论文
共 50 条
  • [1] Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning
    Chen, Haokun
    Zhu, Chenxu
    Tang, Ruiming
    Zhang, Weinan
    He, Xiuqiang
    Yu, Yong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4018 - 4032
  • [2] Efficient Large-Scale Structured Learning
    Branson, Steve
    Beijbom, Oscar
    Belongie, Serge
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1806 - 1813
  • [3] Large-Scale Retrieval for Reinforcement Learning
    Humphreys, Peter C.
    Guez, Arthur
    Tieleman, Olivier
    Sifre, Laurent
    Weber, Theophane
    Lillicrap, Timothy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Tractable large-scale deep reinforcement learning
    Sarang, Nima
    Poullis, Charalambos
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 232
  • [5] Learning to Assemble Objects with a Robot Swarm
    Gebhardt, Gregor H. W.
    Daun, Kevin
    Schnaubelt, Marius
    Hendrich, Alexander
    Kauth, Daniel
    Neumann, Gerhard
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1547 - 1549
  • [6] Algorithms or Actions? A Study in Large-Scale Reinforcement Learning
    Tavares, Anderson Rocha
    Anbalagan, Sivasubramanian
    Marcolino, Leandro Soriano
    Chaimowicz, Luiz
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2717 - 2723
  • [7] Deep Reinforcement Learning for Large-Scale Epidemic Control
    Libin, Pieter J. K.
    Moonens, Arno
    Verstraeten, Timothy
    Perez-Sanjines, Fabian
    Hens, Niel
    Lemey, Philippe
    Nowe, Ann
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 155 - 170
  • [8] LEARNING TO ASSEMBLE CLASSIFIERS VIA GENETIC PROGRAMMING
    Acosta-Mendoza, Niusvel
    Morales-Reyes, Alicia
    Escalante, Hugo Jair
    Gago-Alonso, Andres
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (07)
  • [9] Latent Structured Perceptrons for Large-Scale Learning with Hidden Information
    Sun, Xu
    Matsuzaki, Takuya
    Li, Wenjie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (09) : 2063 - 2075
  • [10] Reinforcement learning in a large-scale photonic recurrent neural network
    Bueno, J.
    Maktoobi, S.
    Froehly, L.
    Fischer, I.
    Jacquot, M.
    Larger, L.
    Brunner, D.
    OPTICA, 2018, 5 (06): : 756 - 760