Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

被引:0
|
作者
Ghasemipour, Seyed Kamyar Seyed [1 ]
Freeman, Daniel [1 ]
David, Byron [1 ]
Gu, Shixiang Shane [1 ]
Kataoka, Satoshi [1 ]
Mordatch, Igor [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Assembly of multi-part physical structures is both a valuable end product for autonomous robotics, as well as a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Despite the simplicity of this objective, the compositional nature of building diverse blueprints from a set of blocks leads to an explosion of complexity in structures that agents encounter. Furthermore, assembly stresses agents' multi-step planning, physical reasoning, and bimanual coordination. We find that the combination of large-scale reinforcement learning and graph-based policies - surprisingly without any additional complexity - is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so. Through extensive experiments, we highlight the importance of largescale training, structured representations, contributions of multi-task vs. single-task learning, as well as the effects of curriculums, and discuss qualitative behaviors of trained agents. Our accompanying project webpage can be found at: sites.google.com/view/learning-direct-assembly
引用
收藏
页数:35
相关论文
共 50 条
  • [31] Reinforcement learning for optimal tracking of large-scale systems with multitime scales
    Li, Jinna
    Nie, Hao
    Chai, Tianyou
    Lewis, Frank L.
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (07)
  • [32] Reinforcement learning for optimal tracking of large-scale systems with multitime scales
    Jinna LI
    Hao NIE
    Tianyou CHAI
    Frank L.LEWIS
    Science China(Information Sciences), 2023, 66 (07) : 5 - 29
  • [33] Large-Scale and Adaptive Service Composition Using Deep Reinforcement Learning
    Wang, Hongbing
    Gu, Mingzhu
    Yu, Qi
    Fei, Huanhuan
    Li, Jiajie
    Tao, Yong
    SERVICE-ORIENTED COMPUTING, ICSOC 2017, 2017, 10601 : 383 - 391
  • [34] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
    Cao, Yuhong
    Zhao, Rui
    Wang, Yizhuo
    Xiang, Bairan
    Sartoretti, Guillaume
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638
  • [35] Adaptive and large-scale service composition based on deep reinforcement learning
    Wang, Hongbing
    Gu, Mingzhu
    Yu, Qi
    Tao, Yong
    Li, Jiajie
    Fei, Huanhuan
    Yan, Jia
    Zhao, Wei
    Hong, Tianjing
    KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 75 - 90
  • [36] NEAT for Large-Scale Reinforcement Learning through Evolutionary Feature Learning and Policy Gradient Search
    Peng, Yiming
    Chen, Gang
    Singh, Harman
    Zhang, Mengjie
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 490 - 497
  • [37] Large-scale cost function learning for path planning using deep inverse reinforcement learning
    Wulfmeier, Markus
    Rao, Dushyant
    Wang, Dominic Zeng
    Ondruska, Peter
    Posner, Ingmar
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2017, 36 (10): : 1073 - 1087
  • [38] Learning From Big Data: A Survey and Evaluation of Approximation Technologies for Large-scale Reinforcement Learning
    Wu, Cheng
    Wang, Yiming
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2017, : 1 - 8
  • [39] AssembleRL: Learning to Assemble Furniture from Their Point Clouds
    Aslan, Ozgur
    Bolat, Burak
    Bal, Batuhan
    Tumer, Tugba
    Sahin, Erol
    Kalkan, Sinan
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2748 - 2753
  • [40] Large-scale manifold learning
    Talwalkar, Ameet
    Kumar, Sanjiv
    Rowley, Henry
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2554 - +