Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

Cited by: 0

Authors:

Ghasemipour, Seyed Kamyar Seyed [1]

Freeman, Daniel [1]

David, Byron [1]

Gu, Shixiang Shane [1]

Kataoka, Satoshi [1]

Mordatch, Igor [1]

Affiliations:

[1] Google Res, Mountain View, CA 94043 USA

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Assembly of multi-part physical structures is both a valuable end product for autonomous robotics and a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Despite the simplicity of this objective, the compositional nature of building diverse blueprints from a set of blocks leads to an explosion of complexity in structures that agents encounter. Furthermore, assembly stresses agents' multi-step planning, physical reasoning, and bimanual coordination. We find that the combination of large-scale reinforcement learning and graph-based policies - surprisingly without any additional complexity - is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so. Through extensive experiments, we highlight the importance of large-scale training, structured representations, contributions of multi-task vs. single-task learning, as well as the effects of curriculums, and discuss qualitative behaviors of trained agents. Our accompanying project webpage can be found at: sites.google.com/view/learning-direct-assembly


Pages: 35

