Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

Cited by: 0

Authors:

Ghasemipour, Seyed Kamyar Seyed [1]

Freeman, Daniel [1]

David, Byron [1]

Gu, Shixiang Shane [1]

Kataoka, Satoshi [1]

Mordatch, Igor [1]

Affiliations:

[1] Google Res, Mountain View, CA 94043 USA

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Assembly of multi-part physical structures is both a valuable end product for autonomous robotics and a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Despite the simplicity of this objective, the compositional nature of building diverse blueprints from a set of blocks leads to an explosion of complexity in structures that agents encounter. Furthermore, assembly stresses agents' multi-step planning, physical reasoning, and bimanual coordination. We find that the combination of large-scale reinforcement learning and graph-based policies - surprisingly without any additional complexity - is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so. Through extensive experiments, we highlight the importance of large-scale training, structured representations, contributions of multi-task vs. single-task learning, as well as the effects of curriculums, and discuss qualitative behaviors of trained agents. Our accompanying project webpage can be found at: sites.google.com/view/learning-direct-assembly

Pages: 35

50 items in total

[31] Reinforcement learning for optimal tracking of large-scale systems with multitime scales
Li, Jinna
Nie, Hao
Chai, Tianyou
Lewis, Frank L.
SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (07)
[32] Reinforcement learning for optimal tracking of large-scale systems with multitime scales
Jinna LI
Hao NIE
Tianyou CHAI
Frank L. LEWIS
Science China(Information Sciences), 2023, 66 (07) : 5 - 29
[33] Large-Scale and Adaptive Service Composition Using Deep Reinforcement Learning
Wang, Hongbing
Gu, Mingzhu
Yu, Qi
Fei, Huanhuan
Li, Jiajie
Tao, Yong
SERVICE-ORIENTED COMPUTING, ICSOC 2017, 2017, 10601 : 383 - 391
[34] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
Cao, Yuhong
Zhao, Rui
Wang, Yizhuo
Xiang, Bairan
Sartoretti, Guillaume
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638
[35] Adaptive and large-scale service composition based on deep reinforcement learning
Wang, Hongbing
Gu, Mingzhu
Yu, Qi
Tao, Yong
Li, Jiajie
Fei, Huanhuan
Yan, Jia
Zhao, Wei
Hong, Tianjing
KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 75 - 90
[36] NEAT for Large-Scale Reinforcement Learning through Evolutionary Feature Learning and Policy Gradient Search
Peng, Yiming
Chen, Gang
Singh, Harman
Zhang, Mengjie
GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 490 - 497
[37] Large-scale cost function learning for path planning using deep inverse reinforcement learning
Wulfmeier, Markus
Rao, Dushyant
Wang, Dominic Zeng
Ondruska, Peter
Posner, Ingmar
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2017, 36 (10) : 1073 - 1087
[38] Learning From Big Data: A Survey and Evaluation of Approximation Technologies for Large-scale Reinforcement Learning
Wu, Cheng
Wang, Yiming
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2017, : 1 - 8
[39] AssembleRL: Learning to Assemble Furniture from Their Point Clouds
Aslan, Ozgur
Bolat, Burak
Bal, Batuhan
Tumer, Tugba
Sahin, Erol
Kalkan, Sinan
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2748 - 2753
[40] Large-scale manifold learning
Talwalkar, Ameet
Kumar, Sanjiv
Rowley, Henry
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2554 - +