Combined Optimization and Reinforcement Learning for Manipulation Skills

被引：0

作者：

Englert, Peter ^{[1
]}

Toussaint, Marc ^{[1
]}

机构：

[1] Univ Stuttgart, Machine Learning & Robot Lab, Stuttgart, Germany

来源：

ROBOTICS: SCIENCE AND SYSTEMS XII | 2016年

关键词：

D O I：

暂无

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

This work addresses the problem of how a robot can improve a manipulation skill in a sample-efficient and secure manner. As an alternative to the standard reinforcement learning formulation where all objectives are defined in a single reward function, we propose a generalized formulation that consists of three components: 1) A known analytic control cost function; 2) A black-box return function; and 3) A black-box binary success constraint. While the overall policy optimization problem is high-dimensional, in typical robot manipulation problems we can assume that the black-box return and constraint only depend on a lower-dimensional projection of the solution. With our formulation we can exploit this structure for a sample-efficient learning framework that iteratively improves the policy with respect to the objective functions under the success constraint. We employ efficient 2nd-order optimization methods to optimize the high-dimensional policy w.r.t. the analytic cost function while keeping the lower dimensional projection fixed. This is alternated with safe Bayesian optimization over the lower-dimensional projection to address the black-box return and success constraint. During both improvement steps the success constraint is used to keep the optimization in a secure region and to clearly distinguish between motions that lead to success or failure. The learning algorithm is evaluated on a simulated benchmark problem and a door opening task with a PR2.

引用

页数：9

共 50 条

[21] Learning to Scaffold the Development of Robotic Manipulation Skills
Shao, Lin
Migimatsu, Toki
Bohg, Jeannette
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 5671 - 5677
[22] Reinforcement learning of competitive skills with soccer agents
Leng, Jinsong
Fyfe, Colin
Jain, Lakhmi
[J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT I, PROCEEDINGS, 2007, 4692 : 572 - +
[23] Discovering and Exploiting Skills in Hierarchical Reinforcement Learning
Huang, Zhigang
[J]. IEEE Access, 2024, 12 : 163042 - 163055
[24] Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Zhu, Yuke
Wang, Ziyu
Merel, Josh
Rusu, Andrei
Erez, Tom
Cabi, Serkan
Tunyasuvunakool, Saran
Kramar, Janos
Hadsell, Raia
de Freitas, Nando
Heess, Nicolas
[J]. ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
[25] The Option Keyboard Combining Skills in Reinforcement Learning
Barreto, Andre
Borsa, Diana
Hou, Shaobo
Comanici, Gheorghe
Aygun, Eser
Hamel, Philippe
Toyama, Daniel
Hunt, Jonathan
Mourad, Shibl
Silver, David
Precup, Doina
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[26] ACQUIRING ROBOT SKILLS VIA REINFORCEMENT LEARNING
GULLAPALLI, V
FRANKLIN, JA
BENBRAHIM, H
[J]. IEEE CONTROL SYSTEMS MAGAZINE, 1994, 14 (01): : 13 - 24
[27] Connectionist reinforcement learning of robot control skills
Araujo, R
Nunes, U
de Almeida, AT
[J]. COMPUTING ANTICIPATORY SYSTEMS: CASYS - FIRST INTERNATIONAL CONFERENCE, 1998, 437 : 364 - 373
[28] Reinforcement learning of motor skills with policy gradients
Peters, Jan
Schaal, Stefan
[J]. NEURAL NETWORKS, 2008, 21 (04) : 682 - 697
[29] Multi-UAV Assisted Offloading Optimization: A Game Combined Reinforcement Learning Approach
Gao, Ang
Wang, Qi
Chen, Kaiyue
Liang, Wei
[J]. IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2629 - 2633
[30] Precise atom manipulation through deep reinforcement learning
I-Ju Chen
Markus Aapro
Abraham Kipnis
Alexander Ilin
Peter Liljeroth
Adam S. Foster
[J]. Nature Communications, 13

← 1 2 3 4 5 →