Combined Optimization and Reinforcement Learning for Manipulation Skills

被引：0

作者：

Englert, Peter ^{[1
]}

Toussaint, Marc ^{[1
]}

机构：

[1] Univ Stuttgart, Machine Learning & Robot Lab, Stuttgart, Germany

来源：

ROBOTICS: SCIENCE AND SYSTEMS XII | 2016年

关键词：

D O I：

暂无

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

This work addresses the problem of how a robot can improve a manipulation skill in a sample-efficient and secure manner. As an alternative to the standard reinforcement learning formulation where all objectives are defined in a single reward function, we propose a generalized formulation that consists of three components: 1) A known analytic control cost function; 2) A black-box return function; and 3) A black-box binary success constraint. While the overall policy optimization problem is high-dimensional, in typical robot manipulation problems we can assume that the black-box return and constraint only depend on a lower-dimensional projection of the solution. With our formulation we can exploit this structure for a sample-efficient learning framework that iteratively improves the policy with respect to the objective functions under the success constraint. We employ efficient 2nd-order optimization methods to optimize the high-dimensional policy w.r.t. the analytic cost function while keeping the lower dimensional projection fixed. This is alternated with safe Bayesian optimization over the lower-dimensional projection to address the black-box return and success constraint. During both improvement steps the success constraint is used to keep the optimization in a secure region and to clearly distinguish between motions that lead to success or failure. The learning algorithm is evaluated on a simulated benchmark problem and a door opening task with a PR2.

引用

页数：9

共 50 条

[31] A Hybrid Deep Reinforcement Learning Algorithm for Intelligent Manipulation
Ma, Chao
Li, Jianfei
Bai, Jie
Wang, Yaobing
Liu, Bin
Sun, Jing
[J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT IV, 2019, 11743 : 367 - 377
[32] Learning Global Optimization by Deep Reinforcement Learning
da Silva Filho, Moesio Wenceslau
Barbosa, Gabriel A.
Miranda, Pericles B. C.
[J]. INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 417 - 433
[33] Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning
Yuan, Weihao
Stork, Johannes A.
Kragic, Danica
Wang, Michael Y.
Hang, Kaiyu
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 270 - 277
[34] Precise atom manipulation through deep reinforcement learning
Chen, I-Ju
Aapro, Markus
Kipnis, Abraham
Ilin, Alexander
Liljeroth, Peter
Foster, Adam S.
[J]. NATURE COMMUNICATIONS, 2022, 13 (01)
[35] Reinforcement Learning for 4-Finger-Gripper Manipulation
de Andres, Marco Ojer
Ardakani, M. Mahdi Ghazaei
Robertsson, Anders
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 4257 - 4262
[36] Reinforcement Learning With Sequences of Motion Primitives for Robust Manipulation
Stulp, Freek
Theodorou, Evangelos A.
Schaal, Stefan
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2012, 28 (06) : 1360 - 1370
[37] Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery
Cho, Daesol
Kim, Jigang
Kim, H. Jin
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7455 - 7462
[38] Integrating Reinforcement Learning and Learning From Demonstrations to Learn Nonprehensile Manipulation
Sun, Xilong
Li, Jiqing
Kovalenko, Anna Vladimirovna
Feng, Wei
Ou, Yongsheng
[J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 1735 - 1744
[39] Learning intraoperative organ manipulation with context-based reinforcement learning
D'Ettorre, Claudia
Zirino, Silvia
Dei, Neri Niccolo
Stilli, Agostino
De Momi, Elena
Stoyanov, Danail
[J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (08) : 1419 - 1427
[40] Learning positioning policies for mobile manipulation operations with deep reinforcement learning
Iriondo, Ander
Lazkano, Elena
Ansuategi, Ander
Rivera, Andoni
Lluvia, Iker
Tubio, Carlos
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (09) : 3003 - 3023

← 1 2 3 4 5 →