Combined Optimization and Reinforcement Learning for Manipulation Skills

Cited by: 0
Authors
Englert, Peter [1]
Toussaint, Marc [1]
Affiliations
[1] Univ Stuttgart, Machine Learning & Robot Lab, Stuttgart, Germany
DOI
Not available
Chinese Library Classification number
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
This work addresses the problem of how a robot can improve a manipulation skill in a sample-efficient and secure manner. As an alternative to the standard reinforcement learning formulation, where all objectives are defined in a single reward function, we propose a generalized formulation that consists of three components: (1) a known analytic control cost function; (2) a black-box return function; and (3) a black-box binary success constraint. While the overall policy optimization problem is high-dimensional, in typical robot manipulation problems we can assume that the black-box return and constraint depend only on a lower-dimensional projection of the solution. Our formulation exploits this structure in a sample-efficient learning framework that iteratively improves the policy with respect to the objective functions under the success constraint. We employ efficient second-order optimization methods to optimize the high-dimensional policy with respect to the analytic cost function while keeping the lower-dimensional projection fixed. This is alternated with safe Bayesian optimization over the lower-dimensional projection to address the black-box return and success constraint. During both improvement steps, the success constraint is used to keep the optimization in a secure region and to clearly distinguish between motions that lead to success and those that lead to failure. The learning algorithm is evaluated on a simulated benchmark problem and a door-opening task with a PR2.
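The alternation described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: every name, constant, and modeling choice below is an assumption made for illustration. The analytic control cost is taken to be a quadratic smoothness term, the projection P picks a single trajectory midpoint, `black_box_rollout` is a hypothetical stand-in for a robot trial, and the safe Bayesian optimization step is replaced by a much cruder heuristic: two zero-mean Gaussian processes over a grid, one on the return and one on ±1 success labels, with candidates restricted to points whose lower confidence bound on success is positive. That heuristic carries none of the guarantees of an actual safe Bayesian optimization method.

```python
import numpy as np

def rbf(a, b, ell=0.3):
    """Squared-exponential kernel between two 1-D arrays of inputs."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / ell) ** 2)

def gp_posterior(x_train, y_train, x_query, noise=1e-4):
    """Zero-mean GP regression: posterior mean and std at x_query."""
    K = rbf(x_train, x_train) + noise * np.eye(len(x_train))
    Ks = rbf(x_query, x_train)
    mean = Ks @ np.linalg.solve(K, y_train)
    var = 1.0 - np.sum(Ks * np.linalg.solve(K, Ks.T).T, axis=1)
    return mean, np.sqrt(np.clip(var, 0.0, None))

def optimize_policy(H, P, theta):
    """Minimize 0.5 x^T H x subject to P x = theta by solving the KKT system.
    For a quadratic cost this is a single exact Newton step."""
    n, m = H.shape[0], P.shape[0]
    kkt = np.block([[H, P.T], [P, np.zeros((m, m))]])
    rhs = np.concatenate([np.zeros(n), theta])
    return np.linalg.solve(kkt, rhs)[:n]

def black_box_rollout(x, P):
    """Hypothetical robot trial: return and success depend on x only via P x."""
    theta = (P @ x)[0]
    return -(theta - 1.2) ** 2, bool(theta < 1.5)

def learn(n=21, iters=15, beta=2.0, theta0=0.5):
    """Alternate policy optimization (fixed projection) with a GP-based
    search over the projection. theta0 is assumed to be a known success."""
    D = np.diff(np.eye(n), n=2, axis=0)       # second-difference operator
    H = D.T @ D + 1e-3 * np.eye(n)            # smoothness cost + regularizer
    P = np.zeros((1, n))
    P[0, n // 2] = 1.0                        # projection: trajectory midpoint
    grid = np.linspace(0.0, 2.0, 81)          # candidate projections
    thetas, returns, labels = [], [], []
    theta = theta0
    for _ in range(iters):
        # Step 1: optimize the high-dimensional policy for the current theta.
        x = optimize_policy(H, P, np.array([theta]))
        ret, ok = black_box_rollout(x, P)
        thetas.append(theta)
        returns.append(ret)
        labels.append(1.0 if ok else -1.0)
        # Step 2: pick the next theta inside the estimated safe set.
        t = np.array(thetas)
        mu_r, sd_r = gp_posterior(t, np.array(returns), grid)
        mu_s, sd_s = gp_posterior(t, np.array(labels), grid)
        safe = mu_s - beta * sd_s > 0.0
        if not safe.any():
            break
        ucb = np.where(safe, mu_r + beta * sd_r, -np.inf)
        theta = grid[np.argmax(ucb)]
    successes = [i for i, lab in enumerate(labels) if lab > 0]
    best = max(successes, key=lambda i: returns[i])
    x_best = optimize_policy(H, P, np.array([thetas[best]]))
    return thetas[best], returns[best], x_best
```

Calling `learn()` returns the best successful projection value found, its return, and the corresponding minimum-cost policy vector. Because the safe set grows only near previously successful trials, the search expands outward gradually from `theta0`, which is the qualitative behavior the abstract attributes to the success constraint.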
Pages: 9