Combined Optimization and Reinforcement Learning for Manipulation Skills

被引:0
|
作者
Englert, Peter [1 ]
Toussaint, Marc [1 ]
机构
[1] Univ Stuttgart, Machine Learning & Robot Lab, Stuttgart, Germany
关键词
D O I
暂无
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This work addresses the problem of how a robot can improve a manipulation skill in a sample-efficient and secure manner. As an alternative to the standard reinforcement learning formulation where all objectives are defined in a single reward function, we propose a generalized formulation that consists of three components: 1) A known analytic control cost function; 2) A black-box return function; and 3) A black-box binary success constraint. While the overall policy optimization problem is high-dimensional, in typical robot manipulation problems we can assume that the black-box return and constraint only depend on a lower-dimensional projection of the solution. With our formulation we can exploit this structure for a sample-efficient learning framework that iteratively improves the policy with respect to the objective functions under the success constraint. We employ efficient 2nd-order optimization methods to optimize the high-dimensional policy w.r.t. the analytic cost function while keeping the lower dimensional projection fixed. This is alternated with safe Bayesian optimization over the lower-dimensional projection to address the black-box return and success constraint. During both improvement steps the success constraint is used to keep the optimization in a secure region and to clearly distinguish between motions that lead to success or failure. The learning algorithm is evaluated on a simulated benchmark problem and a door opening task with a PR2.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] A Hybrid Deep Reinforcement Learning Algorithm for Intelligent Manipulation
    Ma, Chao
    Li, Jianfei
    Bai, Jie
    Wang, Yaobing
    Liu, Bin
    Sun, Jing
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT IV, 2019, 11743 : 367 - 377
  • [32] Learning Global Optimization by Deep Reinforcement Learning
    da Silva Filho, Moesio Wenceslau
    Barbosa, Gabriel A.
    Miranda, Pericles B. C.
    [J]. INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 417 - 433
  • [33] Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning
    Yuan, Weihao
    Stork, Johannes A.
    Kragic, Danica
    Wang, Michael Y.
    Hang, Kaiyu
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 270 - 277
  • [34] Precise atom manipulation through deep reinforcement learning
    Chen, I-Ju
    Aapro, Markus
    Kipnis, Abraham
    Ilin, Alexander
    Liljeroth, Peter
    Foster, Adam S.
    [J]. NATURE COMMUNICATIONS, 2022, 13 (01)
  • [35] Reinforcement Learning for 4-Finger-Gripper Manipulation
    de Andres, Marco Ojer
    Ardakani, M. Mahdi Ghazaei
    Robertsson, Anders
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 4257 - 4262
  • [36] Reinforcement Learning With Sequences of Motion Primitives for Robust Manipulation
    Stulp, Freek
    Theodorou, Evangelos A.
    Schaal, Stefan
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2012, 28 (06) : 1360 - 1370
  • [37] Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery
    Cho, Daesol
    Kim, Jigang
    Kim, H. Jin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7455 - 7462
  • [38] Integrating Reinforcement Learning and Learning From Demonstrations to Learn Nonprehensile Manipulation
    Sun, Xilong
    Li, Jiqing
    Kovalenko, Anna Vladimirovna
    Feng, Wei
    Ou, Yongsheng
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 1735 - 1744
  • [39] Learning intraoperative organ manipulation with context-based reinforcement learning
    D'Ettorre, Claudia
    Zirino, Silvia
    Dei, Neri Niccolo
    Stilli, Agostino
    De Momi, Elena
    Stoyanov, Danail
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (08) : 1419 - 1427
  • [40] Learning positioning policies for mobile manipulation operations with deep reinforcement learning
    Iriondo, Ander
    Lazkano, Elena
    Ansuategi, Ander
    Rivera, Andoni
    Lluvia, Iker
    Tubio, Carlos
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (09) : 3003 - 3023