Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search

被引:0
|
作者
Pautrat, Remi [1 ,2 ,3 ]
Chatzilygeroudis, Konstantinos [1 ,2 ,3 ]
Mouret, Jean-Baptiste [1 ,2 ,3 ]
机构
[1] INRIA, F-54600 Villers Les Nancy, France
[2] CNRS, Loria, UMR 7503, F-54500 Vandoeuvre Les Nancy, France
[3] Univ Lorraine, Loria, UMR 7503, F-54500 Vandoeuvre Les Nancy, France
基金
欧洲研究理事会; 欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the most interesting features of Bayesian optimization for direct policy search is that it can leverage priors (e.g., from simulation or from previous tasks) to accelerate learning on a robot. In this paper, we are interested in situations for which several priors exist but we do not know in advance which one fits best the current situation. We tackle this problem by introducing a novel acquisition function, called Most Likely Expected Improvement (MLEI), that combines the likelihood of the priors and the expected improvement. We evaluate this new acquisition function on a transfer learning task for a 5-DOF planar arm and on a possibly damaged, 6-legged robot that has to learn to walk on flat ground and on stairs, with priors corresponding to different stairs and different kinds of damages. Our results show that MLEI effectively identifies and exploits the priors, even when there is no obvious match between the current situations and the priors.
引用
收藏
页码:7571 / 7578
页数:8
相关论文
共 50 条
  • [1] Data-Efficient Domain Randomization With Bayesian Optimization
    Muratore, Fabio
    Eilers, Christian
    Gienger, Michael
    Peters, Jan
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 911 - 918
  • [2] Data-Efficient Policy Evaluation Through Behavior Policy Search
    Hanna, Josiah P.
    Thomas, Philip S.
    Stone, Peter
    Niekum, Scott
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [3] Data-Efficient Automatic Model Selection in Unsupervised Anomaly Detection
    Gudur, Gautham Krishna
    Raaghul, R.
    Adithya, K.
    Vasudevan, Shrihari
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1443 - 1448
  • [4] Data-Efficient Autotuning With Bayesian Optimization: An Industrial Control Study
    Neumann-Brosig, Matthias
    Marco, Alonso
    Schwarzmann, Dieter
    Trimpe, Sebastian
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2020, 28 (03) : 730 - 740
  • [5] Data-Efficient Bayesian Optimization with Constraints for Power Amplifier Design
    Knudde, Nicolas
    Couckuyt, Ivo
    Spina, Domenico
    Lukasik, Konstanty
    Barmuta, Pawel
    Schreurs, Dominique
    Dhaene, Tom
    [J]. 2018 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2018,
  • [6] Data-efficient autotuning with bayesian optimization: An industrial control study
    Neumann-Brosig, Matthias
    Marco, Alonso
    Schwarzmann, Dieter
    Trimpe, Sebastian
    [J]. IEEE Transactions on Control Systems Technology, 2020, 28 (03): : 730 - 740
  • [7] Black-Box Data-efficient Policy Search for Robotics
    Chatzilygeroudis, Konstantinos
    Rama, Roberto
    Kaushik, Rituraj
    Goepp, Dorian
    Vassiliades, Vassilis
    Mouret, Jean-Baptiste
    [J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 51 - 58
  • [8] Guided Bayesian Optimization: Data-Efficient Controller Tuning With Digital Twin
    Nobar, Mahdi
    Keller, Jurg
    Rupenyan, Alisa
    Khosravi, Mohammad
    Lygeros, John
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [9] Bayesian Optimization for Policy Search in High-Dimensional Systems via Automatic Domain Selection
    Frohlich, Lukas P.
    Klenske, Edgar D.
    Daniel, Christian G.
    Zeilinger, Melanie N.
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 757 - 764
  • [10] Data-Efficient Control Policy Search using Residual Dynamics Learning
    Saveriano, Matteo
    Yin, Yuchao
    Falco, Pietro
    Lee, Dongheui
    [J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 4709 - 4715