Safe Exploration for Active Learning with Gaussian Processes

被引:41
|
作者
Schreiter, Jens [1 ]
Duy Nguyen-Tuong [1 ]
Eberts, Mona [1 ]
Bischoff, Bastian [1 ]
Markert, Heiner [1 ]
Toussaint, Marc [2 ]
机构
[1] Robert Bosch GmbH, D-70442 Stuttgart, Germany
[2] Univ Stuttgart, MLR Lab, D-70569 Stuttgart, Germany
关键词
APPROXIMATIONS;
D O I
10.1007/978-3-319-23461-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.
引用
收藏
页码:133 / 149
页数:17
相关论文
共 50 条
  • [31] Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
    Jung, Tobias
    Stone, Peter
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6321 : 601 - 616
  • [32] Decentralized Multi-Agent Exploration with Online-Learning of Gaussian Processes
    Viseras, Alberto
    Wiedemann, Thomas
    Manss, Christoph
    Magel, Lukas
    Mueller, Joachim
    Shutin, Dmitriy
    Merino, Luis
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 4222 - 4229
  • [33] Spatially Adaptive Classification and Active Learning of Multispectral Data with Gaussian Processes
    Jun, Goo
    Vatsavai, Ranga Raju
    Ghosh, Joydeep
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 597 - +
  • [34] Active learning of Gaussian processes with manifold-preserving graph reduction
    Jin Zhou
    Shiliang Sun
    [J]. Neural Computing and Applications, 2014, 25 : 1615 - 1625
  • [35] Active Online Learning for Interactive Segmentation Using Sparse Gaussian Processes
    Triebel, Rudolph
    Stuehmer, Jan
    Souiai, Mohamed
    Cremers, Daniel
    [J]. PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 641 - 652
  • [36] Active learning of Gaussian processes with manifold-preserving graph reduction
    Zhou, Jin
    Sun, Shiliang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2014, 25 (7-8): : 1615 - 1625
  • [37] Safe Active Dynamics Learning and Control: A Sequential Exploration-Exploitation Framework
    Lew, Thomas
    Sharma, Apoorva
    Harrison, James
    Bylard, Andrew
    Pavone, Marco
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (05) : 2888 - 2907
  • [38] Bayesian Active Learning for Scanning Probe Microscopy: From Gaussian Processes to Hypothesis Learning
    Ziatdinov, Maxim
    Liu, Yongtao
    Kelley, Kyle
    Vasudevan, Rama
    V. Kalinin, Sergei
    [J]. ACS NANO, 2022, 16 (09) : 13492 - 13512
  • [39] Safe Controller Optimization for Quadrotors with Gaussian Processes
    Berkenkamp, Felix
    Schoellig, Angela P.
    Krause, Andreas
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 491 - 496
  • [40] Stagewise Safe Bayesian Optimization with Gaussian Processes
    Sui, Yanan
    Zhuang, Vincent
    Burdick, Joel W.
    Yue, Yisong
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80