Safe Exploration for Active Learning with Gaussian Processes

被引:41
|
作者
Schreiter, Jens [1 ]
Duy Nguyen-Tuong [1 ]
Eberts, Mona [1 ]
Bischoff, Bastian [1 ]
Markert, Heiner [1 ]
Toussaint, Marc [2 ]
机构
[1] Robert Bosch GmbH, D-70442 Stuttgart, Germany
[2] Univ Stuttgart, MLR Lab, D-70569 Stuttgart, Germany
关键词
APPROXIMATIONS;
D O I
10.1007/978-3-319-23461-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.
引用
收藏
页码:133 / 149
页数:17
相关论文
共 50 条
  • [21] Bayesian Active Learning with Fully Bayesian Gaussian Processes
    Riis, Christoffer
    Antunes, Francisco
    Huttel, Frederik Boe
    Azevedo, Carlos Lima
    Pereira, Francisco Camara
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [22] Active Learning with Gaussian Processes for High Throughput Phenotyping
    Kumar, Sumit
    Luo, Wenhao
    Kantor, George
    Sycara, Katia
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2078 - 2080
  • [23] Active Learning with Maximum Margin Sparse Gaussian Processes
    Shi, Weishi
    Yu, Qi
    [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 406 - +
  • [24] Model-free safe reinforcement learning for chemical processes using Gaussian processes
    Savage, Thomas
    Zhang, Dongda
    Mowbray, Max
    Chanona, Ehecatl Antonio Del Rio
    [J]. IFAC PAPERSONLINE, 2021, 54 (03): : 504 - 509
  • [25] Safe Learning of Regions of Attraction for Uncertain, Nonlinear Systems with Gaussian Processes
    Berkenkamp, Felix
    Moriconi, Riccardo
    Schoellig, Angela P.
    Krause, Andreas
    [J]. 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 4661 - 4666
  • [26] A visual exploration of Gaussian processes
    Görtler, Jochen
    Kehlbeck, Rebecca
    Deussen, Oliver
    [J]. Distill, 2019, 4 (04):
  • [27] Safe Exploration Learning Supported Model Predictive Control of Repetitive Processes
    Morabito, Bruno
    Nguyen, Hoang Hai
    Matschek, Janine
    Findeisen, Rolf
    [J]. 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2631 - 2636
  • [28] Efficiently Computable Safety Bounds for Gaussian Processes in Active Learning
    Tebbe, Joern
    Zimmer, Christoph
    Steland, Ansgar
    Lange-Hegermann, Markus
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [29] Nonmyopic ε-Bayes-Optimal Active Learning of Gaussian Processes
    Trong Nghia Hoang
    Low, Kian Hsiang
    Jaillet, Patrick
    Kankanhalli, Mohan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 739 - 747
  • [30] Bayesian Active Learning for Choice Models With Deep Gaussian Processes
    Yang, Jie
    Klabjan, Diego
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) : 1080 - 1092