Safe Exploration for Active Learning with Gaussian Processes

被引：41

作者：

Schreiter, Jens ^{[1
]}

Duy Nguyen-Tuong ^{[1
]}

Eberts, Mona ^{[1
]}

Bischoff, Bastian ^{[1
]}

Markert, Heiner ^{[1
]}

Toussaint, Marc ^{[2
]}

机构：

[1] Robert Bosch GmbH, D-70442 Stuttgart, Germany

[2] Univ Stuttgart, MLR Lab, D-70569 Stuttgart, Germany

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III | 2015年 / 9286卷

关键词：

APPROXIMATIONS;

D O I：

10.1007/978-3-319-23461-8_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.

引用

页码：133 / 149

页数：17

共 50 条

[21] Bayesian Active Learning with Fully Bayesian Gaussian Processes
Riis, Christoffer
Antunes, Francisco
Huttel, Frederik Boe
Azevedo, Carlos Lima
Pereira, Francisco Camara
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[22] Active Learning with Gaussian Processes for High Throughput Phenotyping
Kumar, Sumit
Luo, Wenhao
Kantor, George
Sycara, Katia
[J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2078 - 2080
[23] Active Learning with Maximum Margin Sparse Gaussian Processes
Shi, Weishi
Yu, Qi
[J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 406 - +
[24] Model-free safe reinforcement learning for chemical processes using Gaussian processes
Savage, Thomas
Zhang, Dongda
Mowbray, Max
Chanona, Ehecatl Antonio Del Rio
[J]. IFAC PAPERSONLINE, 2021, 54 (03): : 504 - 509
[25] Safe Learning of Regions of Attraction for Uncertain, Nonlinear Systems with Gaussian Processes
Berkenkamp, Felix
Moriconi, Riccardo
Schoellig, Angela P.
Krause, Andreas
[J]. 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 4661 - 4666
[26] A visual exploration of Gaussian processes
Görtler, Jochen
Kehlbeck, Rebecca
Deussen, Oliver
[J]. Distill, 2019, 4 (04):
[27] Safe Exploration Learning Supported Model Predictive Control of Repetitive Processes
Morabito, Bruno
Nguyen, Hoang Hai
Matschek, Janine
Findeisen, Rolf
[J]. 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2631 - 2636
[28] Efficiently Computable Safety Bounds for Gaussian Processes in Active Learning
Tebbe, Joern
Zimmer, Christoph
Steland, Ansgar
Lange-Hegermann, Markus
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
[29] Nonmyopic ε-Bayes-Optimal Active Learning of Gaussian Processes
Trong Nghia Hoang
Low, Kian Hsiang
Jaillet, Patrick
Kankanhalli, Mohan
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 739 - 747
[30] Bayesian Active Learning for Choice Models With Deep Gaussian Processes
Yang, Jie
Klabjan, Diego
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) : 1080 - 1092

← 1 2 3 4 5 →