Safe Exploration for Active Learning with Gaussian Processes

被引：41

作者：

Schreiter, Jens ^{[1
]}

Duy Nguyen-Tuong ^{[1
]}

Eberts, Mona ^{[1
]}

Bischoff, Bastian ^{[1
]}

Markert, Heiner ^{[1
]}

Toussaint, Marc ^{[2
]}

机构：

[1] Robert Bosch GmbH, D-70442 Stuttgart, Germany

[2] Univ Stuttgart, MLR Lab, D-70569 Stuttgart, Germany

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III | 2015年 / 9286卷

关键词：

APPROXIMATIONS;

D O I：

10.1007/978-3-319-23461-8_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.

引用

页码：133 / 149

页数：17

共 50 条

[1] Safe Exploration for Optimization with Gaussian Processes
Sui, Yanan
Gotovos, Alkis
Burdick, Joel W.
Krause, Andreas
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 997 - 1005
[2] Safe Active Learning for Multi-Output Gaussian Processes
Li, Cen-You
Rakitsch, Barbara
Zimmer, Christoph
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[3] Adaptive Exploration-Exploitation Active Learning of Gaussian Processes
Kontoudis, George P.
Otte, Michael
[J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9448 - 9455
[4] Benefits of Monotonicity in Safe Exploration with Gaussian Processes
Losalka, Arpan
Scarlett, Jonathan
[J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1304 - 1314
[5] Safe Active Learning for Time-Series Modeling with Gaussian Processes
Zimmer, Christoph
Meister, Mona
Duy Nguyen-Tuong
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[6] Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
Turchetta, Matteo
Berkenkamp, Felix
Krause, Andreas
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[7] Active Tactile Object Exploration with Gaussian Processes
Yi, Zhengkun
Calandra, Roberto
Veiga, Filipe
van Hoof, Herke
Hermans, Tucker
Zhang, Yilei
Peters, Jan
[J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4925 - 4930
[8] Safe and Robust Learning Control with Gaussian Processes
Berkenkamp, Felix
Schoellig, Angela P.
[J]. 2015 EUROPEAN CONTROL CONFERENCE (ECC), 2015, : 2496 - 2501
[9] Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes
Wachi, Akifumi
Sui, Yanan
Yue, Yisong
Ono, Masahiro
[J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6548 - 6555
[10] Gaussian Processes for Informative Exploration in Reinforcement Learning
Chung, Jen Jen
Lawrance, Nicholas R. J.
Sukkarieh, Salah
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 2633 - 2639

← 1 2 3 4 5 →