Safe Exploration for Active Learning with Gaussian Processes

被引：41

作者：

Schreiter, Jens ^{[1
]}

Duy Nguyen-Tuong ^{[1
]}

Eberts, Mona ^{[1
]}

Bischoff, Bastian ^{[1
]}

Markert, Heiner ^{[1
]}

Toussaint, Marc ^{[2
]}

机构：

[1] Robert Bosch GmbH, D-70442 Stuttgart, Germany

[2] Univ Stuttgart, MLR Lab, D-70569 Stuttgart, Germany

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III | 2015年 / 9286卷

关键词：

APPROXIMATIONS;

D O I：

10.1007/978-3-319-23461-8_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.

引用

页码：133 / 149

页数：17

共 50 条

[31] Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Jung, Tobias
Stone, Peter
[J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6321 : 601 - 616
[32] Decentralized Multi-Agent Exploration with Online-Learning of Gaussian Processes
Viseras, Alberto
Wiedemann, Thomas
Manss, Christoph
Magel, Lukas
Mueller, Joachim
Shutin, Dmitriy
Merino, Luis
[J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 4222 - 4229
[33] Spatially Adaptive Classification and Active Learning of Multispectral Data with Gaussian Processes
Jun, Goo
Vatsavai, Ranga Raju
Ghosh, Joydeep
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 597 - +
[34] Active learning of Gaussian processes with manifold-preserving graph reduction
Jin Zhou
Shiliang Sun
[J]. Neural Computing and Applications, 2014, 25 : 1615 - 1625
[35] Active Online Learning for Interactive Segmentation Using Sparse Gaussian Processes
Triebel, Rudolph
Stuehmer, Jan
Souiai, Mohamed
Cremers, Daniel
[J]. PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 641 - 652
[36] Active learning of Gaussian processes with manifold-preserving graph reduction
Zhou, Jin
Sun, Shiliang
[J]. NEURAL COMPUTING & APPLICATIONS, 2014, 25 (7-8): : 1615 - 1625
[37] Safe Active Dynamics Learning and Control: A Sequential Exploration-Exploitation Framework
Lew, Thomas
Sharma, Apoorva
Harrison, James
Bylard, Andrew
Pavone, Marco
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (05) : 2888 - 2907
[38] Bayesian Active Learning for Scanning Probe Microscopy: From Gaussian Processes to Hypothesis Learning
Ziatdinov, Maxim
Liu, Yongtao
Kelley, Kyle
Vasudevan, Rama
V. Kalinin, Sergei
[J]. ACS NANO, 2022, 16 (09) : 13492 - 13512
[39] Safe Controller Optimization for Quadrotors with Gaussian Processes
Berkenkamp, Felix
Schoellig, Angela P.
Krause, Andreas
[J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 491 - 496
[40] Stagewise Safe Bayesian Optimization with Gaussian Processes
Sui, Yanan
Zhuang, Vincent
Burdick, Joel W.
Yue, Yisong
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80

← 1 2 3 4 5 →