Safe Exploration for Active Learning with Gaussian Processes

被引:41
|
作者
Schreiter, Jens [1 ]
Duy Nguyen-Tuong [1 ]
Eberts, Mona [1 ]
Bischoff, Bastian [1 ]
Markert, Heiner [1 ]
Toussaint, Marc [2 ]
机构
[1] Robert Bosch GmbH, D-70442 Stuttgart, Germany
[2] Univ Stuttgart, MLR Lab, D-70569 Stuttgart, Germany
关键词
APPROXIMATIONS;
D O I
10.1007/978-3-319-23461-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.
引用
收藏
页码:133 / 149
页数:17
相关论文
共 50 条
  • [1] Safe Exploration for Optimization with Gaussian Processes
    Sui, Yanan
    Gotovos, Alkis
    Burdick, Joel W.
    Krause, Andreas
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 997 - 1005
  • [2] Safe Active Learning for Multi-Output Gaussian Processes
    Li, Cen-You
    Rakitsch, Barbara
    Zimmer, Christoph
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [3] Adaptive Exploration-Exploitation Active Learning of Gaussian Processes
    Kontoudis, George P.
    Otte, Michael
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9448 - 9455
  • [4] Benefits of Monotonicity in Safe Exploration with Gaussian Processes
    Losalka, Arpan
    Scarlett, Jonathan
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1304 - 1314
  • [5] Safe Active Learning for Time-Series Modeling with Gaussian Processes
    Zimmer, Christoph
    Meister, Mona
    Duy Nguyen-Tuong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
    Turchetta, Matteo
    Berkenkamp, Felix
    Krause, Andreas
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [7] Active Tactile Object Exploration with Gaussian Processes
    Yi, Zhengkun
    Calandra, Roberto
    Veiga, Filipe
    van Hoof, Herke
    Hermans, Tucker
    Zhang, Yilei
    Peters, Jan
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4925 - 4930
  • [8] Safe and Robust Learning Control with Gaussian Processes
    Berkenkamp, Felix
    Schoellig, Angela P.
    [J]. 2015 EUROPEAN CONTROL CONFERENCE (ECC), 2015, : 2496 - 2501
  • [9] Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes
    Wachi, Akifumi
    Sui, Yanan
    Yue, Yisong
    Ono, Masahiro
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6548 - 6555
  • [10] Gaussian Processes for Informative Exploration in Reinforcement Learning
    Chung, Jen Jen
    Lawrance, Nicholas R. J.
    Sukkarieh, Salah
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 2633 - 2639