Polytopic Trees for Verification of Learning-Based Controllers

被引:2
|
作者
Sadraddini, Sadra [1 ]
Shen, Shen [1 ]
Bastani, Osbert [2 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Univ Penn, Philadelphia, PA 19104 USA
来源
关键词
D O I
10.1007/978-3-030-28423-7_8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Reinforcement learning is increasingly used to synthesize controllers for a broad range of applications. However, formal guarantees on the behavior of learning-based controllers are elusive due to the black-box nature of machine learning models such as neural networks. In this paper, we propose an algorithm for verifying learning-based controllers-in particular, deep neural networks with ReLU activations, and decision trees with linear decisions and leaf values-for deterministic, piecewise affine (PWA) dynamical systems. In this setting, our algorithm computes the safe (resp., unsafe) region of the state space-i.e., the region of the state space on which the learned controller is guaranteed to satisfy (resp., fail to satisfy) a given reach-avoid specification. Knowing the safe and unsafe regions is substantially more informative than the boolean characterization of safety (i.e., safe or unsafe) provided by standard verification algorithms-for example, this knowledge can be used to compose controllers that are safe on different portions of the state space. At a high level, our algorithm uses convex programming to iteratively compute new regions (in the form of polytopes) that are guaranteed to be entirely safe or entirely unsafe. Then, it connects these polytopic regions together in a tree-like fashion. We conclude with an illustrative example on controlling a hybrid model of a contact-based robotics problem.
引用
收藏
页码:110 / 127
页数:18
相关论文
共 50 条
  • [1] Learning-Based Synthesis of Safety Controllers
    Neider, Daniel
    Markgraf, Oliver
    [J]. 2019 FORMAL METHODS IN COMPUTER AIDED DESIGN (FMCAD), 2019, : 120 - 128
  • [2] Sparse Kernel Learning-based Nonlinear Predictive Controllers
    Liu, Yi
    Wang, Wei
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2011, 5 (02): : 195 - 201
  • [3] A Learning-based Framework for Automatic Parameterized Verification
    Li, Yongjian
    Cao, Jialun
    Pang, Jun
    [J]. 2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, : 450 - 459
  • [4] PAC Learning-Based Verification and Model Synthesis
    Chen, Yu-Fang
    Hsieh, Chiao
    Lengal, Ondrej
    Lii, Tsung-Ju
    Tsai, Ming-Hsien
    Wang, Bow-Yaw
    Wang, Farn
    [J]. 2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 714 - 724
  • [5] Polytopic Input Constraints in Learning-Based Optimal Control Using Neural Networks
    Markolf, Lukas
    Stursberg, Olaf
    [J]. 2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 1018 - 1023
  • [6] Towards nominal stability certification of deep learning-based controllers
    Hoang Hai Nguyen
    Matschek, Janine
    Zieger, Tim
    Savchenko, Anton
    Noroozi, Navid
    Findeisen, Rolf
    [J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3886 - 3891
  • [7] Learning-Based Compositional Verification for Synchronous Probabilistic Systems
    Feng, Lu
    Han, Tingting
    Kwiatkowska, Marta
    Parker, David
    [J]. AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, 2011, 6996 : 511 - 521
  • [8] Analyzing Learning-Based Networked Systems with Formal Verification
    Dethise, Arnaud
    Canini, Marco
    Narodytska, Nina
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
  • [9] Learning-Based Assume-Guarantee Regression Verification
    He, Fei
    Mao, Shu
    Wang, Bow-Yaw
    [J]. COMPUTER AIDED VERIFICATION, (CAV 2016), PT I, 2016, 9779 : 310 - 328
  • [10] KNOWLEDGE-BASED SYSTEMS VERIFICATION - A MACHINE LEARNING-BASED APPROACH
    LOUNIS, H
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 1995, 8 (03) : 381 - 389