Polytopic Trees for Verification of Learning-Based Controllers

被引：2

作者：

Sadraddini, Sadra ^{[1
]}

Shen, Shen ^{[1
]}

Bastani, Osbert ^{[2
]}

机构：

[1] MIT, Cambridge, MA 02139 USA

[2] Univ Penn, Philadelphia, PA 19104 USA

来源：

NUMERICAL SOFTWARE VERIFICATION | 2019年 / 11652卷

关键词：

D O I：

10.1007/978-3-030-28423-7_8

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Reinforcement learning is increasingly used to synthesize controllers for a broad range of applications. However, formal guarantees on the behavior of learning-based controllers are elusive due to the black-box nature of machine learning models such as neural networks. In this paper, we propose an algorithm for verifying learning-based controllers-in particular, deep neural networks with ReLU activations, and decision trees with linear decisions and leaf values-for deterministic, piecewise affine (PWA) dynamical systems. In this setting, our algorithm computes the safe (resp., unsafe) region of the state space-i.e., the region of the state space on which the learned controller is guaranteed to satisfy (resp., fail to satisfy) a given reach-avoid specification. Knowing the safe and unsafe regions is substantially more informative than the boolean characterization of safety (i.e., safe or unsafe) provided by standard verification algorithms-for example, this knowledge can be used to compose controllers that are safe on different portions of the state space. At a high level, our algorithm uses convex programming to iteratively compute new regions (in the form of polytopes) that are guaranteed to be entirely safe or entirely unsafe. Then, it connects these polytopic regions together in a tree-like fashion. We conclude with an illustrative example on controlling a hybrid model of a contact-based robotics problem.

引用

页码：110 / 127

页数：18

共 50 条

[1] Learning-Based Synthesis of Safety Controllers
Neider, Daniel
Markgraf, Oliver
[J]. 2019 FORMAL METHODS IN COMPUTER AIDED DESIGN (FMCAD), 2019, : 120 - 128
[2] Sparse Kernel Learning-based Nonlinear Predictive Controllers
Liu, Yi
Wang, Wei
[J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2011, 5 (02): : 195 - 201
[3] A Learning-based Framework for Automatic Parameterized Verification
Li, Yongjian
Cao, Jialun
Pang, Jun
[J]. 2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, : 450 - 459
[4] PAC Learning-Based Verification and Model Synthesis
Chen, Yu-Fang
Hsieh, Chiao
Lengal, Ondrej
Lii, Tsung-Ju
Tsai, Ming-Hsien
Wang, Bow-Yaw
Wang, Farn
[J]. 2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 714 - 724
[5] Polytopic Input Constraints in Learning-Based Optimal Control Using Neural Networks
Markolf, Lukas
Stursberg, Olaf
[J]. 2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 1018 - 1023
[6] Towards nominal stability certification of deep learning-based controllers
Hoang Hai Nguyen
Matschek, Janine
Zieger, Tim
Savchenko, Anton
Noroozi, Navid
Findeisen, Rolf
[J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3886 - 3891
[7] Learning-Based Compositional Verification for Synchronous Probabilistic Systems
Feng, Lu
Han, Tingting
Kwiatkowska, Marta
Parker, David
[J]. AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, 2011, 6996 : 511 - 521
[8] Analyzing Learning-Based Networked Systems with Formal Verification
Dethise, Arnaud
Canini, Marco
Narodytska, Nina
[J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
[9] Learning-Based Assume-Guarantee Regression Verification
He, Fei
Mao, Shu
Wang, Bow-Yaw
[J]. COMPUTER AIDED VERIFICATION, (CAV 2016), PT I, 2016, 9779 : 310 - 328
[10] KNOWLEDGE-BASED SYSTEMS VERIFICATION - A MACHINE LEARNING-BASED APPROACH
LOUNIS, H
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 1995, 8 (03) : 381 - 389

← 1 2 3 4 5 →