Polytopic Trees for Verification of Learning-Based Controllers

Cited by: 2
Authors
Sadraddini, Sadra [1]
Shen, Shen [1]
Bastani, Osbert [2]
Affiliations
[1] MIT, Cambridge, MA 02139 USA
[2] Univ Penn, Philadelphia, PA 19104 USA
DOI
10.1007/978-3-030-28423-7_8
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Reinforcement learning is increasingly used to synthesize controllers for a broad range of applications. However, formal guarantees on the behavior of learning-based controllers are elusive due to the black-box nature of machine learning models such as neural networks. In this paper, we propose an algorithm for verifying learning-based controllers (in particular, deep neural networks with ReLU activations, and decision trees with linear decisions and leaf values) for deterministic, piecewise affine (PWA) dynamical systems. In this setting, our algorithm computes the safe (resp., unsafe) region of the state space, i.e., the region of the state space on which the learned controller is guaranteed to satisfy (resp., fail to satisfy) a given reach-avoid specification. Knowing the safe and unsafe regions is substantially more informative than the boolean characterization of safety (i.e., safe or unsafe) provided by standard verification algorithms; for example, this knowledge can be used to compose controllers that are safe on different portions of the state space. At a high level, our algorithm uses convex programming to iteratively compute new regions (in the form of polytopes) that are guaranteed to be entirely safe or entirely unsafe. Then, it connects these polytopic regions together in a tree-like fashion. We conclude with an illustrative example on controlling a hybrid model of a contact-based robotics problem.
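The tree construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes each candidate polytope is small enough that the closed loop (PWA dynamics plus a ReLU or decision-tree controller) reduces to a single affine map on it, it replaces the convex programs with a vertex check on hand-picked candidates, and it covers only the "reach" half of a reach-avoid specification. All function names and numbers below are hypothetical.

```python
# Hypothetical sketch: names, maps, and sets are illustrative assumptions,
# not taken from the paper.

def affine_map(A, b, x):
    """Return A x + b for a matrix A (list of rows), vector b, and point x."""
    return [sum(a * xj for a, xj in zip(row, x)) + bi for row, bi in zip(A, b)]

def in_polytope(H, h, x, tol=1e-9):
    """Check the halfspace description H x <= h row by row."""
    return all(sum(a * xj for a, xj in zip(row, x)) <= hi + tol
               for row, hi in zip(H, h))

def maps_into(vertices, A, b, H, h):
    """True if the affine image of conv(vertices) lies in {x : H x <= h}.

    Checking vertices suffices: affine maps preserve convex combinations
    and the target polytope is convex.
    """
    return all(in_polytope(H, h, affine_map(A, b, v)) for v in vertices)

def grow_tree(goal, candidates):
    """Attach each candidate to a tree node that its one-step image lands in."""
    tree = [{"name": "goal", "H": goal[0], "h": goal[1], "parent": None}]
    pending = list(candidates)
    progress = True
    while pending and progress:
        progress = False
        for cand in list(pending):
            for node in tree:
                if maps_into(cand["vertices"], cand["A"], cand["b"],
                             node["H"], node["h"]):
                    tree.append({"name": cand["name"], "H": cand["H"],
                                 "h": cand["h"], "parent": node["name"]})
                    pending.remove(cand)
                    progress = True
                    break
    return tree

# Assumed 1-D closed loop x+ = 0.5 x; intervals written as H x <= h.
H = [[1.0], [-1.0]]
goal = (H, [1.0, 1.0])  # goal set [-1, 1]
candidates = [
    {"name": "P2", "vertices": [[2.0], [4.0]], "A": [[0.5]], "b": [0.0],
     "H": H, "h": [4.0, -2.0]},  # interval [2, 4]
    {"name": "P1", "vertices": [[1.0], [2.0]], "A": [[0.5]], "b": [0.0],
     "H": H, "h": [2.0, -1.0]},  # interval [1, 2]
]
tree = grow_tree(goal, candidates)
for node in tree:
    print(node["name"], "->", node["parent"])
```

In this toy case P1 maps into the goal and P2 maps into P1, so every state in either interval reaches the goal by following the tree edges. A fuller treatment would also need to check the "avoid" constraint and would compute the polytopes rather than take them as given.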
Pages: 110-127
Page count: 18