Learning control lyapunov functions from counterexamples and demonstrations

被引：35

作者：

Ravanbakhsh, Hadi ^{[1
]}

Sankaranarayanan, Sriram ^{[1
]}

机构：

[1] Univ Colorado, Comp Sci, Boulder, CO 80309 USA

来源：

AUTONOMOUS ROBOTS | 2019年 / 43卷 / 02期

关键词：

Lyapunov functions; Controller synthesis; Learning from demonstrations; Concept learning; CUTTING PLANE ALGORITHM; SYSTEMS; OPTIMIZATION; STABILIZATION; INEQUALITY; STABILITY;

D O I：

10.1007/s10514-018-9791-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a technique for learning control Lyapunov-like functions, which are used in turn to synthesize controllers for nonlinear dynamical systems that can stabilize the system, or satisfy specifications such as remaining inside a safe set, or eventually reaching a target set while remaining inside a safe set. The learning framework uses a demonstrator that implements a black-box, untrusted strategy presumed to solve the problem of interest, a learner that poses finitely many queries to the demonstrator to infer a candidate function, and a verifier that checks whether the current candidate is a valid control Lyapunov-like function. The overall learning framework is iterative, eliminating a set of candidates on each iteration using the counterexamples discovered by the verifier and the demonstrations over these counterexamples. We prove its convergence using ellipsoidal approximation techniques from convex optimization. We also implement this scheme using nonlinear MPC controllers to serve as demonstrators for a set of state and trajectory stabilization problems for nonlinear dynamical systems. We show how the verifier can be constructed efficiently using convex relaxations of the verification problem for polynomial systems to semi-definite programming problem instances. Our approach is able to synthesize relatively simple polynomial control Lyapunov-like functions, and in that process replace the MPC using a guaranteed and computationally less expensive controller.

引用

页码：275 / 307

页数：33

共 50 条

[1] Learning control lyapunov functions from counterexamples and demonstrations
Hadi Ravanbakhsh
Sriram Sankaranarayanan
[J]. Autonomous Robots, 2019, 43 : 275 - 307
[2] Learning Lyapunov (Potential) Functions from Counterexamples and Demonstrations
Ravanbakhsh, Hadi
Sankaranarayanan, Sriram
[J]. ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
[3] Learning fixed-complexity polyhedral Lyapunov functions from counterexamples
Berger, Guillaume O.
Sankaranarayanan, Sriram
[J]. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3250 - 3255
[4] Learning Control Barrier Functions from Expert Demonstrations
Robey, Alexander
Hu, Haimin
Lindemann, Lars
Zhang, Hanwen
Dimarogonas, Dimos, V
Tu, Stephen
Matni, Nikolai
[J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3717 - 3724
[5] Safe Learning for Control using Control Lyapunov Functions and Control Barrier Functions: A Review
Anand, Akhil
Seel, Katrine
Gjaerum, Vilde
Hakansson, Anne
Robinson, Haakon
Saad, Aya
[J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 3987 - 3997
[6] Damping control by fusion of reinforcement learning and control Lyapunov functions
Glavic, Mevludin
Ernst, Damien
Wehenkel, Louis
[J]. 2006 38TH ANNUAL NORTH AMERICAN POWER SYMPOSIUM, NAPS-2006 PROCEEDINGS, 2006, : 361 - +
[7] Learning Traffic Signal Control from Demonstrations
Xiong, Yuanhao
Zheng, Guanjie
Xu, Kai
Li, Zhenhui
[J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2289 - 2292
[8] Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems
Taylor, Andrew J.
Dorobantu, Victor D.
Le, Hoang M.
Yue, Yisong
Ames, Aaron D.
[J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6878 - 6884
[9] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Du, Desong
Han, Shaohang
Qi, Naiming
Ammar, Haitham Bou
Wang, Jun
Pan, Wei
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
[10] Redundant robot control with learning from expert demonstrations
Ramirez, Jorge
Yu, Wen
[J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 715 - 720

← 1 2 3 4 5 →