Learning control lyapunov functions from counterexamples and demonstrations

被引:35
|
作者
Ravanbakhsh, Hadi [1 ]
Sankaranarayanan, Sriram [1 ]
机构
[1] Univ Colorado, Comp Sci, Boulder, CO 80309 USA
关键词
Lyapunov functions; Controller synthesis; Learning from demonstrations; Concept learning; CUTTING PLANE ALGORITHM; SYSTEMS; OPTIMIZATION; STABILIZATION; INEQUALITY; STABILITY;
D O I
10.1007/s10514-018-9791-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a technique for learning control Lyapunov-like functions, which are used in turn to synthesize controllers for nonlinear dynamical systems that can stabilize the system, or satisfy specifications such as remaining inside a safe set, or eventually reaching a target set while remaining inside a safe set. The learning framework uses a demonstrator that implements a black-box, untrusted strategy presumed to solve the problem of interest, a learner that poses finitely many queries to the demonstrator to infer a candidate function, and a verifier that checks whether the current candidate is a valid control Lyapunov-like function. The overall learning framework is iterative, eliminating a set of candidates on each iteration using the counterexamples discovered by the verifier and the demonstrations over these counterexamples. We prove its convergence using ellipsoidal approximation techniques from convex optimization. We also implement this scheme using nonlinear MPC controllers to serve as demonstrators for a set of state and trajectory stabilization problems for nonlinear dynamical systems. We show how the verifier can be constructed efficiently using convex relaxations of the verification problem for polynomial systems to semi-definite programming problem instances. Our approach is able to synthesize relatively simple polynomial control Lyapunov-like functions, and in that process replace the MPC using a guaranteed and computationally less expensive controller.
引用
收藏
页码:275 / 307
页数:33
相关论文
共 50 条
  • [21] Learning Camera Control in Dynamic Scenes from Limited Demonstrations
    Hanocka, R.
    Assa, J.
    Cohen-Or, D.
    Giryes, R.
    [J]. COMPUTER GRAPHICS FORUM, 2022, 41 (01) : 427 - 437
  • [22] Control Lyapunov functions: New ideas from an old source
    Freeman, RA
    Primbs, JA
    [J]. PROCEEDINGS OF THE 35TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1996, : 3926 - 3931
  • [23] Learning Lyapunov Functions for Hybrid Systems
    Chen, Shaoru
    Fazlyab, Mahyar
    Morari, Manfred
    Pappas, George J.
    Preciado, Victor M.
    [J]. HSCC2021: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS-IOT WEEK), 2021,
  • [24] Learning Lyapunov Functions for Hybrid Systems
    Chen, Shaoru
    Fazlyab, Mahyar
    Morari, Manfred
    Pappas, George J.
    Preciado, Victor M.
    [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [25] Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions
    Choi, Jason
    Castaneda, Fernando
    Tomlin, Claire J.
    Sreenath, Koushil
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XVI, 2020,
  • [26] Uniting Control Lyapunov and Control Barrier Functions
    Romdlony, Muhammad Zakiyullah
    Jayawardhana, Bayu
    [J]. 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 2293 - 2298
  • [27] Control Lyapunov functions for switched control systems
    Sun, HF
    Zhao, J
    [J]. PROCEEDINGS OF THE 2001 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2001, : 1890 - 1891
  • [28] Nonlinear Control System with Reinforcement Learning and Neural Networks Based Lyapunov Functions
    Rego, Rosana Cibely Batista
    Araujo, Fabio Meneghetti Ugulino de
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2021, 19 (08) : 1253 - 1260
  • [29] Learning potential functions from human demonstrations with encapsulated dynamic and compliant behaviors
    Seyed Mohammad Khansari-Zadeh
    Oussama Khatib
    [J]. Autonomous Robots, 2017, 41 : 45 - 69
  • [30] Learning potential functions from human demonstrations with encapsulated dynamic and compliant behaviors
    Khansari-Zadeh, Seyed Mohammad
    Khatib, Oussama
    [J]. AUTONOMOUS ROBOTS, 2017, 41 (01) : 45 - 69