Learning control lyapunov functions from counterexamples and demonstrations

被引:35
|
作者
Ravanbakhsh, Hadi [1 ]
Sankaranarayanan, Sriram [1 ]
机构
[1] Univ Colorado, Comp Sci, Boulder, CO 80309 USA
关键词
Lyapunov functions; Controller synthesis; Learning from demonstrations; Concept learning; CUTTING PLANE ALGORITHM; SYSTEMS; OPTIMIZATION; STABILIZATION; INEQUALITY; STABILITY;
D O I
10.1007/s10514-018-9791-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a technique for learning control Lyapunov-like functions, which are used in turn to synthesize controllers for nonlinear dynamical systems that can stabilize the system, or satisfy specifications such as remaining inside a safe set, or eventually reaching a target set while remaining inside a safe set. The learning framework uses a demonstrator that implements a black-box, untrusted strategy presumed to solve the problem of interest, a learner that poses finitely many queries to the demonstrator to infer a candidate function, and a verifier that checks whether the current candidate is a valid control Lyapunov-like function. The overall learning framework is iterative, eliminating a set of candidates on each iteration using the counterexamples discovered by the verifier and the demonstrations over these counterexamples. We prove its convergence using ellipsoidal approximation techniques from convex optimization. We also implement this scheme using nonlinear MPC controllers to serve as demonstrators for a set of state and trajectory stabilization problems for nonlinear dynamical systems. We show how the verifier can be constructed efficiently using convex relaxations of the verification problem for polynomial systems to semi-definite programming problem instances. Our approach is able to synthesize relatively simple polynomial control Lyapunov-like functions, and in that process replace the MPC using a guaranteed and computationally less expensive controller.
引用
收藏
页码:275 / 307
页数:33
相关论文
共 50 条
  • [1] Learning control lyapunov functions from counterexamples and demonstrations
    Hadi Ravanbakhsh
    Sriram Sankaranarayanan
    [J]. Autonomous Robots, 2019, 43 : 275 - 307
  • [2] Learning Lyapunov (Potential) Functions from Counterexamples and Demonstrations
    Ravanbakhsh, Hadi
    Sankaranarayanan, Sriram
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
  • [3] Learning fixed-complexity polyhedral Lyapunov functions from counterexamples
    Berger, Guillaume O.
    Sankaranarayanan, Sriram
    [J]. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3250 - 3255
  • [4] Learning Control Barrier Functions from Expert Demonstrations
    Robey, Alexander
    Hu, Haimin
    Lindemann, Lars
    Zhang, Hanwen
    Dimarogonas, Dimos, V
    Tu, Stephen
    Matni, Nikolai
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3717 - 3724
  • [5] Safe Learning for Control using Control Lyapunov Functions and Control Barrier Functions: A Review
    Anand, Akhil
    Seel, Katrine
    Gjaerum, Vilde
    Hakansson, Anne
    Robinson, Haakon
    Saad, Aya
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 3987 - 3997
  • [6] Damping control by fusion of reinforcement learning and control Lyapunov functions
    Glavic, Mevludin
    Ernst, Damien
    Wehenkel, Louis
    [J]. 2006 38TH ANNUAL NORTH AMERICAN POWER SYMPOSIUM, NAPS-2006 PROCEEDINGS, 2006, : 361 - +
  • [7] Learning Traffic Signal Control from Demonstrations
    Xiong, Yuanhao
    Zheng, Guanjie
    Xu, Kai
    Li, Zhenhui
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2289 - 2292
  • [8] Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems
    Taylor, Andrew J.
    Dorobantu, Victor D.
    Le, Hoang M.
    Yue, Yisong
    Ames, Aaron D.
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6878 - 6884
  • [9] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
    Du, Desong
    Han, Shaohang
    Qi, Naiming
    Ammar, Haitham Bou
    Wang, Jun
    Pan, Wei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
  • [10] Redundant robot control with learning from expert demonstrations
    Ramirez, Jorge
    Yu, Wen
    [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 715 - 720