Actor-Critic Physics-Informed Neural Lyapunov Control

被引:0
|
作者
Wang, Jiarui [1 ]
Fazlyab, Mahyar [2 ]
机构
[1] Johns Hopkins Univ, Comp Sci Dept, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Elect & Comp Engn Dept, Baltimore, MD 21218 USA
来源
关键词
Lyapunov methods; stability of nonlinear systems; neural networks;
D O I
10.1109/LCSYS.2024.3416235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Designing control policies for stabilization tasks with provable guarantees is a long-standing problem in nonlinear control. A crucial performance metric is the size of the resulting region of attraction, which essentially serves as a robustness "margin" of the closed-loop system against uncertainties. In this letter, we propose a new method to train a stabilizing neural network controller along with its corresponding Lyapunov certificate, aiming to maximize the resulting region of attraction while respecting the actuation constraints. Crucial to our approach is the use of Zubov's Partial Differential Equation (PDE), which precisely characterizes the true region of attraction of a given control policy. Our framework follows an actor-critic pattern where we alternate between improving the control policy (actor) and learning a Zubov function (critic). Finally, we compute the largest certifiable region of attraction by invoking an SMT solver after the training procedure. Our numerical experiments on several design problems show consistent and significant improvements in the size of the resulting region of attraction.
引用
收藏
页码:1751 / 1756
页数:6
相关论文
共 50 条
  • [11] Structure-Preserving Physics-Informed Neural Networks With Energy or Lyapunov Structure
    Chu, Haoyu
    Miyatake, Yuto
    Cui, Wenjun
    Wei, Shikui
    Furihata, Daisuke
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 3872 - 3880
  • [12] Automatic Generation Control Based on Multiple Neural Networks With Actor-Critic Strategy
    Xi, Lei
    Wu, Junnan
    Xu, Yanchun
    Sun, Hongbin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2483 - 2493
  • [13] An Actor-Critic Algorithm With Second-Order Actor and Critic
    Wang, Jing
    Paschalidis, Ioannis Ch.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (06) : 2689 - 2703
  • [14] TAAC: Temporally Abstract Actor-Critic for Continuous Control
    Yu, Haonan
    Xu, Wei
    Zhang, Haichao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [15] Efficient Model Learning Methods for Actor-Critic Control
    Grondman, Ivo
    Vaandrager, Maarten
    Busoniu, Lucian
    Babuska, Robert
    Schuitema, Erik
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602
  • [16] Actor-Critic Reinforcement Learning for Tracking Control in Robotics
    Pane, Yudha P.
    Nageshrao, Subramanya P.
    Babuska, Robert
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 5819 - 5826
  • [17] A supervised Actor-Critic approach for adaptive cruise control
    Zhao, Dongbin
    Wang, Bin
    Liu, Derong
    SOFT COMPUTING, 2013, 17 (11) : 2089 - 2099
  • [18] Deep Lyapunov-Based Physics-Informed Neural Networks (DeLb-PINN) for Adaptive Control Design
    Hart, Rebecca G.
    Patil, Omkar Sudhir
    Griffis, Emily J.
    Dixon, Warren E.
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1511 - 1516
  • [19] Discrete-Time Lyapunov based Kinematic Control of Robot Manipulator using Actor-Critic Framework
    Kamboj, Ankur
    Prakash, Ravi
    Mohanta, Jayant Kumar
    Behera, Laxmidhar
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [20] Enforcing Dirichlet boundary conditions in physics-informed neural networks and variational physics-informed neural networks
    Berrone, S.
    Canuto, C.
    Pintore, M.
    Sukumar, N.
    HELIYON, 2023, 9 (08)