Actor-Critic Physics-Informed Neural Lyapunov Control

被引:0
|
作者
Wang, Jiarui [1 ]
Fazlyab, Mahyar [2 ]
机构
[1] Johns Hopkins Univ, Comp Sci Dept, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Elect & Comp Engn Dept, Baltimore, MD 21218 USA
来源
关键词
Lyapunov methods; stability of nonlinear systems; neural networks;
D O I
10.1109/LCSYS.2024.3416235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Designing control policies for stabilization tasks with provable guarantees is a long-standing problem in nonlinear control. A crucial performance metric is the size of the resulting region of attraction, which essentially serves as a robustness "margin" of the closed-loop system against uncertainties. In this letter, we propose a new method to train a stabilizing neural network controller along with its corresponding Lyapunov certificate, aiming to maximize the resulting region of attraction while respecting the actuation constraints. Crucial to our approach is the use of Zubov's Partial Differential Equation (PDE), which precisely characterizes the true region of attraction of a given control policy. Our framework follows an actor-critic pattern where we alternate between improving the control policy (actor) and learning a Zubov function (critic). Finally, we compute the largest certifiable region of attraction by invoking an SMT solver after the training procedure. Our numerical experiments on several design problems show consistent and significant improvements in the size of the resulting region of attraction.
引用
收藏
页码:1751 / 1756
页数:6
相关论文
共 50 条
  • [1] Actor-Critic Model Predictive Control
    Romero, Angel
    Song, Yunlong
    Scaramuzza, Davide
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 14777 - 14784
  • [2] Physics-Informed Neural Networks for Quantum Control
    Norambuena, Ariel
    Mattheakis, Marios
    Gonzalez, Francisco J.
    Coto, Raul
    PHYSICAL REVIEW LETTERS, 2024, 132 (01)
  • [3] Actor-critic algorithms
    Konda, VR
    Tsitsiklis, JN
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 1008 - 1014
  • [4] On actor-critic algorithms
    Konda, VR
    Tsitsiklis, JN
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 42 (04) : 1143 - 1166
  • [5] Natural Actor-Critic
    Peters, Jan
    Schaal, Stefan
    NEUROCOMPUTING, 2008, 71 (7-9) : 1180 - 1190
  • [6] Natural Actor-Critic
    Peters, J
    Vijayakumar, S
    Schaal, S
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 280 - 291
  • [7] An actor-critic learning framework based on Lyapunov stability for automatic assembly
    Li, Xinwang
    Xiao, Juliang
    Cheng, Yu
    Liu, Haitao
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4801 - 4812
  • [8] Physics-informed neural nets for control of dynamical systems
    Antonelo, Eric Aislan
    Camponogara, Eduardo
    Seman, Laio Oriel
    Jordanou, Jean Panaioti
    Souza, Eduardo Rehbein de
    Huebner, Jomi Fred
    NEUROCOMPUTING, 2024, 579
  • [9] An actor-critic learning framework based on Lyapunov stability for automatic assembly
    Xinwang Li
    Juliang Xiao
    Yu Cheng
    Haitao Liu
    Applied Intelligence, 2023, 53 : 4801 - 4812
  • [10] Physics-informed neural network Lyapunov functions: PDE characterization, learning, and verification☆
    Liu, Jun
    Meng, Yiming
    Fitzsimmons, Maxwell
    Zhou, Ruikun
    AUTOMATICA, 2025, 175