Recurrent Neural Network Controllers Synthesis with Stability Guarantees for Partially Observed Systems

被引:0
|
作者
Gu, Fangda [1 ]
Yin, He [1 ]
El Ghaoui, Laurent [1 ]
Arcak, Murat [1 ]
Seiler, Peter [2 ]
Jin, Ming [3 ]
机构
[1] Univ Calif Berkeley, 2594 Hearst Ave, Berkeley, CA 94720 USA
[2] Univ Michigan, 500 S State St, Ann Arbor, MI 48109 USA
[3] Virginia Tech, 1185 Perry St 453 Whittemore 0111, Blacksburg, VA 24061 USA
关键词
REINFORCEMENT; OPTIMIZATION; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network controllers have become popular in control tasks thanks to their flexibility and expressivity. Stability is a crucial property for safety-critical dynamical systems, while stabilization of partially observed systems, in many cases, requires controllers to retain and process long-term memories of the past. We consider the important class of recurrent neural networks (RNN) as dynamic controllers for nonlinear uncertain partially-observed systems, and derive convex stability conditions based on integral quadratic constraints, S-lemma and sequential convexification. To ensure stability during the learning and control process, we propose a projected policy gradient method that iteratively enforces the stability conditions in the reparametrized space taking advantage of mild additional information on system dynamics. Numerical experiments show that our method learns stabilizing controllers while using fewer samples and achieving higher final performance compared with policy gradient.
引用
收藏
页码:5385 / 5394
页数:10
相关论文
共 50 条
  • [21] Learning Local Volt/Var Controllers Towards Efficient Network Operation with Stability Guarantees
    Cavraro, Guido
    Yuan, Zhenyi
    Singh, Manish K.
    Cortes, Jorge
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 5056 - 5061
  • [22] Neural Lyapunov Control of Unknown Nonlinear Systems with Stability Guarantees
    Zhou, Ruikun
    Quartz, Thanin
    De Sterck, Hans
    Liu, Jun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [23] Transient stability enhancement of power systems by Lyapunov-based recurrent neural networks UPFC controllers
    Chu, Chia-Chi
    Tsai, Hung-Chi
    Chang, Wei-Neng
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2008, E91A (09) : 2497 - 2506
  • [24] Regional stability conditions for recurrent neural network-based control systems
    La Bella, Alessio
    Farina, Marcello
    D'Amico, William
    Zaccarian, Luca
    AUTOMATICA, 2025, 174
  • [25] Data-based control design for nonlinear systems with recurrent neural network-based controllers
    D'Amico, William
    La Bella, Alessio
    Dercole, Fabio
    Farina, Marcello
    IFAC PAPERSONLINE, 2023, 56 (02): : 6235 - 6240
  • [26] Reduced-Order Neural Network Synthesis With Robustness Guarantees
    Drummond, Ross
    Turner, Matthew C.
    Duncan, Stephen R.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 1182 - 1191
  • [27] COMPACT MODEL SYNTHESIS FOR PARTIALLY OBSERVED OPERATIONAL SYSTEMS
    Dion, Jean-Luc
    Abid, Fatma
    Chevallier, Gael
    Festjens, Hugo
    Peyret, Nicolas
    Renaud, Franck
    Seifeddine, Moustafa
    Stephan, Cyrille
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2013, VOL 7B, 2014,
  • [28] Multilayer neural network controllers for multivariable dynamic systems
    Bahnasawi, AA
    Ibrahim, AG
    Hassan, MF
    Eid, SZ
    CONTROL AND COMPUTERS, 1997, 25 (01): : 12 - 20
  • [29] Verifying the Safety of Autonomous Systems with Neural Network Controllers
    Ivanov, Radoslav
    Carpenter, Taylor J.
    Weimer, James
    Alur, Rajeev
    Pappas, George J.
    Lee, Insup
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2021, 20 (01)
  • [30] Risk verification of stochastic systems with neural network controllers
    Cleaveland, Matthew
    Lindemann, Lars
    Ivanov, Radoslav
    Pappas, George J.
    ARTIFICIAL INTELLIGENCE, 2022, 313