Control-Theoretic Analysis of Smoothness for Stability-Certified Reinforcement Learning

被引:0
|
作者
Jin, Ming [1 ,2 ]
Lavaei, Javad [1 ,2 ]
机构
[1] Univ Calif Berkeley, Dept Ind Engn & Operat Res, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Tsinghua Berkeley Shenzhen Inst, Berkeley, CA 94720 USA
关键词
SYSTEMS; SAFE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is critical to obtain stability certificate before deploying reinforcement learning in real-world mission-critical systems. This study justifies the intuition that smoothness (i.e., small changes in inputs lead to small changes in outputs) is an important property for stability-certified reinforcement learning from a control-theoretic perspective. The smoothness margin can be obtained by solving a feasibility problem based on semi-definite programming for both linear and nonlinear dynamical systems, and it does not need to access the exact parameters of the learned controllers. Numerical evaluation on nonlinear and decentralized frequency control for large-scale power grids demonstrates that the smoothness margin can certify stability during both exploration and deployment for (deep) neural-network policies, which substantially surpass nominal controllers in performance. The study opens up new opportunities for robust Lipschitz continuous policy learning.
引用
收藏
页码:6840 / 6847
页数:8
相关论文
共 50 条
  • [1] Stability-Certified Reinforcement Learning: A Control-Theoretic Perspective
    Jin, Ming
    Lavaei, Javad
    [J]. IEEE ACCESS, 2020, 8 : 229086 - 229100
  • [2] Stability-Certified Reinforcement Learning: A Control-Theoretic Perspective
    [J]. Jin, Ming (jinming@vt.edu), 1600, Institute of Electrical and Electronics Engineers Inc. (08):
  • [3] Stability-certified reinforcement learning control via spectral normalization
    Takase, Ryoichi
    Yoshikawa, Nobuyuki
    Mariyama, Toshisada
    Tsuchiya, Takeshi
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2022, 10
  • [4] Estimating Bounded Uncertain Model for Stability-Certified Reinforcement Learning
    Takase, Ryoichi
    Yoshikawa, Nobuyuki
    Tsuchiya, Takeshi
    [J]. 2022 61ST ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS (SICE), 2022, : 481 - 487
  • [5] Learning-Based, Safety and Stability-Certified Microgrid Control
    Wang, Lizhi
    Zhang, Songyuan
    Zhou, Yifan
    Fan, Chuchu
    Zhang, Peng
    Shamash, Yacov A.
    [J]. 2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
  • [6] Stability Analysis of a Control-Theoretic Work System Model
    Berger, Christoph
    Fetzer, Alexander
    Stykel, Tatjana
    Braunreuther, Stefan
    Reinhart, Gunther
    [J]. 11TH CIRP CONFERENCE ON INDUSTRIAL PRODUCT-SERVICE SYSTEMS, 2019, 83 : 642 - 648
  • [7] AN ATTEMPT AT A CONTROL-THEORETIC SOLUTION TO STABILITY PROBLEMS IN MANAGEMENT
    ADEAGBOSHEIKH, AG
    [J]. KYBERNETES, 1987, 16 (03) : 161 - 167
  • [8] Structural analysis in biology: A control-theoretic approach
    Blanchini, Franco
    Giordano, Giulia
    [J]. AUTOMATICA, 2021, 126
  • [9] Modeling, calibration, and control-theoretic analysis of the GMAW process
    Moore, KL
    Yender, R
    Tyler, J
    Naidu, DS
    [J]. PROCEEDINGS OF THE 1998 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 1998, : 1747 - 1751
  • [10] FURTHER RESULTS ON THE CONTROL-THEORETIC ANALYSIS OF THE LMS ALGORITHM
    KWONG, CP
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (02) : 943 - 946