FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

被引:3
|
作者
Sun, Chuangchuang [1 ]
Kim, Dong-Ki [1 ]
How, Jonathan P. [1 ]
机构
[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年
关键词
ALGORITHMS;
D O I
10.1109/ICRA48506.2021.9561147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.
引用
收藏
页码:10617 / 10624
页数:8
相关论文
共 50 条
  • [41] Neural Network-Based Limiter with Transfer Learning
    Rémi Abgrall
    Maria Han Veiga
    Communications on Applied Mathematics and Computation, 2023, 5 (2) : 532 - 572
  • [42] Neural Network-Based Limiter with Transfer Learning
    Abgrall, Remi
    Han Veiga, Maria
    COMMUNICATIONS ON APPLIED MATHEMATICS AND COMPUTATION, 2023, 5 (02) : 532 - 572
  • [43] Seismic waveform inversion using a neural network-based forward
    Fu, Hongsun
    Zhang, Yan
    Ma, Mingyue
    SECOND INTERNATIONAL CONFERENCE ON PHYSICS, MATHEMATICS AND STATISTICS, 2019, 1324
  • [44] Graph Convolutional Network-Based Topology Embedded Deep Reinforcement Learning for Voltage Stability Control
    Hossain, Ramij R.
    Huang, Qiuhua
    Huang, Renke
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (05) : 4848 - 4851
  • [45] Deep Convolutional Neural Network Assisted Reinforcement Learning Based Mobile Network Power Saving
    Wu, Shangbin
    Wang, Yue
    Bai, Lu
    IEEE ACCESS, 2020, 8 (08): : 93671 - 93681
  • [46] A Neural Network based Deep Reinforcement Learning Controller for Voltage Regulation of Active Distribution Network
    Jain, Jatin
    Mohamed, Ahmed
    Rahman, Tanvir
    Ali, Mohamed
    2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0280 - 0285
  • [47] Performance enhancement of the artificial neural network-based reinforcement learning for wind turbine yaw control
    Saenz-Aguirre, Aitor
    Zulueta, Ekaitz
    Fernandez-Gamiz, Unai
    Ulazia, Alain
    Teso-Fz-Betono, Daniel
    WIND ENERGY, 2020, 23 (03) : 676 - 690
  • [48] Distributed Neural Network-based Policy Gradient Reinforcement Learning for Multi-Robot Formations
    Shang, Wen
    Sun, Dong
    2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 113 - +
  • [49] Political Optimizer with Probabilistic Neural Network-Based Arabic Comparative Opinion Mining
    Alotaibi, Najm
    Al-onazi, Badriyya B.
    Nour, Mohamed K.
    Mohamed, Abdullah
    Motwakel, Abdelwahed
    Mohammed, Gouse Pasha
    Yaseen, Ishfaq
    Rizwanullah, Mohammed
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 3121 - 3137
  • [50] Safe batch constrained deep reinforcement learning with generative adversarial network
    Dong, Wenbo
    Liu, Shaofan
    Sun, Shiliang
    INFORMATION SCIENCES, 2023, 634 : 259 - 270