FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

被引:3
|
作者
Sun, Chuangchuang [1 ]
Kim, Dong-Ki [1 ]
How, Jonathan P. [1 ]
机构
[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年
关键词
ALGORITHMS;
D O I
10.1109/ICRA48506.2021.9561147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.
引用
收藏
页码:10617 / 10624
页数:8
相关论文
共 50 条
  • [31] A Novel Fingerprint Recovery Scheme using Deep Neural Network-based Learning
    Lee, Samuel
    Jang, Seok-Woo
    Kim, Dongho
    Hahn, Hernsoo
    Kim, Gye-Young
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (26-27) : 34121 - 34135
  • [32] Integration of Neural Network-Based Symbolic Regression in Deep Learning for Scientific Discovery
    Kim, Samuel
    Lu, Peter Y.
    Mukherjee, Srijon
    Gilbert, Michael
    Jing, Li
    Ceperic, Vladimir
    Soljacic, Marin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (09) : 4166 - 4177
  • [33] Accelerating the Deep Reinforcement Learning with Neural Network Compression
    Zhang, Hongjie
    He, Zhuocheng
    Li, Jing
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [34] Deep neural network pruning method based on sensitive layers and reinforcement learning
    Yang, Wenchuan
    Yu, Haoran
    Cui, Baojiang
    Sui, Runqi
    Gu, Tianyu
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 1897 - 1917
  • [35] Autonomous Navigation with Improved Hierarchical Neural Network Based on Deep Reinforcement Learning
    Zhang, Haiying
    Qiu, Tenghai
    Li, Shuxiao
    Zhu, Chengfei
    Lan, Xiaosong
    Chang, Hongxing
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 4715 - 4720
  • [36] Deep neural network pruning method based on sensitive layers and reinforcement learning
    Wenchuan Yang
    Haoran Yu
    Baojiang Cui
    Runqi Sui
    Tianyu Gu
    Artificial Intelligence Review, 2023, 56 : 1897 - 1917
  • [37] A deep neural network-based transfer learning to enhance the performance and learning speed of BCI systems
    Dehghani, Maryam
    Mobaien, Ali
    Boostani, Reza
    BRAIN-COMPUTER INTERFACES, 2021, 8 (1-2) : 14 - 25
  • [38] A novel deep neural network-based technique for network embedding
    Benbatata, Sabrina
    Saoud, Bilal
    Shayea, Ibraheem
    Alsharabi, Naif
    Alhammadi, Abdulraqeb
    Alferaidi, Ali
    Jadi, Amr
    Daradkeh, Yousef Ibrahim
    PEERJ COMPUTER SCIENCE, 2024, 10 : 1 - 29
  • [39] Neural Network-Based Limiter with Transfer Learning
    Abgrall, Remi
    Han Veiga, Maria
    COMMUNICATIONS ON APPLIED MATHEMATICS AND COMPUTATION, 2020,
  • [40] Application of deep neural network and deep reinforcement learning in wireless communication
    Li, Ming
    Li, Hui
    PLOS ONE, 2020, 15 (07):