FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

Cited by: 3
Authors
Sun, Chuangchuang [1]
Kim, Dong-Ki [1]
How, Jonathan P. [1]
Affiliations
[1] MIT, Laboratory for Information and Decision Systems (LIDS), 77 Massachusetts Ave, Cambridge, MA 02139, USA
Source
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021
Keywords
ALGORITHMS;
DOI
10.1109/ICRA48506.2021.9561147
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.
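The projection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it handles a single linear constraint and uses the ordinary Euclidean projection rather than the paper's newly designed metric, and the optimizer is plain gradient descent rather than a learned DNN. The names `project_halfspace`, `safe_step`, `alpha`, and `lr` are hypothetical. The assumption is that a constraint c(theta) <= 0 is treated as a Lyapunov-like function, so the update direction d must satisfy the linear condition grad_c(theta) . d <= -alpha * c(theta).

```python
def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def project_halfspace(d0, a, b):
    """Euclidean projection of d0 onto the half-space {d : a . d <= b}.

    Assumption: one constraint and the Euclidean metric; the paper projects
    onto a polytope of several constraints with its own metric.
    """
    violation = dot(a, d0) - b
    norm_sq = dot(a, a)
    # Already feasible, or a == 0 (no direction to correct along): keep d0.
    if violation <= 0.0 or norm_sq == 0.0:
        return list(d0)
    scale = violation / norm_sq
    return [di - scale * ai for di, ai in zip(d0, a)]


def safe_step(theta, grad_f, c, grad_c, alpha=1.0, lr=0.1):
    """One hypothetical update: descend on f, then project the direction so
    that the first-order change of c satisfies grad_c . d <= -alpha * c."""
    d0 = [-g for g in grad_f(theta)]   # unconstrained descent direction
    a = grad_c(theta)                  # normal of the linear constraint
    b = -alpha * c(theta)              # Lyapunov-style decrease bound
    d = project_halfspace(d0, a, b)
    return [t + lr * di for t, di in zip(theta, d)]
```

For example, with c(theta) = theta[0]**2 + theta[1]**2 - 1 (a unit-disc safe set) and an objective whose minimizer lies outside the disc, repeated calls to `safe_step` with a small `lr` push theta toward the minimizer while the projected direction keeps the linearized constraint from increasing at the boundary. The guarantee here is only first-order; the paper's forward-invariance result concerns its own formulation.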
Pages: 10617-10624
Page count: 8