FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

被引：3

作者：

Sun, Chuangchuang ^{[1
]}

Kim, Dong-Ki ^{[1
]}

How, Jonathan P. ^{[1
]}

机构：

[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年

关键词：

ALGORITHMS;

D O I：

10.1109/ICRA48506.2021.9561147

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.

引用

页码：10617 / 10624

页数：8

共 50 条

[1] A Deep Residual Shrinkage Neural Network-based Deep Reinforcement Learning Strategy in Financial Portfolio Management
Sun, Ruoyu
Jiang, Zhengyong
Su, Jionglong
2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 76 - 86
[2] Intelligent Caching with Graph Neural Network-Based Deep Reinforcement Learning on SDN-Based ICN
Hou, Jiacheng
Tao, Tianhao
Lu, Haoye
Nayak, Amiya
FUTURE INTERNET, 2023, 15 (08)
[3] CAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning
Li, Yan
Chang, Kenneth
Bel, Oceane
Miller, Ethan L.
Long, Darrell D. E.
SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
[4] DeepSL: Deep Neural Network-based Similarity Learning
Tourad M.C.
Abdelmounaim A.
Dhleima M.
Telmoud C.A.A.
Lachgar M.
International Journal of Advanced Computer Science and Applications, 2024, 15 (03): : 1394 - 1401
[5] DeepSL: Deep Neural Network-based Similarity Learning
Tourad, Mohamedou Cheikh
Abdelmounaim, Abdali
Dhleima, Mohamed
Telmoud, Cheikh Abdelkader Ahmed
Lachgar, Mohamed
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 1394 - 1401
[6] Deep Neural Network-Based Surrogate Model for Optimal Component Sizing of Power Converters Using Deep Reinforcement Learning
Bui, Van-Hai
Chang, Fangyuan
Su, Wencong
Wang, Mengqi
Murphey, Yi Lu
Da Silva, Felipe Leno
Huang, Can
Xue, Lingxiao
Glatt, Ruben
IEEE ACCESS, 2022, 10 : 78702 - 78712
[7] Deep Learning Neural Network-Based Weibo Sentiment Analysis
Wang, Yiming
Fang, Chun
PROCEEDINGS OF 2024 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND INTELLIGENT COMPUTING, BIC 2024, 2024, : 7 - 11
[8] Deep convolutional neural network-based Leveraging Lion Swarm Optimizer for gesture recognition and classification
Maashi, Mashael
Al-Hagery, Mohammed Abdullah
Rizwanullah, Mohammed
Osman, Azza Elneil
AIMS MATHEMATICS, 2024, 9 (04): : 9380 - 9393
[9] Neural Network-based control using Actor-Critic Reinforcement Learning and Grey Wolf Optimizer with experimental servo system validation
Zamfirache, Iuliu Alexandru
Precup, Radu-Emil
Roman, Raul-Cristian
Petriu, Emil M.
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 225
[10] Improving Convolutional Neural Network-Based Webshell Detection Through Reinforcement Learning
Wu, Yalun
Song, Minglu
Li, Yike
Tian, Yunzhe
Tong, Endong
Niu, Wenjia
Jia, Bowei
Huang, Haixiang
Li, Qiong
Liu, Jiqiang
INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT I, 2021, 12918 : 368 - 383

← 1 2 3 4 5 →