FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

被引：3

作者：

Sun, Chuangchuang ^{[1
]}

Kim, Dong-Ki ^{[1
]}

How, Jonathan P. ^{[1
]}

机构：

[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年

关键词：

ALGORITHMS;

D O I：

10.1109/ICRA48506.2021.9561147

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.

引用

页码：10617 / 10624

页数：8

共 50 条

[31] A Novel Fingerprint Recovery Scheme using Deep Neural Network-based Learning
Lee, Samuel
Jang, Seok-Woo
Kim, Dongho
Hahn, Hernsoo
Kim, Gye-Young
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (26-27) : 34121 - 34135
[32] Integration of Neural Network-Based Symbolic Regression in Deep Learning for Scientific Discovery
Kim, Samuel
Lu, Peter Y.
Mukherjee, Srijon
Gilbert, Michael
Jing, Li
Ceperic, Vladimir
Soljacic, Marin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (09) : 4166 - 4177
[33] Accelerating the Deep Reinforcement Learning with Neural Network Compression
Zhang, Hongjie
He, Zhuocheng
Li, Jing
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[34] Deep neural network pruning method based on sensitive layers and reinforcement learning
Yang, Wenchuan
Yu, Haoran
Cui, Baojiang
Sui, Runqi
Gu, Tianyu
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 1897 - 1917
[35] Autonomous Navigation with Improved Hierarchical Neural Network Based on Deep Reinforcement Learning
Zhang, Haiying
Qiu, Tenghai
Li, Shuxiao
Zhu, Chengfei
Lan, Xiaosong
Chang, Hongxing
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 4715 - 4720
[36] Deep neural network pruning method based on sensitive layers and reinforcement learning
Wenchuan Yang
Haoran Yu
Baojiang Cui
Runqi Sui
Tianyu Gu
Artificial Intelligence Review, 2023, 56 : 1897 - 1917
[37] A deep neural network-based transfer learning to enhance the performance and learning speed of BCI systems
Dehghani, Maryam
Mobaien, Ali
Boostani, Reza
BRAIN-COMPUTER INTERFACES, 2021, 8 (1-2) : 14 - 25
[38] A novel deep neural network-based technique for network embedding
Benbatata, Sabrina
Saoud, Bilal
Shayea, Ibraheem
Alsharabi, Naif
Alhammadi, Abdulraqeb
Alferaidi, Ali
Jadi, Amr
Daradkeh, Yousef Ibrahim
PEERJ COMPUTER SCIENCE, 2024, 10 : 1 - 29
[39] Neural Network-Based Limiter with Transfer Learning
Abgrall, Remi
Han Veiga, Maria
COMMUNICATIONS ON APPLIED MATHEMATICS AND COMPUTATION, 2020,
[40] Application of deep neural network and deep reinforcement learning in wireless communication
Li, Ming
Li, Hui
PLOS ONE, 2020, 15 (07):

← 1 2 3 4 5 →