FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

被引：3

作者：

Sun, Chuangchuang ^{[1
]}

Kim, Dong-Ki ^{[1
]}

How, Jonathan P. ^{[1
]}

机构：

[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年

关键词：

ALGORITHMS;

D O I：

10.1109/ICRA48506.2021.9561147

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.

引用

页码：10617 / 10624

页数：8

共 50 条

[21] Deep Dense Network-Based Curriculum Reinforcement Learning for High-Speed Overtaking
Liu, Jia
Li, Huiyun
Yang, Zhiheng
Dang, Shaobo
Huang, Zhejun
IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2023, 15 (01) : 453 - 466
[22] Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers
Liu, Yuhan
Ma, Guangfu
Lyu, Yueyong
Wang, Pengyu
NEUROCOMPUTING, 2022, 484 : 67 - 78
[23] Use of Neural Network-Based Deep Learning Techniques for the Diagnostics of Skin Diseases
D. A. Gavrilov
A. V. Melerzanov
N. N. Shchelkunov
E. I. Zakirov
Biomedical Engineering, 2019, 52 : 348 - 352
[24] Transfer learning for deep neural network-based partial differential equations solving
Chen, Xinhai
Gong, Chunye
Wan, Qian
Deng, Liang
Wan, Yunbo
Liu, Yang
Chen, Bo
Liu, Jie
ADVANCES IN AERODYNAMICS, 2021, 3 (01)
[25] A Graph Convolutional Network-Based Deep Reinforcement Learning Approach for Resource Allocation in a Cognitive Radio Network
Zhao, Di
Qin, Hao
Song, Bin
Han, Beichen
Du, Xiaojiang
Guizani, Mohsen
SENSORS, 2020, 20 (18) : 1 - 23
[26] Use of Neural Network-Based Deep Learning Techniques for the Diagnostics of Skin Diseases
Gavrilov, D. A.
Melerzanov, A., V
Shchelkunov, N. N.
Zakirov, E., I
BIOMEDICAL ENGINEERING-MEDITSINSKAYA TEKNIKA, 2019, 52 (05): : 348 - 352
[27] Transfer learning for deep neural network-based partial differential equations solving
Xinhai Chen
Chunye Gong
Qian Wan
Liang Deng
Yunbo Wan
Yang Liu
Bo Chen
Jie Liu
Advances in Aerodynamics, 3
[28] Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices
Leem, Seong-Gyun
Yoo, In-Chul
Yook, Dongsuk
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2019, 65 (02) : 188 - 194
[29] Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
Stankovic, Ranka
Sandrih, Branislava
Krstev, Cvetana
Utvic, Milos
Skoric, Mihailo
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3954 - 3962
[30] A Novel Fingerprint Recovery Scheme using Deep Neural Network-based Learning
Samuel Lee
Seok-Woo Jang
Dongho Kim
Hernsoo Hahn
Gye-Young Kim
Multimedia Tools and Applications, 2021, 80 : 34121 - 34135

← 1 2 3 4 5 →