FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

被引：3

作者：

Sun, Chuangchuang ^{[1
]}

Kim, Dong-Ki ^{[1
]}

How, Jonathan P. ^{[1
]}

机构：

[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年

关键词：

ALGORITHMS;

D O I：

10.1109/ICRA48506.2021.9561147

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.

引用

页码：10617 / 10624

页数：8

共 50 条

[41] Neural Network-Based Limiter with Transfer Learning
Rémi Abgrall
Maria Han Veiga
Communications on Applied Mathematics and Computation, 2023, 5 (2) : 532 - 572
[42] Neural Network-Based Limiter with Transfer Learning
Abgrall, Remi
Han Veiga, Maria
COMMUNICATIONS ON APPLIED MATHEMATICS AND COMPUTATION, 2023, 5 (02) : 532 - 572
[43] Seismic waveform inversion using a neural network-based forward
Fu, Hongsun
Zhang, Yan
Ma, Mingyue
SECOND INTERNATIONAL CONFERENCE ON PHYSICS, MATHEMATICS AND STATISTICS, 2019, 1324
[44] Graph Convolutional Network-Based Topology Embedded Deep Reinforcement Learning for Voltage Stability Control
Hossain, Ramij R.
Huang, Qiuhua
Huang, Renke
IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (05) : 4848 - 4851
[45] Deep Convolutional Neural Network Assisted Reinforcement Learning Based Mobile Network Power Saving
Wu, Shangbin
Wang, Yue
Bai, Lu
IEEE ACCESS, 2020, 8 (08): : 93671 - 93681
[46] A Neural Network based Deep Reinforcement Learning Controller for Voltage Regulation of Active Distribution Network
Jain, Jatin
Mohamed, Ahmed
Rahman, Tanvir
Ali, Mohamed
2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0280 - 0285
[47] Performance enhancement of the artificial neural network-based reinforcement learning for wind turbine yaw control
Saenz-Aguirre, Aitor
Zulueta, Ekaitz
Fernandez-Gamiz, Unai
Ulazia, Alain
Teso-Fz-Betono, Daniel
WIND ENERGY, 2020, 23 (03) : 676 - 690
[48] Distributed Neural Network-based Policy Gradient Reinforcement Learning for Multi-Robot Formations
Shang, Wen
Sun, Dong
2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 113 - +
[49] Political Optimizer with Probabilistic Neural Network-Based Arabic Comparative Opinion Mining
Alotaibi, Najm
Al-onazi, Badriyya B.
Nour, Mohamed K.
Mohamed, Abdullah
Motwakel, Abdelwahed
Mohammed, Gouse Pasha
Yaseen, Ishfaq
Rizwanullah, Mohammed
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 3121 - 3137
[50] Safe batch constrained deep reinforcement learning with generative adversarial network
Dong, Wenbo
Liu, Shaofan
Sun, Shiliang
INFORMATION SCIENCES, 2023, 634 : 259 - 270

← 1 2 3 4 5 →