FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

Cited by: 3
Authors
Sun, Chuangchuang [1]
Kim, Dong-Ki [1]
How, Jonathan P. [1]
Affiliations
[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Source
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021
Keywords
ALGORITHMS;
DOI
10.1109/ICRA48506.2021.9561147
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly designed metric. To the best of our knowledge, this is the first DNN-based optimizer for constrained optimization with the forward invariance guarantee. We show that our optimizer trains a policy to decrease the constraint violation and maximize the cumulative reward monotonically. Results on numerical constrained optimization and obstacle-avoidance navigation validate the theoretical findings.
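The forward-invariance idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes a single constraint c(θ) ≤ 0, a plain Euclidean metric rather than the paper's newly designed one, and a hand-written step in place of the learned DNN optimizer. The function name `project_step_onto_halfspace` and the rate `alpha` are hypothetical. A proposed update d is projected onto the half-space ∇c(θ)ᵀd ≤ -α·c(θ), so that, to first order, the violation shrinks by a factor (1 - α) per step.

```python
import numpy as np

def project_step_onto_halfspace(step, grad_c, c_value, alpha=0.5):
    """Euclidean projection of `step` onto {d : grad_c @ d <= -alpha * c_value}.

    Illustrative only: one linear constraint and the Euclidean metric are
    assumptions, not the polytope projection or metric used in the paper.
    """
    step = np.asarray(step, dtype=float)
    grad_c = np.asarray(grad_c, dtype=float)
    bound = -alpha * c_value              # right-hand side of the linear constraint
    excess = grad_c @ step - bound        # > 0 means the proposed step violates it
    norm_sq = grad_c @ grad_c
    if excess <= 0.0 or norm_sq == 0.0:
        # Already feasible, or a zero gradient leaves no direction to project along.
        return step
    # Closed-form projection onto a half-space: remove the violating component.
    return step - (excess / norm_sq) * grad_c

# Toy linear constraint c(theta) = theta[0] + theta[1] - 1 <= 0 (hypothetical).
theta = np.array([1.0, 1.0])              # c(theta) = 1, so the constraint is violated
grad_c = np.array([1.0, 1.0])
c_value = theta.sum() - 1.0
proposed = np.array([0.5, 0.0])           # an objective-driven step that worsens the violation
safe_step = project_step_onto_halfspace(proposed, grad_c, c_value, alpha=0.5)
new_c = (theta + safe_step).sum() - 1.0   # constraint value after the projected step
```

Because the toy constraint is linear, the first-order bound is exact here: the violation after the projected step is (1 - α) times the violation before it. For a nonlinear constraint the decrease would hold only approximately and would depend on the step size.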
Pages: 10617-10624
Page count: 8