OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World

Cited by: 0

Authors:

Pham, Tu-Hoa ^{[1]}

De Magistris, Giovanni ^{[1]}

Tachibana, Ryuki ^{[1]}

Affiliations:

[1] IBM Research AI, Tokyo, Japan

Source:

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018

Keywords:

DOI:

Not available

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

While deep reinforcement learning techniques have recently produced considerable achievements on many decision-making problems, their use in robotics has largely been limited to simulated worlds or restricted motions, since unconstrained trial-and-error interactions in the real world can have undesirable consequences for the robot or its environment. To overcome such limitations, we propose a novel reinforcement learning architecture, OptLayer, that takes as inputs possibly unsafe actions predicted by a neural network and outputs the closest actions that satisfy chosen constraints. While learning control policies often requires carefully crafted rewards and penalties while exploring the range of possible actions, OptLayer ensures that only safe actions are actually executed and unsafe predictions are penalized during training. We demonstrate the effectiveness of our approach on robot reaching tasks, both simulated and in the real world.
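
The core idea in the abstract, mapping a possibly unsafe predicted action to the closest action that satisfies the constraints and penalizing the unsafe prediction, can be illustrated with a minimal sketch. The abstract does not state the form of the constraints, so this example assumes simple per-dimension bounds, for which the closest feasible point in Euclidean distance is obtained by clipping. The function names, the `penalty_scale` parameter, and the squared-distance penalty are hypothetical choices for illustration, not the authors' implementation; more general constraints would require a constrained optimization solver instead of clipping.

```python
from typing import List, Sequence, Tuple


def project_to_box(action: Sequence[float],
                   lower: Sequence[float],
                   upper: Sequence[float]) -> List[float]:
    """Return the point of the box [lower, upper] closest to `action`.

    Assumption: constraints are independent per-dimension bounds, so the
    Euclidean projection reduces to element-wise clipping.
    """
    return [min(max(a, lo), hi) for a, lo, hi in zip(action, lower, upper)]


def safe_action_and_penalty(action: Sequence[float],
                            lower: Sequence[float],
                            upper: Sequence[float],
                            penalty_scale: float = 1.0
                            ) -> Tuple[List[float], float]:
    """Return the safe action to execute and a training penalty.

    The penalty (a hypothetical choice) is the negative squared distance
    between the predicted action and its safe projection, so predictions
    that already satisfy the bounds are not penalized.
    """
    safe = project_to_box(action, lower, upper)
    correction = sum((a - s) ** 2 for a, s in zip(action, safe))
    return safe, -penalty_scale * correction


if __name__ == "__main__":
    predicted = [0.5, -2.0, 1.5]   # raw network output, possibly unsafe
    lower = [-1.0, -1.0, -1.0]     # hypothetical lower bounds
    upper = [1.0, 1.0, 1.0]        # hypothetical upper bounds
    safe, penalty = safe_action_and_penalty(predicted, lower, upper)
    print(safe, penalty)
```

In this sketch only `safe` would be sent to the robot, while `penalty` would be added to the reward signal during training so that the network learns to predict actions that need no correction.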

Citation

Pages: 6236 - 6243

Page count: 8

50 records in total

[21] Constrained deep reinforcement learning for maritime platform defense
Markowitz, Jared
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
[22] Deep Reinforcement Learning for Constrained Field Development Optimization in Subsurface Two-phase Flow
Nasir, Yusuf
He, Jincong
Hu, Chaoshun
Tanaka, Shusei
Wang, Kainan
Wen, XianHuan
FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2021, 7
[23] Practical Deep Learning Architecture Optimization
Wistuba, Martin
2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018: 263 - 272
[24] Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty
Petsagkourakis, P.
Sandoval, I. O.
Bradford, E.
Zhang, D.
del Rio-Chanona, E. A.
IFAC PAPERSONLINE, 2020, 53 (02): 11264 - 11270
[25] Constrained Variational Policy Optimization for Safe Reinforcement Learning
Liu, Zuxin
Cen, Zhepeng
Isenbaev, Vladislav
Liu, Wei
Wu, Zhiwei Steven
Li, Bo
Zhao, Ding
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
[26] Domain Adapting Deep Reinforcement Learning for Real-World Speech Emotion Recognition
Rajapakshe, Thejan
Rana, Rajib
Khalifa, Sara
Schuller, Bjoern W.
IEEE ACCESS, 2024, 12: 193101 - 193114
[27] Exploring Applications of Deep Reinforcement Learning for Real-world Autonomous Driving Systems
Talpaert, Victor
Sobh, Ibrahim
Kiran, B. Ravi
Mannion, Patrick
Yogamani, Senthil
El-Sallab, Ahmad
Perez, Patrick
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019: 564 - 572
[28] A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots
Lynnerup, Nicolai A.
Nolling, Laura
Hasle, Rasmus
Hallam, John
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[29] ENERO: Efficient real-time WAN routing optimization with Deep Reinforcement Learning
Almasan, Paul
Xiao, Shihan
Cheng, Xiangle
Shi, Xiang
Barlet-Ros, Pere
Cabellos-Aparicio, Albert
COMPUTER NETWORKS, 2022, 214
[30] Deep Reinforcement Learning for Real-Time Optimization of Pumps in Water Distribution Systems
Hajgato, Gergely
Paal, Gyorgy
Gyires-Toth, Balint
JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2020, 146 (11)
