OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World

被引：0

作者：

Tu-Hoa Pham ^{[1
]}

De Magistris, Giovanni ^{[1
]}

Tachibana, Ryuki ^{[1
]}

机构：

[1] IBM Res AI, Tokyo, Japan

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

While deep reinforcement learning techniques have recently produced considerable achievements on many decision-making problems, their use in robotics has largely been limited to simulated worlds or restricted motions, since unconstrained trial-and-error interactions in the real world can have undesirable consequences for the robot or its environment. To overcome such limitations, we propose a novel reinforcement learning architecture, OptLayer, that takes as inputs possibly unsafe actions predicted by a neural network and outputs the closest actions that satisfy chosen constraints. While learning control policies often requires carefully crafted rewards and penalties while exploring the range of possible actions, OptLayer ensures that only safe actions are actually executed and unsafe predictions are penalized during training. We demonstrate the effectiveness of our approach on robot reaching tasks, both simulated and in the real world.

引用

页码：6236 / 6243

页数：8

共 50 条

[1] Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications
Nambiar, Mila
Ghosh, Supriyo
Ong, Priscilla
Chan, Yu En
Bee, Yong Mong
Krishnaswamy, Pavitra
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 4673 - 4684
[2] Constrained Deep Reinforcement Learning for Fronthaul Compression Optimization
Gronland, Axel
Russo, Alessio
Jedra, Yassir
Klaiqi, Bleron
Gelabert, Xavier
2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 498 - 504
[3] Deep reinforcement learning-guided coevolutionary algorithm for constrained multiobjective optimization
Luo, Wenguan
Yu, Xiaobing
Yen, Gary G.
Wei, Yifan
Information Sciences, 2025, 692
[4] Real-world Robot Reaching Skill Learning Based on Deep Reinforcement Learning
Liu, Naijun
Lu, Tao
Cai, Yinghao
Wang, Rui
Wang, Shuo
PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4780 - 4784
[5] Real-Sim-Real Transfer for Real-World Robot Control Policy Learning with Deep Reinforcement Learning
Liu, Naijun
Cai, Yinghao
Lu, Tao
Wang, Rui
Wang, Shuo
APPLIED SCIENCES-BASEL, 2020, 10 (05):
[6] Deep learning, reinforcement learning, and world models
Matsuo, Yutaka
LeCun, Yann
Sahani, Maneesh
Precup, Doina
Silver, David
Sugiyama, Masashi
Uchibe, Eiji
Morimoto, Jun
NEURAL NETWORKS, 2022, 152 : 267 - 275
[7] On Sampling Efficiency Optimization in Constrained Reinforcement Learning
Jia, Qing-Shan
2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, AIM 2024, 2024, : 966 - 971
[8] Deep reinforcement learning assisted co-evolutionary differential evolution for constrained optimization
Hu, Zhenzhen
Gong, Wenyin
Pedrycz, Witold
Li, Yanchi
SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
[9] Constrained Multi-Objective Optimization With Deep Reinforcement Learning Assisted Operator Selection
Fei Ming
Wenyin Gong
Ling Wang
Yaochu Jin
IEEE/CAA Journal of Automatica Sinica, 2024, 11 (04) : 919 - 959
[10] Deep reinforcement learning-based framework for constrained any-objective optimization
Honari H.
Khodaygan S.
Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9575 - 9591

← 1 2 3 4 5 →