Sim-to-Real in Reinforcement Learning for Everyone

被引:7
|
作者
Vacaro, Juliano [1 ]
Marques, Guilherme [1 ]
Oliveira, Bruna [1 ]
Paz, Gabriel [1 ]
Paula, Thomas [1 ]
Staehler, Wagston [1 ]
Murphy, David [2 ]
机构
[1] HP Labs AIECL, Porto Alegre, RS, Brazil
[2] HP Labs AIECL, Palo Alto, CA USA
关键词
D O I
10.1109/LARS-SBR-WRE48964.2019.00060
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In reinforcement learning (RL), it remains a challenge to have a robotic agent perform a task in the real world for which it was trained in simulation. In this paper, we present our work training a low-cost robotic arm in simulation to move towards a predefined target in space, represented by a red ball in an RGB image, and transferring the capability to the real arm. We exercised the entire end-to-end flow including the 3D modeling of the arm, training of a state-of-the-art RL policy in simulation with multiple actors in a distributed fashion, domain randomization in order to close the sim-to-real gap, and finally the execution of the trained model in the real robot. We also implemented a mechanism to edit the image captured from the camera before sending it to the model for inference, which allowed us to automate reward computation in the physical world. Our work highlights important challenges of training RL agents and moving them to the real world, validating important aspects shown by other works as well as detailing steps not explained by some of them (e.g. how to compute the reward in the real world). The conducted experiments show the improvements observed as the techniques were added to the final solution.
引用
收藏
页码:305 / 310
页数:6
相关论文
共 50 条
  • [31] Accelerated Robot Skill Acquisition by Reinforcement Learning-Aided Sim-to-Real Domain Adaptation
    Loncarcvic, Zvezdan
    Ude, Ales
    Gams, Andrej
    [J]. 2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 269 - 274
  • [32] Cyclic policy distillation: Sample-efficient sim-to-real reinforcement learning with domain randomization
    Kadokawa, Yuki
    Zhu, Lingwei
    Tsurumine, Yoshihisa
    Matsubara, Takamitsu
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 165
  • [33] A Domain Data Pattern Randomization based Deep Reinforcement Learning method for Sim-to-Real transfer
    Gong, Peng
    Shi, Dianxi
    Xue, Chao
    Chen, Xucan
    [J]. 2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 1 - 7
  • [34] A Sim-to-Real Pipeline for Deep Reinforcement Learning for Autonomous Robot Navigation in Cluttered Rough Terrain
    Hu, Han
    Zhang, Kaicheng
    Tan, Aaron Hao
    Ruan, Michael
    Agia, Christopher
    Nejat, Goldie
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6569 - 6576
  • [35] Sim-to-real transfer reinforcement learning for control of thermal effects of an atmospheric pressure plasma jet
    Witman, Matthew
    Gidon, Dogan
    Graves, David B.
    Smit, Berend
    Mesbah, Ali
    [J]. PLASMA SOURCES SCIENCE & TECHNOLOGY, 2019, 28 (09):
  • [36] Sim-to-Real Deep Reinforcement Learning for Safe End-to-End Planning of Aerial Robots
    Ugurlu, Halil Ibrahim
    Xuan Huy Pham
    Kayacan, Erdal
    [J]. ROBOTICS, 2022, 11 (05)
  • [37] AutoVRL: A High Fidelity Autonomous Ground Vehicle Simulator for Sim-to-Real Deep Reinforcement Learning
    Sivashangaran, Shathushan
    Khairnar, Apoorva
    Eskandarian, Azim
    [J]. IFAC PAPERSONLINE, 2023, 56 (03): : 475 - 480
  • [38] Transferring Multi-Agent Reinforcement Learning Policies for Autonomous Driving using Sim-to-Real
    Candela, Eduardo
    Parada, Leandro
    Marques, Luis
    Georgescu, Tiberiu-Andrei
    Demiris, Yiannis
    Angeloudis, Panagiotis
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8814 - 8820
  • [39] Autonomous Blood Suction for Robot-Assisted Surgery: A Sim-to-Real Reinforcement Learning Approach
    Ou, Yafei
    Soleymani, Abed
    Li, Xingyu
    Tavakoli, Mahdi
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 7246 - 7253
  • [40] Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
    Tan, Jie
    Zhang, Tingnan
    Coumans, Erwin
    Iscen, Atil
    Bai, Yunfei
    Hafner, Danijar
    Bohez, Steven
    Vanhoucke, Vincent
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,