Sim-to-Real in Reinforcement Learning for Everyone

被引:7
|
作者
Vacaro, Juliano [1 ]
Marques, Guilherme [1 ]
Oliveira, Bruna [1 ]
Paz, Gabriel [1 ]
Paula, Thomas [1 ]
Staehler, Wagston [1 ]
Murphy, David [2 ]
机构
[1] HP Labs AIECL, Porto Alegre, RS, Brazil
[2] HP Labs AIECL, Palo Alto, CA USA
关键词
D O I
10.1109/LARS-SBR-WRE48964.2019.00060
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In reinforcement learning (RL), it remains a challenge to have a robotic agent perform a task in the real world for which it was trained in simulation. In this paper, we present our work training a low-cost robotic arm in simulation to move towards a predefined target in space, represented by a red ball in an RGB image, and transferring the capability to the real arm. We exercised the entire end-to-end flow including the 3D modeling of the arm, training of a state-of-the-art RL policy in simulation with multiple actors in a distributed fashion, domain randomization in order to close the sim-to-real gap, and finally the execution of the trained model in the real robot. We also implemented a mechanism to edit the image captured from the camera before sending it to the model for inference, which allowed us to automate reward computation in the physical world. Our work highlights important challenges of training RL agents and moving them to the real world, validating important aspects shown by other works as well as detailing steps not explained by some of them (e.g. how to compute the reward in the real world). The conducted experiments show the improvements observed as the techniques were added to the final solution.
引用
收藏
页码:305 / 310
页数:6
相关论文
共 50 条
  • [1] Grounded action transformation for sim-to-real reinforcement learning
    Josiah P. Hanna
    Siddharth Desai
    Haresh Karnan
    Garrett Warnell
    Peter Stone
    [J]. Machine Learning, 2021, 110 : 2469 - 2499
  • [2] Grounded action transformation for sim-to-real reinforcement learning
    Hanna, Josiah P.
    Desai, Siddharth
    Karnan, Haresh
    Warnell, Garrett
    Stone, Peter
    [J]. MACHINE LEARNING, 2021, 110 (09) : 2469 - 2499
  • [3] Meta Reinforcement Learning for Sim-to-real Domain Adaptation
    Arndt, Karol
    Hazara, Murtaza
    Ghadirzadeh, Ali
    Kyrki, Ville
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2725 - 2731
  • [4] Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
    Zhao, Wenshuai
    Queralta, Jorge Pena
    Westerlund, Tomi
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 737 - 744
  • [5] Survey on Sim-to-real Transfer Reinforcement Learning in Robot Systems
    Lin, Qian
    Yu, Chao
    Wu, Xia-Wei
    Dong, Yin-Zhao
    Xu, Xin
    Zhang, Qiang
    Guo, Xian
    [J]. Ruan Jian Xue Bao/Journal of Software, 2024, 35 (02): : 711 - 738
  • [6] Dynamic Bipedal Turning through Sim-to-Real Reinforcement Learning
    Yu, Fangzhou
    Batke, Ryan
    Dao, Jeremy
    Hurst, Jonathan
    Green, Kevin
    Fern, Alan
    [J]. 2022 IEEE-RAS 21ST INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2022, : 903 - 910
  • [7] Sim-to-Real Application of Reinforcement Learning Agents for Autonomous, Real Vehicle Drifting
    Toth, Szilard Hunor
    Viharos, Zsolt Janos
    Bardos, Adam
    Szalay, Zsolt
    [J]. VEHICLES, 2024, 6 (02): : 781 - 798
  • [8] Sim-to-Real Robotic Sketching using Behavior Cloning and Reinforcement Learning
    [J]. Jia, Biao (biao@umd.edu), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [9] Sim-to-Real: Mapless Navigation for USVs Using Deep Reinforcement Learning
    Wang, Ning
    Wang, Yabiao
    Zhao, Yuming
    Wang, Yong
    Li, Zhigang
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)
  • [10] Blind Bipedal Stair Traversal via Sim-to-Real Reinforcement Learning
    Siekmann, Jonah
    Green, Kevin
    Warila, John
    Fern, Alan
    Hurst, Jonathan
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,