Real-Sim-Real Transfer for Real-World Robot Control Policy Learning with Deep Reinforcement Learning

被引:20
|
作者
Liu, Naijun [1 ,2 ]
Cai, Yinghao [1 ]
Lu, Tao [1 ]
Wang, Rui [1 ,3 ]
Wang, Shuo [1 ,2 ,4 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Prov Key Lab Robot & Intelligent Syst, Shenzhen 518055, Peoples R China
[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Shanghai 200031, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 05期
基金
中国国家自然科学基金;
关键词
robot; policy learning; reality gap; simulated environment; deep reinforcement learning; DOMAIN ADAPTATION;
D O I
10.3390/app10051555
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Compared to traditional data-driven learning methods, recently developed deep reinforcement learning (DRL) approaches can be employed to train robot agents to obtain control policies with appealing performance. However, learning control policies for real-world robots through DRL is costly and cumbersome. A promising alternative is to train policies in simulated environments and transfer the learned policies to real-world scenarios. Unfortunately, due to the reality gap between simulated and real-world environments, the policies learned in simulated environments often cannot be generalized well to the real world. Bridging the reality gap is still a challenging problem. In this paper, we propose a novel real-sim-real (RSR) transfer method that includes a real-to-sim training phase and a sim-to-real inference phase. In the real-to-sim training phase, a task-relevant simulated environment is constructed based on semantic information of the real-world scenario and coordinate transformation, and then a policy is trained with the DRL method in the built simulated environment. In the sim-to-real inference phase, the learned policy is directly applied to control the robot in real-world scenarios without any real-world data. Experimental results in two different robot control tasks show that the proposed RSR method can train skill policies with high generalization performance and significantly low training costs.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] A Sim-to-Real Pipeline for Deep Reinforcement Learning for Autonomous Robot Navigation in Cluttered Rough Terrain
    Hu, Han
    Zhang, Kaicheng
    Tan, Aaron Hao
    Ruan, Michael
    Agia, Christopher
    Nejat, Goldie
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6569 - 6576
  • [32] "Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer
    Hundt, Andrew
    Killeen, Benjamin
    Greene, Nicholas
    Wu, Hongtao
    Kwon, Heeyeon
    Paxton, Chris
    Hager, Gregory D.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6724 - 6731
  • [33] Demonstration of Intelligent HVAC Load Management With Deep Reinforcement Learning: Real-World Experience of Machine Learning in Demand Control
    Du, Yan
    Li, Fangxing
    Kurte, Kuldeep
    Munk, Jeffrey
    Zandi, Helia
    [J]. IEEE POWER & ENERGY MAGAZINE, 2022, 20 (03): : 42 - 53
  • [34] Facilitating Sim-to-Real by Intrinsic Stochasticity of Real-Time Simulation in Reinforcement Learning for Robot Manipulation
    Enayati, Amir M. Soufi
    Dershan, Ram
    Zhang, Zengjie
    Richert, Dean
    Najjaran, Homayoun
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1791 - 1804
  • [35] A digital twin-based sim-to-real transfer for deep reinforcement learning-enabled industrial robot grasping
    Liu, Yongkui
    Xu, He
    Liu, Ding
    Wang, Lihui
    [J]. ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 78
  • [36] Reinforcement Learning and Sim-to-Real Transfer of Reorientation and Landing Control for Quadruped Robots on Asteroids
    Qi, Ji
    Gao, Haibo
    Su, Huanli
    Huo, Mingying
    Yu, Haitao
    Deng, Zongquan
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, : 1 - 9
  • [37] Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World
    Hong, Myung Rae
    Kang, Sanghun
    Lee, Jingoo
    Seo, Sungchul
    Han, Seungyong
    Koh, Je-Sung
    Kang, Daeshik
    [J]. IEEE ACCESS, 2023, 11 : 95195 - 95200
  • [38] Learning With Real-World Data
    不详
    [J]. IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (05): : 158 - 159
  • [39] A REFUGE FOR REAL-WORLD LEARNING
    MCFADEN, D
    NELSON, B
    [J]. EDUCATIONAL LEADERSHIP, 1995, 52 (08) : 11 - 13
  • [40] Sim-to-Real Deep Reinforcement Learning with Manipulators for Pick-and-Place
    Liu, Wenxing
    Niu, Hanlin
    Skilton, Robert
    Carrasco, Joaquin
    [J]. TOWARDS AUTONOMOUS ROBOTIC SYSTEMS, TAROS 2023, 2023, 14136 : 240 - 252