Bridging the simulation-to-real gap of depth images for deep reinforcement learning

被引:0
|
作者
Jang, Yoonsu [1 ]
Baek, Jongchan [3 ]
Jeon, Soo [4 ]
Han, Soohee [1 ,2 ]
机构
[1] Pohang Univ Sci & Technol, Dept Convergence IT Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[2] Pohang Univ Sci & Technol, Dept Elect Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[3] Elect & Telecommun Res Inst, 218 Gajeong Ro, Daejeon 34129, South Korea
[4] Univ Waterloo, Dept Mech & Mechatron Engn, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
关键词
Autonomous navigation; Depth image; Mobile robot; Sim-to-real;
D O I
10.1016/j.eswa.2024.124310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While deep reinforcement learning (DRL) models are effective at learning appropriate actions from highdimensional data, they require large amounts of costly and time-consuming training data to be collected in real -world settings. For this reason, collecting data in simulations offers a promising alternative, but transferring policy networks from simulation to reality can be challenging due to differences in perception between the virtual and real worlds. This paper proposes a two-level method to bridge the simulation-toreality (sim-to-real) gap for depth images, specifically for autonomous environmental navigation that uses DRL. Simulated depth images are first translated at a perception level through generative adversarial network (GAN) to make them look like real data from a depth sensor. Simulated and GAN-generated depth images are encoded into latent representations, and the encoder is trained in the latent space to make the two images paired. This encoder is trained simultaneously with a reinforcement learning network model to extract domain-invariant and task-relevant features from depth images and map the behavioral similarity of states to the latent space. Our experimental results demonstrate that our approach can effectively bridge the sim-to-real gap, enabling policies learned in simulation to maintain their control performance in the real world.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Scaling Simulation-to-Real Transfer by Learning Composable Robot Skills
    Julian, Ryan
    Heiden, Eric
    He, Zhanpeng
    Zhang, Hejia
    Schaal, Stefan
    Lim, Joseph
    Sukhatme, Gaurav
    Hausmann, Karol
    PROCEEDINGS OF THE 2018 INTERNATIONAL SYMPOSIUM ON EXPERIMENTAL ROBOTICS, 2020, 11 : 267 - 279
  • [2] Deep Reinforcement Learning With Adversarial Training for Automated Excavation Using Depth Images
    Osa, Takayuki
    Aizawa, Masanori
    IEEE ACCESS, 2022, 10 : 4523 - 4535
  • [3] Deep Reinforcement Learning for Motion Planning of Quadrotors Using Raw Depth Images
    Camci, Efe
    Campolo, Domenico
    Kayacan, Erdal
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] SafeAPT: Safe Simulation-to-Real Robot Learning Using Diverse Policies Learned in Simulation
    Kaushik, Rituraj
    Arndt, Karol
    Kyrki, Ville
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03): : 6838 - 6845
  • [5] Simulation-to-real generalization for deep-learning-based refraction-corrected ultrasound tomography image reconstruction
    Zhao, Wenzhao
    Fan, Yuling
    Wang, Hongjian
    Gemmeke, Hartmut
    van Dongen, Koen W. A.
    Hopp, Torsten
    Hesser, Juergen
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (03):
  • [6] Bridging the Gap Between Imitation Learning and Inverse Reinforcement Learning
    Piot, Bilal
    Geist, Matthieu
    Pietquin, Olivier
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) : 1814 - 1826
  • [7] Bridging the simulation-to-real gap for AI-based needle and target detection in robot-assisted ultrasound-guided interventions
    Arapi, Visar
    Hardt-Stremayr, Alexander
    Weiss, Stephan
    Steinbrener, Jan
    EUROPEAN RADIOLOGY EXPERIMENTAL, 2023, 7 (01)
  • [8] Simulation-to-real domain adaptation with teacher–student learning for endoscopic instrument segmentation
    Manish Sahu
    Anirban Mukhopadhyay
    Stefan Zachow
    International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 849 - 859
  • [9] Correction: Bridging the simulation-to-real gap for AI-based needle and target detection in robot-assisted ultrasound-guided interventions
    Visar Arapi
    Alexander Hardt-Stremayr
    Stephan Weiss
    Jan Steinbrener
    European Radiology Experimental, 7
  • [10] Scaling simulation-to-real transfer by learning a latent space of robot skills
    Julian, Ryan C.
    Heiden, Eric
    He, Zhanpeng
    Zhang, Hejia
    Schaal, Stefan
    Lim, Joseph J.
    Sukhatme, Gaurav S.
    Hausman, Karol
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (10-11): : 1259 - 1278