Bridging the simulation-to-real gap of depth images for deep reinforcement learning

被引:0
|
作者
Jang, Yoonsu [1 ]
Baek, Jongchan [3 ]
Jeon, Soo [4 ]
Han, Soohee [1 ,2 ]
机构
[1] Pohang Univ Sci & Technol, Dept Convergence IT Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[2] Pohang Univ Sci & Technol, Dept Elect Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[3] Elect & Telecommun Res Inst, 218 Gajeong Ro, Daejeon 34129, South Korea
[4] Univ Waterloo, Dept Mech & Mechatron Engn, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
关键词
Autonomous navigation; Depth image; Mobile robot; Sim-to-real;
D O I
10.1016/j.eswa.2024.124310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While deep reinforcement learning (DRL) models are effective at learning appropriate actions from highdimensional data, they require large amounts of costly and time-consuming training data to be collected in real -world settings. For this reason, collecting data in simulations offers a promising alternative, but transferring policy networks from simulation to reality can be challenging due to differences in perception between the virtual and real worlds. This paper proposes a two-level method to bridge the simulation-toreality (sim-to-real) gap for depth images, specifically for autonomous environmental navigation that uses DRL. Simulated depth images are first translated at a perception level through generative adversarial network (GAN) to make them look like real data from a depth sensor. Simulated and GAN-generated depth images are encoded into latent representations, and the encoder is trained in the latent space to make the two images paired. This encoder is trained simultaneously with a reinforcement learning network model to extract domain-invariant and task-relevant features from depth images and map the behavioral similarity of states to the latent space. Our experimental results demonstrate that our approach can effectively bridge the sim-to-real gap, enabling policies learned in simulation to maintain their control performance in the real world.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Transfer-Learning: Bridging the Gap between Real and Simulation Data for Machine Learning in Injection Molding
    Tercan, Hasan
    Guajardo, Alexandro
    Heinisch, Julian
    Thiele, Thomas
    Hopmann, Christian
    Meisen, Tobias
    51ST CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2018, 72 : 185 - 190
  • [12] Bridging the Gap Between Value and Policy Based Reinforcement Learning
    Nachum, Ofir
    Norouzi, Mohammad
    Xu, Kelvin
    Schuurmans, Dale
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [13] From Simulation to Real World Maneuver Execution using Deep Reinforcement Learning
    Capasso, Alessandro Paolo
    Bacchiani, Giulio
    Broggi, Alberto
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1570 - 1575
  • [14] Crowd Simulation by Deep Reinforcement Learning
    Lee, Jaedong
    Won, Jungdam
    Lee, Jehee
    ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION, AND GAMES (MIG 2018), 2018,
  • [15] Simulation-to-real domain adaptation with teacher-student learning for endoscopic instrument segmentation
    Sahu, Manish
    Mukhopadhyay, Anirban
    Zachow, Stefan
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (05) : 849 - 859
  • [16] Bridging the gap between debiasing and privacy for deep learning
    Barbano, Carlo Alberto
    Tartaglione, Enzo
    Grangetto, Marco
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3799 - 3808
  • [17] Clustering Mixed Data: Bridging the Gap with Deep Learning
    Yerra, Harini
    Kommu, Siddartha
    Kumar, B. Vijay
    Sudam, Rachana
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, MACHINE LEARNING AND APPLICATIONS, VOL 1, ICDSMLA 2023, 2025, 1273 : 695 - 702
  • [18] Focus on defocus: bridging the synthetic to real domain gap for depth estimation
    Maximov, Maxim
    Galim, Kevin
    Leal-Taixe, Laura
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1068 - 1077
  • [19] Robust deep learning for eye fundus images: Bridging real and synthetic data for enhancing generalization
    Oliveira, Guilherme C.
    Rosa, Gustavo H.
    Pedronette, Daniel C. G.
    Papa, Joao P.
    Kumar, Himeesh
    Passos, Leandro A.
    Kumar, Dinesh
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
  • [20] Robust deep learning for eye fundus images: Bridging real and synthetic data for enhancing generalization
    Oliveira, Guilherme C.
    Rosa, Gustavo H.
    Pedronette, Daniel C.G.
    Papa, João P.
    Kumar, Himeesh
    Passos, Leandro A.
    Kumar, Dinesh
    Biomedical Signal Processing and Control, 2024, 94