Bridging the simulation-to-real gap of depth images for deep reinforcement learning

被引:0
|
作者
Jang, Yoonsu [1 ]
Baek, Jongchan [3 ]
Jeon, Soo [4 ]
Han, Soohee [1 ,2 ]
机构
[1] Pohang Univ Sci & Technol, Dept Convergence IT Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[2] Pohang Univ Sci & Technol, Dept Elect Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[3] Elect & Telecommun Res Inst, 218 Gajeong Ro, Daejeon 34129, South Korea
[4] Univ Waterloo, Dept Mech & Mechatron Engn, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
关键词
Autonomous navigation; Depth image; Mobile robot; Sim-to-real;
D O I
10.1016/j.eswa.2024.124310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While deep reinforcement learning (DRL) models are effective at learning appropriate actions from highdimensional data, they require large amounts of costly and time-consuming training data to be collected in real -world settings. For this reason, collecting data in simulations offers a promising alternative, but transferring policy networks from simulation to reality can be challenging due to differences in perception between the virtual and real worlds. This paper proposes a two-level method to bridge the simulation-toreality (sim-to-real) gap for depth images, specifically for autonomous environmental navigation that uses DRL. Simulated depth images are first translated at a perception level through generative adversarial network (GAN) to make them look like real data from a depth sensor. Simulated and GAN-generated depth images are encoded into latent representations, and the encoder is trained in the latent space to make the two images paired. This encoder is trained simultaneously with a reinforcement learning network model to extract domain-invariant and task-relevant features from depth images and map the behavioral similarity of states to the latent space. Our experimental results demonstrate that our approach can effectively bridge the sim-to-real gap, enabling policies learned in simulation to maintain their control performance in the real world.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Depth Estimation of Computer Video Images Based on Deep Learning
    Xu, Ru
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (02) : 721 - 733
  • [42] Towards dense people detection with deep learning and depth images
    Fuentes-Jimenez, David
    Losada-Gutierrez, Cristina
    Casillas-Perez, David
    Macias-Guarasa, Javier
    Pizarro, Daniel
    Martin-Lopez, Roberto
    Luna, Carlos A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [43] Depth-based Obstacle Avoidance through Deep Reinforcement Learning
    Wu, Keyu
    Esfahani, Mahdi Abolfazli
    Yuan, Shenghai
    Wang, Han
    PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND ROBOTICS ENGINEERING (ICMRE 2019), 2019, : 102 - 106
  • [44] Bridging the gap between QP-based and MPC-based Reinforcement Learning
    Sawant, Shambhuraj
    Gros, Sebastien
    IFAC PAPERSONLINE, 2022, 55 (15): : 7 - 12
  • [45] Bridging the Gap between Simulation and Real Autonomous UAV Flights in Industrial Applications
    Perez-Segui, Rafael
    Arias-Perez, Pedro
    Melero-Deza, Javier
    Fernandez-Cortizas, Miguel
    Perez-Saura, David
    Campoy, Pascual
    AEROSPACE, 2023, 10 (09)
  • [46] A hybrid image dataset toward bridging the gap between real and simulation environments for robotics: Annotated desktop objects real and synthetic images dataset: ADORESet
    Bayraktar, Ertugrul
    Yigit, Cihat Bora
    Boyraz, Pinar
    MACHINE VISION AND APPLICATIONS, 2019, 30 (01) : 23 - 40
  • [47] Bridging the Gap Between Self-Report and Behavioral Laboratory Measures: A Real-Time Driving Task With Inverse Reinforcement Learning
    Lee, Sang Ho
    Song, Myeong Seop
    Oh, Min-hwan
    Ahn, Woo-Young
    PSYCHOLOGICAL SCIENCE, 2024, 35 (04) : 345 - 357
  • [48] Real-time model calibration with deep reinforcement learning
    Tian, Yuan
    Chao, Manuel Arias
    Kulkarni, Chetan
    Goebel, Kai
    Fink, Olga
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 165
  • [49] Mutation Testing of Deep Reinforcement Learning Based on Real Faults
    Tambon, Florian
    Majdinasab, Vahid
    Nikanjam, Amin
    Khomh, Foutse
    Antoniol, Giuliano
    2023 IEEE CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION, ICST, 2023, : 188 - 198
  • [50] Deep reinforcement learning for the real time control of stormwater systems
    Mullapudi, Abhiram
    Lewis, Matthew J.
    Gruden, Cyndee L.
    Kerkez, Branko
    ADVANCES IN WATER RESOURCES, 2020, 140