Bridging the simulation-to-real gap of depth images for deep reinforcement learning

Cited by: 0
Authors
Jang, Yoonsu [1]
Baek, Jongchan [3]
Jeon, Soo [4]
Han, Soohee [1, 2]
Affiliations
[1] Pohang Univ Sci & Technol, Dept Convergence IT Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[2] Pohang Univ Sci & Technol, Dept Elect Engn, 77 Cheongam Ro, Pohang Si 36763, Gyeongbuk, South Korea
[3] Elect & Telecommun Res Inst, 218 Gajeong Ro, Daejeon 34129, South Korea
[4] Univ Waterloo, Dept Mech & Mechatron Engn, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
Keywords
Autonomous navigation; Depth image; Mobile robot; Sim-to-real;
DOI
10.1016/j.eswa.2024.124310
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
While deep reinforcement learning (DRL) models are effective at learning appropriate actions from high-dimensional data, they require large amounts of training data that are costly and time-consuming to collect in real-world settings. For this reason, collecting data in simulations offers a promising alternative, but transferring policy networks from simulation to reality can be challenging due to differences in perception between the virtual and real worlds. This paper proposes a two-level method to bridge the simulation-to-reality (sim-to-real) gap for depth images, specifically for autonomous environmental navigation that uses DRL. Simulated depth images are first translated at the perception level through a generative adversarial network (GAN) to make them look like real data from a depth sensor. Simulated and GAN-generated depth images are then encoded into latent representations, and the encoder is trained in the latent space so that the two images form a matched pair. This encoder is trained simultaneously with a reinforcement learning network model to extract domain-invariant and task-relevant features from depth images and map the behavioral similarity of states to the latent space. Our experimental results demonstrate that our approach can effectively bridge the sim-to-real gap, enabling policies learned in simulation to maintain their control performance in the real world.
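The latent-space pairing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the linear `encode` function, the squared-distance `pairing_loss`, the image sizes, and the noise used as a stand-in for GAN output are all assumptions chosen only to show the idea of pulling the latent codes of a simulated image and its translated counterpart together.

```python
import numpy as np

def encode(images, weights):
    """Hypothetical encoder: flatten each depth image and project it to a latent vector."""
    flat = images.reshape(images.shape[0], -1)
    return np.tanh(flat @ weights)

def pairing_loss(z_sim, z_gen):
    """Assumed pairing objective: mean squared distance between paired latent vectors."""
    return float(np.mean(np.sum((z_sim - z_gen) ** 2, axis=1)))

rng = np.random.default_rng(0)

# Placeholder batch of four 8x8 simulated depth images with values in [0, 1].
sim_depth = rng.uniform(0.0, 1.0, size=(4, 8, 8))

# Stand-in for GAN-translated images: the same scenes with sensor-like noise added.
noise = rng.normal(0.0, 0.05, size=sim_depth.shape)
gen_depth = np.clip(sim_depth + noise, 0.0, 1.0)

# Randomly initialised encoder weights mapping 64 pixels to a 16-dimensional latent.
weights = rng.normal(0.0, 0.1, size=(64, 16))

z_sim = encode(sim_depth, weights)
z_gen = encode(gen_depth, weights)

loss_paired = pairing_loss(z_sim, z_gen)
loss_identical = pairing_loss(z_sim, z_sim)
```

Under these assumptions, minimising `pairing_loss` over the encoder weights would push the two domains toward the same latent code. The abstract states that the encoder is also trained jointly with the reinforcement learning model, which this sketch omits.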
Pages: 10