Zero-shot sim-to-real transfer using Siamese-Q-Based reinforcement learning

Cited by: 0
Authors
Zhang, Zhenyu [1]
Xie, Shaorong [1]
Zhang, Han [1]
Luo, Xiangfeng [1]
Yu, Hang [1]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, 99 Shangda Rd, Shanghai 200444, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Reinforcement learning; Representation learning; Simulation to real; Contrastive learning; NETWORK;
DOI
10.1016/j.inffus.2024.102664
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To address real-world decision problems with reinforcement learning, it is common to first train a policy in a simulator for safety. Unfortunately, the sim-real gap hinders effective simulation-to-real transfer without substantial training data. Collecting real samples for complex tasks is often impractical, and the sample inefficiency of reinforcement learning exacerbates the simulation-to-real problem, even with online interaction or data. Representation learning can improve sample efficiency while preserving generalization by projecting high-dimensional inputs into low-dimensional representations. However, whether trained independently or simultaneously with reinforcement learning, representation learning remains a separate auxiliary task that lacks task-related features and the generalization needed for simulation-to-real transfer. This paper proposes Siamese-Q, a new representation learning method that employs Siamese networks for zero-shot simulation-to-real transfer and narrows the distance between inputs with the same semantics in the latent space with respect to Q values. This fuses task-related information into the representation and improves the generalization of the policy. Evaluation in virtual and real autonomous vehicle scenarios demonstrates substantial improvements of 19.5% and 94.2%, respectively, over conventional representation learning, without requiring any real-world observations or on-policy interaction, enabling reinforcement learning policies trained in simulation to transfer to reality.
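The idea of pulling together, with respect to Q values, inputs that share the same semantics can be illustrated with a small sketch. This is not the paper's implementation: the encoder, the linear Q head, the squared-error loss, and all names (`encode`, `q_values`, `siamese_q_loss`, `W`, `V`) are assumptions chosen only to show a shared-weight (Siamese) encoder whose two views are compared through their Q values.

```python
import numpy as np


def encode(obs, W):
    """Shared (Siamese) encoder: the same weights W embed every view."""
    return np.tanh(obs @ W)


def q_values(z, V):
    """Hypothetical linear Q head: one value per action for each latent vector."""
    return z @ V


def siamese_q_loss(obs_a, obs_b, W, V):
    """Mean squared gap between the Q values of two views of the same states.

    Assumption: obs_a and obs_b are row-aligned, so row i of each array
    depicts the same underlying state (for example, a simulated frame and
    a visually perturbed copy of it).
    """
    q_a = q_values(encode(obs_a, W), V)
    q_b = q_values(encode(obs_b, W), V)
    return float(np.mean((q_a - q_b) ** 2))


rng = np.random.default_rng(0)
W = rng.normal(size=(8, 4))   # encoder weights: 8-dim input -> 4-dim latent
V = rng.normal(size=(4, 3))   # Q head weights: 4-dim latent -> 3 actions
obs = rng.normal(size=(5, 8))                      # batch of 5 observations
noisy = obs + 0.1 * rng.normal(size=obs.shape)     # perturbed view of the same states

loss_same = siamese_q_loss(obs, obs, W, V)     # identical views
loss_noisy = siamese_q_loss(obs, noisy, W, V)  # perturbed views
```

Minimizing a term of this kind over the encoder weights would push views of the same state toward latent codes that yield matching Q values, which is one plausible reading of how task-related information enters the representation.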
Pages: 13
Related papers
50 in total
  • [1] Real-Time Parameter Control for Trajectory Generation Using Reinforcement Learning With Zero-Shot Sim-to-Real Transfer
    Ji, Chang-Hun
    Lim, Gyeonghun
    Han, Youn-Hee
    Moon, Sungtae
    IEEE ACCESS, 2024, 12 : 171662 - 171674
  • [2] Zero-shot sim-to-real transfer of reinforcement learning framework for robotics manipulation with demonstration and force feedback
    Chen, Yuanpei
    Zeng, Chao
    Wang, Zhiping
    Lu, Peng
    Yang, Chenguang
    ROBOTICA, 2023, 41 (03) : 1015 - 1024
  • [3] Crossing the Gap: A Deep Dive into Zero-Shot Sim-to-Real Transfer for Dynamics
    Valassakis, Eugene
    Ding, Zihan
    Johns, Edward
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020 : 5372 - 5379
  • [4] PencilNet: Zero-Shot Sim-to-Real Transfer Learning for Robust Gate Perception in Autonomous Drone Racing
    Pham, Huy Xuan
    Sarabakha, Andriy
    Odnoshyvkin, Mykola
    Kayacan, Erdal
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11847 - 11854
  • [5] TactGen: Tactile Sensory Data Generation via Zero-Shot Sim-to-Real Transfer
    Zhong, Shaohong
    Albini, Alessandro
    Maiolino, Perla
    Posner, Ingmar
    IEEE TRANSACTIONS ON ROBOTICS, 2025, 41 : 1316 - 1328
  • [6] KOVIS: Keypoint-based Visual Servoing with Zero-Shot Sim-to-Real Transfer for Robotics Manipulation
    Puang, En Yen
    Tee, Keng Peng
    Jing, Wei
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020 : 7527 - 7533
  • [7] Toward Zero-Shot Sim-to-Real Transfer Learning for Pneumatic Soft Robot 3D Proprioceptive Sensing
    Yoo, Uksang
    Zhao, Hanwen
    Altamirano, Alvaro
    Yuan, Wenzhen
    Feng, Chen
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023 : 544 - 551
  • [8] Zero-Shot Sim-to-Real Transfer of Tactile Control Policies for Aggressive Swing-Up Manipulation
    Bi, Thomas
    Sferrazza, Carmelo
    D'Andrea, Raffaello
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 5761 - 5768
  • [9] Digital Twin (DT)-CycleGAN: Enabling Zero-Shot Sim-to-Real Transfer of Visual Grasping Models
    Liu, David
    Chen, Yuzhong
    Wu, Zihao
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) : 2421 - 2428
  • [10] Zero-Shot Sim2Real Transfer of Deep Reinforcement Learning Controller for Tower Crane System
    Mohiuddin, Mohammed B.
    Haddad, Abdel Gafoor
    Boiko, Igor
    Zweiri, Yahya
    IFAC PAPERSONLINE, 2023, 56 (02) : 10016 - 10020