Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer

被引:2
|
作者
Lai, Hang [1 ,2 ]
Zhang, Weinan [1 ]
He, Xialin [1 ]
Yu, Chen [2 ,3 ]
Tian, Zheng [4 ]
Yu, Yong [1 ]
Wang, Jun [2 ,5 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Digital Brain Lab, Shanghai, Peoples R China
[3] Shanghai Tech Univ, Sch Info Sci & Tech, Shanghai, Peoples R China
[4] Shanghai Tech Univ, Sch Creat & Art, Shanghai, Peoples R China
[5] UCL, Ctr Artificial Intelligence, London, England
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICRA48891.2023.10160497
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning has recently emerged as an appealing alternative for legged locomotion over multiple terrains by training a policy in physical simulation and then transferring it to the real world (i.e., sim-to-real transfer). Despite considerable progress, the capacity and scalability of traditional neural networks are still limited, which may hinder their applications in more complex environments. In contrast, the Transformer architecture has shown its superiority in a wide range of large-scale sequence modeling tasks, including natural language processing and decision-making problems. In this paper, we propose Terrain Transformer (TERT), a high-capacity Transformer model for quadrupedal locomotion control on various terrains. Furthermore, to better leverage Transformer in sim-to-real scenarios, we present a novel two-stage training framework consisting of an offline pretraining stage and an online correction stage, which can naturally integrate Transformer with privileged training. Extensive experiments in simulation demonstrate that TERT outperforms state-of-the-art baselines on different terrains in terms of return, energy consumption and control smoothness. In further real-world validation, TERT successfully traverses nine challenging terrains, including sand pit and stair down, which can not be accomplished by strong baselines.
引用
收藏
页码:5141 / 5147
页数:7
相关论文
共 50 条
  • [31] Survey on Sim-to-real Transfer Reinforcement Learning in Robot Systems
    Lin Q.
    Yu C.
    Wu X.-W.
    Dong Y.-Z.
    Xu X.
    Zhang Q.
    Guo X.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (02): : 711 - 738
  • [32] Bidirectional Sim-to-Real Transfer for GelSight Tactile Sensors With CycleGAN
    Chen, Weihang
    Xu, Yuan
    Chen, Zhenyang
    Zeng, Peiyu
    Dang, Renjun
    Chen, Rui
    Xu, Jing
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 6187 - 6194
  • [33] TactGen: Tactile Sensory Data Generation via Zero-Shot Sim-to-Real Transfer
    Zhong, Shaohong
    Albini, Alessandro
    Maiolino, Perla
    Posner, Ingmar
    IEEE TRANSACTIONS ON ROBOTICS, 2025, 41 : 1316 - 1328
  • [34] Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning
    Da, Longchao
    Gao, Minquan
    Mei, Hao
    Wei, Hua
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 82 - 90
  • [35] Learning quadrupedal locomotion over challenging terrain
    Lee J.
    Hwangbo J.
    Wellhausen L.
    Koltun V.
    Hutter M.
    Lee, Joonho (jolee@ethz.ch), 1600, American Association for the Advancement of Science (05):
  • [36] On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer
    Aljalbout, Elie
    Frank, Felix
    Karl, Maximilian
    van der Smagt, Patrick
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06): : 5895 - 5902
  • [37] Quadrupedal Locomotion on Uneven Terrain With Sensorized Feet
    Valsecchi, Giorgio
    Grandia, Ruben
    Hutter, Marco
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 1548 - 1555
  • [38] Learning quadrupedal locomotion over challenging terrain
    Lee, Joonho
    Hwangbo, Jemin
    Wellhausen, Lorenz
    Koltun, Vladlen
    Hutter, Marco
    SCIENCE ROBOTICS, 2020, 5 (47)
  • [39] Sim-to-real transfer of co-optimized soft robot crawlers
    Charles Schaff
    Audrey Sedal
    Shiyao Ni
    Matthew R. Walter
    Autonomous Robots, 2023, 47 : 1195 - 1211
  • [40] Sim-to-Real Policy and Reward Transfer with Adaptive Forward Dynamics Model
    Juan, Rongshun
    Ju, Hao
    Huang, Jie
    Gomez, Randy
    Nakamura, Keisuke
    Li, Guangliang
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 7212 - 7218