Meta Reinforcement Learning of Locomotion Policy for Quadruped Robots With Motor Stuck

被引:0
|
作者
Chen, Ci [1 ,2 ]
Li, Chao [3 ]
Lu, Haojian [1 ,2 ]
Wang, Yue [1 ,2 ]
Xiong, Rong [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Ind Control & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou 310027, Peoples R China
[3] DeepRobot Co, Hangzhou 310058, Peoples R China
关键词
Meta reinforcement learning; quadruped robots; fault tolerance; FAULT-TOLERANT GAITS;
D O I
10.1109/TASE.2024.3424328
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Significant progress has been made in enhancing the motion capabilities of quadruped robots in unstructured environments due to advancements in hardware and control algorithms. However, limited research has been conducted on the fault-tolerant control of quadruped robots, which is crucial for their operation in remote or extreme environments like disaster sites. In this paper, we primarily focus on fault-tolerant strategies for common joint-stuck situations. By leveraging the static stability of quadruped robots, it becomes possible to adjust their control policies and enable them to continue following predetermined trajectories. We introduce a contextual meta-reinforcement learning (Meta-RL) method to design fault-tolerant policies. This method infers task-related latent vectors from the context to assist in training the policy network, ensuring both conciseness and optimality in various situations. Additionally, to expedite algorithm training, we propose a reference action generator (RAG). To validate the proposed algorithm, extensive simulations and physical experiments are conducted. The results demonstrate that our method allows the robot to maintain its trajectory even when faced with motor locking. Furthermore, our method outperforms all baseline algorithms, highlighting its superiority in terms of fault tolerance. Note to Practitioners-The motivation of this article is to provide fault-tolerant policies for quadruped robots, specifically referring to the policies for joint-stuck situations. Previous fault-tolerant strategies either require individually designing control strategies for each joint stuck task, which brings a significant workload to designers, or adopting a unified strategy that cannot provide the optimal strategy for each task. In this article, we utilize the Meta-RL method to handle the joint stuck issue in robots for the first time. By combining the context encoder and RAG, we can provide more suitable policies for various motor-stuck tasks. Both the simulation and physical experiments validate the effectiveness and applicability of this method.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Reinforcement Learning for Quadruped Locomotion
    Zhao, Kangqiao
    Lin, Feng
    Seah, Hock Soon
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 167 - 177
  • [2] Hierarchical Reinforcement Learning for Quadruped Locomotion
    Jain, Deepali
    Iscen, Atil
    Caluwaerts, Ken
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
  • [3] Learning Smooth and Omnidirectional Locomotion for Quadruped Robots
    Wu, Jiaxi
    Wang, Chen'an
    Zhang, Dianmin
    Zhong, Shanlin
    Wang, Boxing
    Qiao, Hong
    [J]. 2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 633 - 638
  • [4] Continual Reinforcement Learning for Quadruped Robot Locomotion
    Gai, Sibo
    Lyu, Shangke
    Zhang, Hongyin
    Wang, Donglin
    [J]. ENTROPY, 2024, 26 (01)
  • [5] A Hierarchical Framework for Quadruped Locomotion Based on Reinforcement Learning
    Tan, Wenhao
    Fang, Xing
    Zhang, Wei
    Song, Ran
    Chen, Teng
    Zheng, Yu
    Li, Yibin
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 8462 - 8468
  • [6] CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion
    Luo, Ying-Sheng
    Soeseno, Jonathan Hans
    Chen, Trista Pei-Chun
    Chen, Wei-Chao
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
  • [7] PSTO: Learning Energy-Efficient Locomotion for Quadruped Robots
    Zhu, Wangshu
    Rosendo, Andre
    [J]. MACHINES, 2022, 10 (03)
  • [8] Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
    Tan, Jie
    Zhang, Tingnan
    Coumans, Erwin
    Iscen, Atil
    Bai, Yunfei
    Hafner, Danijar
    Bohez, Steven
    Vanhoucke, Vincent
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [9] Policy gradient learning for quadruped soccer robots
    Cherubini, A.
    Giannone, F.
    Iocchi, L.
    Nardi, D.
    Palamara, P. F.
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2010, 58 (07) : 872 - 878
  • [10] A Hierarchical Framework for Quadruped Omnidirectional Locomotion Based on Reinforcement Learning
    Tan, Wenhao
    Fang, Xing
    Zhang, Wei
    Song, Ran
    Chen, Teng
    Zheng, Yu
    Li, Yibin
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, : 1 - 12