Meta Reinforcement Learning of Locomotion Policy for Quadruped Robots With Motor Stuck

被引:0
|
作者
Chen, Ci [1 ,2 ]
Li, Chao [3 ]
Lu, Haojian [1 ,2 ]
Wang, Yue [1 ,2 ]
Xiong, Rong [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Ind Control & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou 310027, Peoples R China
[3] DeepRobot Co, Hangzhou 310058, Peoples R China
关键词
Meta reinforcement learning; quadruped robots; fault tolerance; FAULT-TOLERANT GAITS;
D O I
10.1109/TASE.2024.3424328
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Significant progress has been made in enhancing the motion capabilities of quadruped robots in unstructured environments due to advancements in hardware and control algorithms. However, limited research has been conducted on the fault-tolerant control of quadruped robots, which is crucial for their operation in remote or extreme environments like disaster sites. In this paper, we primarily focus on fault-tolerant strategies for common joint-stuck situations. By leveraging the static stability of quadruped robots, it becomes possible to adjust their control policies and enable them to continue following predetermined trajectories. We introduce a contextual meta-reinforcement learning (Meta-RL) method to design fault-tolerant policies. This method infers task-related latent vectors from the context to assist in training the policy network, ensuring both conciseness and optimality in various situations. Additionally, to expedite algorithm training, we propose a reference action generator (RAG). To validate the proposed algorithm, extensive simulations and physical experiments are conducted. The results demonstrate that our method allows the robot to maintain its trajectory even when faced with motor locking. Furthermore, our method outperforms all baseline algorithms, highlighting its superiority in terms of fault tolerance. Note to Practitioners-The motivation of this article is to provide fault-tolerant policies for quadruped robots, specifically referring to the policies for joint-stuck situations. Previous fault-tolerant strategies either require individually designing control strategies for each joint stuck task, which brings a significant workload to designers, or adopting a unified strategy that cannot provide the optimal strategy for each task. In this article, we utilize the Meta-RL method to handle the joint stuck issue in robots for the first time. By combining the context encoder and RAG, we can provide more suitable policies for various motor-stuck tasks. Both the simulation and physical experiments validate the effectiveness and applicability of this method.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Policy gradient reinforcement learning for fast quadrupedal locomotion
    Kohl, N
    Stone, P
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 2619 - 2624
  • [22] Autonomous learning of stable quadruped locomotion
    Saggar, Manish
    D'Silva, Thomas
    Kohl, Nate
    Stone, Peter
    [J]. ROBOCUP 2006: ROBOT SOCCER WORLD CUP X, 2007, 4434 : 98 - +
  • [23] Adaptive Locomotion Learning for Quadruped Robots by Combining DRL with a Cosine Oscillator Based Rhythm Controller
    Zhang, Xiaoping
    Wu, Yitong
    Wang, Huijiang
    Iida, Fumiya
    Wang, Li
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [24] A learning-based control pipeline for generic motor skills for quadruped robots
    Shao, Yecheng
    Jin, Yongbin
    Huang, Zhilong
    Wang, Hongtao
    Yang, Wei
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2024, 25 (06): : 443 - 454
  • [25] Online Hierarchical Planning for Multicontact Locomotion Control of Quadruped Robots
    Sun, Hao
    Yang, Junjie
    Jia, Yinghao
    Wang, Changhong
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024,
  • [26] FT-Net: Learning Failure Recovery and Fault-Tolerant Locomotion for Quadruped Robots
    Luo, Zeren
    Xiao, Erdong
    Lu, Peng
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8414 - 8421
  • [27] Slope Handling for Quadruped Robots Using Deep Reinforcement Learning and Toe Trajectory Planning
    Mastrogeorgiou, Athanasios S.
    Elbahrawy, Yehia S.
    Kecskemethy, Andres
    Papadopoulos, Evangelos G.
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 3777 - 3782
  • [28] Leg Locomotion Adaption for Quadruped Robots with Ground Compliance Estimation
    Zhang, Songyuan
    Zhang, Hongji
    Fu, Yili
    [J]. APPLIED BIONICS AND BIOMECHANICS, 2020, 2020
  • [29] Locomotion control of quadruped robots based on workspace trajectory modulations
    School of Electronics and Information Engineering, Tongji University, Ministry of Education, Shanghai 201804, China
    不详
    不详
    [J]. Int J Rob Autom, 2012, 4 (345-354):
  • [30] LOCOMOTION CONTROL OF QUADRUPED ROBOTS BASED ON WORKSPACE TRAJECTORY MODULATIONS
    Liu, Chengju J.
    Wang, Danwei W.
    Chen, Qijun J.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2012, 27 (04): : 345 - 354