Meta Reinforcement Learning of Locomotion Policy for Quadruped Robots With Motor Stuck

Cited by: 0

Authors:

Chen, Ci [1,2]

Li, Chao [3]

Lu, Haojian [1,2]

Wang, Yue [1,2]

Xiong, Rong [1,2]

Affiliations:

[1] Zhejiang Univ, State Key Lab Ind Control & Technol, Hangzhou 310027, Peoples R China

[2] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou 310027, Peoples R China

[3] DeepRobot Co, Hangzhou 310058, Peoples R China

Source:

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024

Keywords:

Meta reinforcement learning; quadruped robots; fault tolerance; fault-tolerant gaits

DOI:

10.1109/TASE.2024.3424328

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Significant progress has been made in enhancing the motion capabilities of quadruped robots in unstructured environments due to advancements in hardware and control algorithms. However, limited research has been conducted on the fault-tolerant control of quadruped robots, which is crucial for their operation in remote or extreme environments like disaster sites. In this paper, we primarily focus on fault-tolerant strategies for common joint-stuck situations. By leveraging the static stability of quadruped robots, it becomes possible to adjust their control policies and enable them to continue following predetermined trajectories. We introduce a contextual meta-reinforcement learning (Meta-RL) method to design fault-tolerant policies. This method infers task-related latent vectors from the context to assist in training the policy network, ensuring both conciseness and optimality in various situations. Additionally, to expedite algorithm training, we propose a reference action generator (RAG). To validate the proposed algorithm, extensive simulations and physical experiments are conducted. The results demonstrate that our method allows the robot to maintain its trajectory even when faced with motor locking. Furthermore, our method outperforms all baseline algorithms, highlighting its superiority in terms of fault tolerance.

Note to Practitioners: The motivation of this article is to provide fault-tolerant policies for quadruped robots, specifically policies for joint-stuck situations. Previous fault-tolerant strategies either require individually designing a control strategy for each joint-stuck task, which brings a significant workload to designers, or adopt a unified strategy that cannot provide the optimal strategy for each task. In this article, we utilize the Meta-RL method to handle the joint-stuck issue in robots for the first time. By combining the context encoder and RAG, we can provide more suitable policies for various motor-stuck tasks. Both the simulation and physical experiments validate the effectiveness and applicability of this method.


Pages: 15

