Continual Reinforcement Learning for Quadruped Robot Locomotion

被引:3
|
作者
Gai, Sibo [1 ,2 ]
Lyu, Shangke [2 ]
Zhang, Hongyin [2 ]
Wang, Donglin [2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Westlake Univ, Sch Engineer, Hangzhou 310030, Peoples R China
关键词
continual learning; quadruped robot locomotion; reinforcement learning; plasticity; entropy;
D O I
10.3390/e26010093
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The ability to learn continuously is crucial for a robot to achieve a high level of intelligence and autonomy. In this paper, we consider continual reinforcement learning (RL) for quadruped robots, which includes the ability to continuously learn sub-sequential tasks (plasticity) and maintain performance on previous tasks (stability). The policy obtained by the proposed method enables robots to learn multiple tasks sequentially, while overcoming both catastrophic forgetting and loss of plasticity. At the same time, it achieves the above goals with as little modification to the original RL learning process as possible. The proposed method uses the Piggyback algorithm to select protected parameters for each task, and reinitializes the unused parameters to increase plasticity. Meanwhile, we encourage the policy network exploring by encouraging the entropy of the soft network of the policy network. Our experiments show that traditional continual learning algorithms cannot perform well on robot locomotion problems, and our algorithm is more stable and less disruptive to the RL training progress. Several robot locomotion experiments validate the effectiveness of our method.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Walking pattern acquisition for quadruped robot by using modular reinforcement learning
    Murao, H
    Tamaki, H
    Kitamura, S
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 1402 - 1405
  • [32] Research on Foothold Optimization of the Quadruped Crawling Robot based on Reinforcement Learning
    Liu X.
    Wang P.
    Dong R.
    Recent Patents on Mechanical Engineering, 2024, 17 (01) : 11 - 22
  • [33] Autonomous learning of stable quadruped locomotion
    Saggar, Manish
    D'Silva, Thomas
    Kohl, Nate
    Stone, Peter
    ROBOCUP 2006: ROBOT SOCCER WORLD CUP X, 2007, 4434 : 98 - +
  • [34] Comparative Analysis of Reinforcement Learning Algorithms for Bipedal Robot Locomotion
    Aydogmus, Omur
    Yilmaz, Musa
    IEEE ACCESS, 2024, 12 : 7490 - 7499
  • [35] Deep Reinforcement Learning Framework for Underwater Locomotion of Soft Robot
    Li, Guanda
    Shintake, Jun
    Hayashibe, Mitsuhiro
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 12033 - 12039
  • [36] Automated Hyperparameter Tuning in Reinforcement Learning for Quadrupedal Robot Locomotion
    Kim, Myeongseop
    Kim, Jung-Su
    Park, Jae-Han
    ELECTRONICS, 2024, 13 (01)
  • [37] Dynamic Locomotion with a Wheeled-Legged Quadruped Robot
    Sharf, I.
    BRAIN, BODY AND MACHINE, 2010, 83 : 299 - 310
  • [38] Energy Consumption Minimization of Quadruped Robot Based on Reinforcement Learning of DDPG Algorithm
    Yan, Zhenzhuo
    Ji, Hongwei
    Chang, Qing
    ACTUATORS, 2024, 13 (01)
  • [39] Locomotion Planning for Quadruped Robot Over Rough Terrain
    Wang, Zhongyuan
    Sun, Caiming
    Deng, Ganyu
    Zhang, Aidong
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 3170 - 3173
  • [40] Locomotion Control With Slip Detection for Quadruped Robot, PongBot
    Kim, Kyung-Hwan
    Kim, Jung-Yup
    International Journal of Control, Automation and Systems, 2024, 22 (12) : 3744 - 3752