Continual Reinforcement Learning for Quadruped Robot Locomotion

被引:1
|
作者
Gai, Sibo [1 ,2 ]
Lyu, Shangke [2 ]
Zhang, Hongyin [2 ]
Wang, Donglin [2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Westlake Univ, Sch Engineer, Hangzhou 310030, Peoples R China
关键词
continual learning; quadruped robot locomotion; reinforcement learning; plasticity; entropy;
D O I
10.3390/e26010093
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The ability to learn continuously is crucial for a robot to achieve a high level of intelligence and autonomy. In this paper, we consider continual reinforcement learning (RL) for quadruped robots, which includes the ability to continuously learn sub-sequential tasks (plasticity) and maintain performance on previous tasks (stability). The policy obtained by the proposed method enables robots to learn multiple tasks sequentially, while overcoming both catastrophic forgetting and loss of plasticity. At the same time, it achieves the above goals with as little modification to the original RL learning process as possible. The proposed method uses the Piggyback algorithm to select protected parameters for each task, and reinitializes the unused parameters to increase plasticity. Meanwhile, we encourage the policy network exploring by encouraging the entropy of the soft network of the policy network. Our experiments show that traditional continual learning algorithms cannot perform well on robot locomotion problems, and our algorithm is more stable and less disruptive to the RL training progress. Several robot locomotion experiments validate the effectiveness of our method.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Reinforcement Learning for Quadruped Locomotion
    Zhao, Kangqiao
    Lin, Feng
    Seah, Hock Soon
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 167 - 177
  • [2] Hierarchical Reinforcement Learning for Quadruped Locomotion
    Jain, Deepali
    Iscen, Atil
    Caluwaerts, Ken
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
  • [3] Quadruped Robot Locomotion in Unknown Terrain Using Deep Reinforcement
    Pei, Muleilan
    Wu, Dongping
    Wang, Changhong
    [J]. PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 517 - 522
  • [4] A Hierarchical Framework for Quadruped Locomotion Based on Reinforcement Learning
    Tan, Wenhao
    Fang, Xing
    Zhang, Wei
    Song, Ran
    Chen, Teng
    Zheng, Yu
    Li, Yibin
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 8462 - 8468
  • [5] CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion
    Luo, Ying-Sheng
    Soeseno, Jonathan Hans
    Chen, Trista Pei-Chun
    Chen, Wei-Chao
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
  • [6] Learning Agile, Robust Locomotion Skills for Quadruped Robot
    Zhang, Zhitong
    Wei, Qing
    Chang, Xu
    Lang, Ling
    Ma, Hongxu
    An, Honglei
    [J]. 2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 1011 - 1016
  • [7] Reinforcement Learning based Control of a Quadruped Robot
    Ancy, A.
    Jisha, V. R.
    [J]. 2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [8] Learning to Navigate for Mobile Robot with Continual Reinforcement Learning
    Wang, Ning
    Zhang, Dingyuan
    Wang, Yong
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3701 - 3706
  • [9] A Hierarchical Framework for Quadruped Omnidirectional Locomotion Based on Reinforcement Learning
    Tan, Wenhao
    Fang, Xing
    Zhang, Wei
    Song, Ran
    Chen, Teng
    Zheng, Yu
    Li, Yibin
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, : 1 - 12
  • [10] Realizing the quadruped robot walking by using reinforcement learning
    Murao, H
    Tamaki, H
    Kitamura, S
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVII, PROCEEDINGS: INDUSTRIAL SYSTEMS AND ENGINEERING III, 2002, : 240 - 243