Continual Reinforcement Learning for Quadruped Robot Locomotion

被引:3
|
作者
Gai, Sibo [1 ,2 ]
Lyu, Shangke [2 ]
Zhang, Hongyin [2 ]
Wang, Donglin [2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Westlake Univ, Sch Engineer, Hangzhou 310030, Peoples R China
关键词
continual learning; quadruped robot locomotion; reinforcement learning; plasticity; entropy;
D O I
10.3390/e26010093
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The ability to learn continuously is crucial for a robot to achieve a high level of intelligence and autonomy. In this paper, we consider continual reinforcement learning (RL) for quadruped robots, which includes the ability to continuously learn sub-sequential tasks (plasticity) and maintain performance on previous tasks (stability). The policy obtained by the proposed method enables robots to learn multiple tasks sequentially, while overcoming both catastrophic forgetting and loss of plasticity. At the same time, it achieves the above goals with as little modification to the original RL learning process as possible. The proposed method uses the Piggyback algorithm to select protected parameters for each task, and reinitializes the unused parameters to increase plasticity. Meanwhile, we encourage the policy network exploring by encouraging the entropy of the soft network of the policy network. Our experiments show that traditional continual learning algorithms cannot perform well on robot locomotion problems, and our algorithm is more stable and less disruptive to the RL training progress. Several robot locomotion experiments validate the effectiveness of our method.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Reinforcement learning for quadrupedal locomotion with design of continual-hierarchical curriculum
    Kobayashi, Taisuke
    Sugino, Toshiki
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 95 (95)
  • [22] Generalization of movements in quadruped robot locomotion by learning specialized motion data
    Yamamoto, Hiroki
    Kim, Sungi
    Ishii, Yuichiro
    Ikemoto, Yusuke
    ROBOMECH JOURNAL, 2020, 7 (01):
  • [23] Model Predictive Control of Quadruped Robot Based on Reinforcement Learning
    Zhang, Zhitong
    Chang, Xu
    Ma, Hongxu
    An, Honglei
    Lang, Lin
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [24] A strategy for push recovery in quadruped Robot based on reinforcement Learning
    Chen, Yang-zhen
    Hou, Wen-Qi
    Wang, Jian
    Wang, Jian-Wen
    Ma, Hong-xu
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3145 - 3151
  • [25] CPG Driven Locomotion Control of Quadruped Robot
    Liu, Chengju
    Chen, Yifei
    Zhang, Jiaqi
    Chen, Qijun
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2368 - 2373
  • [26] FEEDBACK CONTROL OF THE LOCOMOTION OF A TAILED QUADRUPED ROBOT
    Liu, Yujiong
    Ben-Tzvi, Pinhas
    PROCEEDINGS OF ASME 2021 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2021, VOL 8B, 2021,
  • [27] The Quadruped Robot Locomotion Based on Force Control
    Zhang, Xianpeng
    Lang, Lin
    Wang, Jian
    Ma, Hongxu
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 5440 - 5445
  • [28] Analysis, Prototyping and Locomotion Control of a Quadruped Robot
    Souza, Lucas
    Mohr, Felipe
    Alencar, Brenda
    2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, : 129 - 134
  • [29] Controlling the Solo12 quadruped robot with deep reinforcement learning
    Aractingi, Michel
    Leziart, Pierre-Alexandre
    Flayols, Thomas
    Perez, Julien
    Silander, Tomi
    Soueres, Philippe
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [30] Controlling the Solo12 quadruped robot with deep reinforcement learning
    Michel Aractingi
    Pierre-Alexandre Léziart
    Thomas Flayols
    Julien Perez
    Tomi Silander
    Philippe Souères
    Scientific Reports, 13