Continual Reinforcement Learning for Quadruped Robot Locomotion

被引:3
|
作者
Gai, Sibo [1 ,2 ]
Lyu, Shangke [2 ]
Zhang, Hongyin [2 ]
Wang, Donglin [2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Westlake Univ, Sch Engineer, Hangzhou 310030, Peoples R China
关键词
continual learning; quadruped robot locomotion; reinforcement learning; plasticity; entropy;
D O I
10.3390/e26010093
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The ability to learn continuously is crucial for a robot to achieve a high level of intelligence and autonomy. In this paper, we consider continual reinforcement learning (RL) for quadruped robots, which includes the ability to continuously learn sub-sequential tasks (plasticity) and maintain performance on previous tasks (stability). The policy obtained by the proposed method enables robots to learn multiple tasks sequentially, while overcoming both catastrophic forgetting and loss of plasticity. At the same time, it achieves the above goals with as little modification to the original RL learning process as possible. The proposed method uses the Piggyback algorithm to select protected parameters for each task, and reinitializes the unused parameters to increase plasticity. Meanwhile, we encourage the policy network exploring by encouraging the entropy of the soft network of the policy network. Our experiments show that traditional continual learning algorithms cannot perform well on robot locomotion problems, and our algorithm is more stable and less disruptive to the RL training progress. Several robot locomotion experiments validate the effectiveness of our method.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Embodiment Enables the Spinal Engine in Quadruped Robot Locomotion
    Zhao, Qian
    Nakajima, Kohei
    Sumioka, Hidenobu
    Yu, Xiaoxiang
    Pfeifer, Rolf
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 2449 - 2456
  • [42] Adaptive gait pattern control of a quadruped locomotion robot
    Tsujita, K
    Tsuchiya, K
    Onat, A
    IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4: EXPANDING THE SOCIETAL ROLE OF ROBOTICS IN THE NEXT MILLENNIUM, 2001, : 2318 - 2325
  • [43] A dynamic locomotion strategy for stair walking of a quadruped robot
    Yoon, Daekeun
    Kim, Baekchul
    Jo, Ikhee
    Jeong, Woong
    2021 18TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2021, : 223 - 227
  • [44] A study on optimal gait pattern of a quadruped locomotion robot
    Tsujita, K
    Kawakami, M
    Tsuchiya, K
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 745 - 749
  • [45] Single Leg Operational Space Control of Quadruped Robot based on Reinforcement Learning
    Rao, Jinhui
    An, Honglei
    Zhang, Taihui
    Chen, Yangzhen
    Ma, Hongxu
    2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 597 - 602
  • [46] Locomotion simulation of a quadruped robot on general level terrain
    AlZaydi, MY
    Amin, SHM
    INES'97 : 1997 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS, PROCEEDINGS, 1997, : 159 - 164
  • [47] Evolution of locomotion in a simulated quadruped robot and transferral to reality
    Glette, Kyrre
    Klaus, Gordon
    Cristobal Zagal, Juan
    Torresen, Jim
    PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12), 2012, : 1139 - 1142
  • [48] Development of a Minimalistic Pneumatic Quadruped Robot for Fast Locomotion
    Narioka, Kenichi
    Rosendo, Andre
    Sproewitz, Alexander
    Hosoda, Koh
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2012), 2012,
  • [49] Learning with a Quadruped Chopstick Robot
    Lee, Wei-Chung
    Chen, Jong-Chen
    Wu, Shou-zhe
    Lin, Kuo-Ming
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 603 - 616
  • [50] Learning-Based Locomotion Controllers for Quadruped Robots in Indoor Stair Climbing via Deep Reinforcement Learning
    Sinsukudomchai, Tanawit
    Deelertpaiboon, Chirdpong
    2024 21ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, ECTI-CON 2024, 2024,