Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning

被引:0
|
作者
Li, Tianyu [1 ]
Lambert, Nathan [2 ]
Calandra, Roberto [1 ]
Meier, Franziska [1 ]
Rai, Akshara [1 ]
机构
[1] Facebook, Menlo Pk, CA 94025 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Facebook AI Res, Berkeley, CA 94720 USA
关键词
WALKING; GENERATION; FRAMEWORK; MODEL;
D O I
10.1109/icra40945.2020.9196642
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning to locomote to arbitrary goals on hardware remains a challenging problem for reinforcement learning. In this paper, we present a hierarchical framework that improves sample-efficiency and generalizability of learned locomotion skills on real-world robots. Our approach divides the problem of goal-oriented locomotion into two sub-problems: learning diverse primitives skills, and using model-based planning to sequence these skills. We parametrize our primitives as cyclic movements, improving sample-efficiency of learning from scratch on a 18 degrees of freedom robot. Then, we learn coarse dynamics models over primitive cycles and use them in a model predictive control framework. This allows us to learn to walk to arbitrary goals up to 12m away, after about two hours of training from scratch on hardware. Our results on a Daisy hexapod hardware and simulation demonstrate the efficacy of our approach at reaching distant targets, in different environments, and with sensory noise.
引用
收藏
页码:413 / 419
页数:7
相关论文
共 50 条
  • [1] DeepLoco: Dynamic Locomotion Skills Using Hierarchical Deep Reinforcement Learning
    Peng, Xue Bin
    Berseth, Glen
    Yin, Kangkang
    Van De Panne, Michiel
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
  • [2] Hierarchical reinforcement learning for biped locomotion
    Sugimoto, Norikazu
    Hyon, Sang-Ho
    Morimoto, Jun
    [J]. NEUROSCIENCE RESEARCH, 2009, 65 : S183 - S183
  • [3] Hierarchical Reinforcement Learning for Quadruped Locomotion
    Jain, Deepali
    Iscen, Atil
    Caluwaerts, Ken
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
  • [4] Evaluating skills in hierarchical reinforcement learning
    Marzieh Davoodabadi Farahani
    Nasser Mozayani
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 2407 - 2420
  • [5] Evaluating skills in hierarchical reinforcement learning
    Farahani, Marzieh Davoodabadi
    Mozayani, Nasser
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (10) : 2407 - 2420
  • [6] A Hierarchical Framework for Quadruped Locomotion Based on Reinforcement Learning
    Tan, Wenhao
    Fang, Xing
    Zhang, Wei
    Song, Ran
    Chen, Teng
    Zheng, Yu
    Li, Yibin
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 8462 - 8468
  • [7] Discovering and Exploiting Skills in Hierarchical Reinforcement Learning
    Huang, Zhigang
    [J]. IEEE Access, 2024, 12 : 163042 - 163055
  • [8] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
    Wei, Lang
    Li, Yunxiang
    Ai, Yunfei
    Wu, Yuze
    Xu, Hao
    Wang, Wei
    [J]. INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (9) : 1599 - 1613
  • [9] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
    Lang Wei
    Yunxiang Li
    Yunfei Ai
    Yuze Wu
    Hao Xu
    Wei Wang
    Guoming Hu
    [J]. International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1599 - 1613
  • [10] Learning Generalizable Pivoting Skills
    Zhang, Xiang
    Jain, Siddarth
    Huang, Baichuan
    Tomizuka, Masayoshi
    Romeres, Diego
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5865 - 5871