Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning

Cited by: 0

Authors:

Li, Tianyu [1]

Lambert, Nathan [2]

Calandra, Roberto [1]

Meier, Franziska [1]

Rai, Akshara [1]

Affiliations:

[1] Facebook, Menlo Pk, CA 94025 USA

[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Facebook AI Res, Berkeley, CA 94720 USA

Source:

2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2020

Keywords:

WALKING; GENERATION; FRAMEWORK; MODEL;

DOI:

10.1109/icra40945.2020.9196642

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Learning to locomote to arbitrary goals on hardware remains a challenging problem for reinforcement learning. In this paper, we present a hierarchical framework that improves sample-efficiency and generalizability of learned locomotion skills on real-world robots. Our approach divides the problem of goal-oriented locomotion into two sub-problems: learning diverse primitive skills, and using model-based planning to sequence these skills. We parametrize our primitives as cyclic movements, improving sample-efficiency of learning from scratch on an 18-degree-of-freedom robot. Then, we learn coarse dynamics models over primitive cycles and use them in a model predictive control framework. This allows us to learn to walk to arbitrary goals up to 12 m away, after about two hours of training from scratch on hardware. Our results on Daisy hexapod hardware and in simulation demonstrate the efficacy of our approach at reaching distant targets, in different environments, and with sensory noise.
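The two-level idea in the abstract (a coarse model of what one primitive cycle does, used inside a model predictive control loop) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the primitive names, the hand-written displacement table standing in for a learned dynamics model, the heading resolution, the planning horizon, and the exhaustive search over primitive sequences.

```python
import itertools
import math

TURN = math.pi / 8  # assumed heading change per turning cycle

# Hypothetical coarse model: effect of one full cycle of each primitive,
# as (forward distance in metres, change in heading index).
# A learned dynamics model would replace this hand-written table.
PRIMITIVES = {
    "forward": (0.2, 0),
    "turn_left": (0.0, 1),
    "turn_right": (0.0, -1),
}


def predict(state, name):
    """Predict the pose (x, y, heading index) after one primitive cycle."""
    x, y, k = state
    dist, dk = PRIMITIVES[name]
    theta = k * TURN
    return (x + dist * math.cos(theta), y + dist * math.sin(theta), k + dk)


def distance(state, goal):
    """Euclidean distance from the pose to the goal position."""
    return math.hypot(goal[0] - state[0], goal[1] - state[1])


def plan(state, goal, horizon=4):
    """Try every primitive sequence of length `horizon`; keep the one
    whose predicted final pose is closest to the goal."""
    best_cost, best_seq = math.inf, None
    for seq in itertools.product(PRIMITIVES, repeat=horizon):
        s = state
        for name in seq:
            s = predict(s, name)
        cost = distance(s, goal)
        if cost < best_cost:
            best_cost, best_seq = cost, seq
    return best_seq


def walk_to(goal, start=(0.0, 0.0, 0), tol=0.15, max_cycles=200):
    """Receding-horizon loop: plan, run only the first primitive, replan."""
    state, executed = start, []
    for _ in range(max_cycles):
        if distance(state, goal) <= tol:
            break
        first = plan(state, goal)[0]
        state = predict(state, first)  # stand-in for running the robot
        executed.append(first)
    return state, executed
```

Calling walk_to((2.0, 0.0)) from the default start pose drives straight ahead until the goal is within the tolerance. A terminal-distance cost with exhaustive search is only one simple choice; it carries no convergence guarantee, which is why the loop is capped by max_cycles.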


Pages: 413-419

Page count: 7

50 items in total

[1] DeepLoco: Dynamic Locomotion Skills Using Hierarchical Deep Reinforcement Learning
Peng, Xue Bin
Berseth, Glen
Yin, Kangkang
Van De Panne, Michiel
[J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
[2] Hierarchical reinforcement learning for biped locomotion
Sugimoto, Norikazu
Hyon, Sang-Ho
Morimoto, Jun
[J]. NEUROSCIENCE RESEARCH, 2009, 65 : S183 - S183
[3] Hierarchical Reinforcement Learning for Quadruped Locomotion
Jain, Deepali
Iscen, Atil
Caluwaerts, Ken
[J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
[4] Evaluating skills in hierarchical reinforcement learning
Marzieh Davoodabadi Farahani
Nasser Mozayani
[J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 2407 - 2420
[5] Evaluating skills in hierarchical reinforcement learning
Farahani, Marzieh Davoodabadi
Mozayani, Nasser
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (10) : 2407 - 2420
[6] A Hierarchical Framework for Quadruped Locomotion Based on Reinforcement Learning
Tan, Wenhao
Fang, Xing
Zhang, Wei
Song, Ran
Chen, Teng
Zheng, Yu
Li, Yibin
[J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 8462 - 8468
[7] Discovering and Exploiting Skills in Hierarchical Reinforcement Learning
Huang, Zhigang
[J]. IEEE Access, 2024, 12 : 163042 - 163055
[8] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
Wei, Lang
Li, Yunxiang
Ai, Yunfei
Wu, Yuze
Xu, Hao
Wang, Wei
[J]. INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (9) : 1599 - 1613
[9] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
Lang Wei
Yunxiang Li
Yunfei Ai
Yuze Wu
Hao Xu
Wei Wang
Guoming Hu
[J]. International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1599 - 1613
[10] Learning Generalizable Pivoting Skills
Zhang, Xiang
Jain, Siddarth
Huang, Baichuan
Tomizuka, Masayoshi
Romeres, Diego
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5865 - 5871