Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning

被引：131

作者：

Morimoto, J

Doya, K

机构：

[1] JST, ERATO, Kawato Dynam Brain Project, Kyoto 6190288, Japan

[2] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300101, Japan

[3] JST, CREST, ATR Int, Kyoto 6190288, Japan

来源：

ROBOTICS AND AUTONOMOUS SYSTEMS | 2001年 / 36卷 / 01期

关键词：

reinforcement learning; hierarchical; real robot; stand-up; motor control;

D O I：

10.1016/S0921-8890(01)00113-0

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose a hierarchical reinforcement learning architecture that realizes practical learning speed in real hardware control tasks. In order to enable learning in a practical number of trials, we introduce a low-dimensional representation of the state of the robot for higher-level planning. The upper level learns a discrete sequence of sub-goals in a low-dimensional state space for achieving the main goal of the task. The lower-level modules learn local trajectories in the original high-dimensional state space to achieve the sub-goal specified by the upper level. We applied the hierarchical architecture to a three-link, two-joint robot for the task of learning to stand up by trial and error. The upper-level learning was implemented by Q-learning, while the lower-level learning was implemented by a continuous actor-critic method. The robot successfully learned to stand up within 750 trials in simulation and then in an additional 170 trials using real hardware. The effects of the setting of the search steps in the upper level and the use of a supplementary reward for achieving sub-goals are also tested in simulation. (C) 2001 Elsevier Science B.V. All rights reserved.

引用

下载

页码：37 / 51

页数：15

共 50 条

[1] Hierarchical reinforcement learning for motion learning: learning 'stand-up' trajectories
Morimoto, J
Doya, K
ADVANCED ROBOTICS, 1999, 13 (03) : 267 - 268
[2] Hierarchical reinforcement learning for motion learning: Learning `stand-up' trajectories
Morimoto, Jun
Doya, Kenji
Advanced Robotics, 1998, 13 (03): : 267 - 268
[3] A Multi-stage Approach for Efficiently Learning Humanoid Robot Stand-up Behavior
Luo, Dingsheng
Ding, Yaoxiang
Cao, Zidong
Wu, Xihong
2014 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2014), 2014, : 884 - 889
[4] Purposive behavior acquisition for a real robot by vision-based reinforcement learning
Asada, M
Noda, S
Tawaratsumida, S
Hosoda, K
MACHINE LEARNING, 1996, 23 (2-3) : 279 - 303
[5] Robot stand-up shows funny side
Heaven, Douglas
NEW SCIENTIST, 2017, 236 (3156) : 14 - 14
[6] BEHAVIOR ACQUISITION ON A MOBILE ROBOT USING REINFORCEMENT LEARNING WITH CONTINUOUS STATE SPACE
Arai, Tomoyuki
Toda, Yuichiro
Kubota, Naoyuki
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, : 458 - 461
[7] Collective Behavior Acquisition of Real Robotic Swarms using Deep Reinforcement Learning
Yasuda, Toshiyuki
Ohkura, Kazuhiro
2018 SECOND IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC), 2018, : 179 - 180
[8] Learning Complex Stand-Up Motion for Humanoid Robots
Jeong, Heejin
Lee, Daniel D.
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4218 - 4219
[9] Safe Robot Navigation Using Constrained Hierarchical Reinforcement Learning
Roza, Felippe Schmoeller
Rasheed, Hassan
Roscher, Karsten
Ning, Xiangyu
Guennemann, Stephan
2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 737 - 742
[10] DEATH AND FULFILLMENT, OR WOULD THE REAL MR-DOSTOYEVSKY STAND-UP
SUTHERLAND, S
PHILOSOPHY, 1984, : 15 - 27

← 1 2 3 4 5 →