Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning

被引:131
|
作者
Morimoto, J
Doya, K
机构
[1] JST, ERATO, Kawato Dynam Brain Project, Kyoto 6190288, Japan
[2] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300101, Japan
[3] JST, CREST, ATR Int, Kyoto 6190288, Japan
关键词
reinforcement learning; hierarchical; real robot; stand-up; motor control;
D O I
10.1016/S0921-8890(01)00113-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a hierarchical reinforcement learning architecture that realizes practical learning speed in real hardware control tasks. In order to enable learning in a practical number of trials, we introduce a low-dimensional representation of the state of the robot for higher-level planning. The upper level learns a discrete sequence of sub-goals in a low-dimensional state space for achieving the main goal of the task. The lower-level modules learn local trajectories in the original high-dimensional state space to achieve the sub-goal specified by the upper level. We applied the hierarchical architecture to a three-link, two-joint robot for the task of learning to stand up by trial and error. The upper-level learning was implemented by Q-learning, while the lower-level learning was implemented by a continuous actor-critic method. The robot successfully learned to stand up within 750 trials in simulation and then in an additional 170 trials using real hardware. The effects of the setting of the search steps in the upper level and the use of a supplementary reward for achieving sub-goals are also tested in simulation. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
下载
收藏
页码:37 / 51
页数:15
相关论文
共 50 条
  • [31] Vision-guided behavior acquisition of a mobile robot by multi-layered reinforcement learning
    Takahashi, Y
    Asada, M
    2000 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2000), VOLS 1-3, PROCEEDINGS, 2000, : 395 - 402
  • [32] Adaptive Robot Behavior Based on Human Comfort Using Reinforcement Learning
    Gonzalez-Santocildes, Asier
    Vazquez, Juan-Ignacio
    Eguiluz, Andoni
    IEEE ACCESS, 2024, 12 : 122289 - 122299
  • [33] Generation of a Socially Aware Behavior of a Guide Robot Using Reinforcement Learning
    Dewantara, Bima Sena Bayu
    Miura, Jun
    2016 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2016, : 105 - 110
  • [34] WILL REAL DATA ACQUISITION-SYSTEM PLEASE STAND UP
    ANDREIEV, N
    CONTROL ENGINEERING, 1977, 24 (11) : 68 - 71
  • [35] Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning
    Rungger, Matthias
    Ding, Hao
    Stursberg, Olaf
    ANTICIPATORY BEHAVIOR IN ADAPTIVE LEARNING SYSTEMS: FROM PSYCHOLOGICAL THEORIES TO ARTIFICIAL COGNITIVE SYSTEMS, 2009, 5499 : 301 - 320
  • [36] Accelerated Robot Skill Acquisition by Reinforcement Learning-Aided Sim-to-Real Domain Adaptation
    Loncarcvic, Zvezdan
    Ude, Ales
    Gams, Andrej
    2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 269 - 274
  • [37] Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration
    Zhang, Pengchen
    Xu, Xin
    Liu, Chunming
    Yuan, Qiping
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 3, PROCEEDINGS, 2009, 5553 : 278 - 288
  • [38] Expanding the Boundaries of Your Research Using Social Media: Stand-Up and Be Counted
    Kumar, M. Jagadesh
    IETE TECHNICAL REVIEW, 2014, 31 (04) : 255 - 257
  • [39] The Real Mrs. Maisel: Jean Carroll, the First Jewish Female Stand-Up Comedian
    Overbeke, Grace Kessler
    SHOFAR-AN INTERDISCIPLINARY JOURNAL OF JEWISH STUDIES, 2021, 39 (03) : 154 - 180
  • [40] Multi-robot cooperation based on hierarchical reinforcement learning
    Cheng, Xiaobei
    Shen, Jing
    Liu, Haibo
    Gu, Guochang
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 90 - +