Humanoid robot control based on reinforcement learning

被引:0
|
作者
Iida, S [1 ]
Kuwayama, K [1 ]
Kanoh, M [1 ]
Kato, S [1 ]
Kunitachi, T [1 ]
Itoh, H [1 ]
机构
[1] Nagoya Inst Technol, Dept Intelligence & Comp Sci, Showa Ku, Nagoya, Aichi 4668555, Japan
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Many existing methods of reinforcement learning have treated tasks in a discrete low dimensional state space. However, controlling humanoid robots smooth requires a continuous high-dimensional state space. In this paper, to treat the state space, we proposed an adaptive allocation method of basis functions for reinforcement learning. Up to now, grid or incremental allocation method have been proposed for allocation of basis functions. However, these methods may cause the curse of dimensionality, and fall into local minima. On the other hand, our method avoids local minima which are assessed by the trace of activity of basis functions. That is, if current state is judged to fall into a local minimum, our method eliminates a basis function which affects the state most. Moreover our method learns with a low number of basis functions because of the elimination process. To confirm the effectiveness of our method, we used a maze task to compare our method with an existing method, which has only an allocation process. Moreover, as learning of continuous high-dimensional state spaces, our method was applied to motion control of a humanoid robot. We demonstrate that our method is capable of providing better performance than the existing method.
引用
收藏
页码:353 / 358
页数:6
相关论文
共 50 条
  • [1] Locomotion Control Method for Humanoid Robot Based on United Hierarchical Reinforcement Learning
    Liu, Boying
    Ma, Lu
    Liu, Chenju
    Xu, BinChen
    [J]. 2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 1161 - 1166
  • [2] Push Recovery Control for Humanoid Robot using Reinforcement Learning
    Seo, Donghyeon
    Kim, Harin
    Kim, Donghan
    [J]. 2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 488 - 492
  • [3] Humanoid Muscle-Skeleton Robot Arm Design and Control Based on Reinforcement Learning
    Fan, Jianyin
    Jin, Jing
    Wang, Qiang
    [J]. PROCEEDINGS OF THE 15TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2020), 2020, : 541 - 546
  • [4] Generalized Model Learning for Reinforcement Learning on a Humanoid Robot
    Hester, Todd
    Quinlan, Michael
    Stone, Peter
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 2369 - 2374
  • [5] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)
  • [6] Deep Reinforcement Learning for Humanoid Robot Dribbling
    Muzio, Alexandre F., V
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. 2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 246 - 251
  • [7] A Reinforcement Learning Method for Humanoid Robot Walking
    Liu, Yunda
    Bi, Sheng
    Dong, Min
    Zhang, Yingjie
    Huang, Jialing
    Zhang, Jiawei
    [J]. 2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER), 2018, : 623 - 628
  • [8] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Alexandre F. V. Muzio
    Marcos R. O. A. Maximo
    Takashi Yoneyama
    [J]. Journal of Intelligent & Robotic Systems, 2022, 105
  • [9] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    Isaac Jesus da Silva
    Danilo Hernani Perico
    Thiago Pedro Donadon Homem
    Reinaldo Augusto da Costa Bianchi
    [J]. Journal of Intelligent & Robotic Systems, 2021, 102
  • [10] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    da Silva, Isaac Jesus
    Perico, Danilo Hernani
    Donadon Homem, Thiago Pedro
    da Costa Bianchi, Reinaldo Augusto
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (03)