Cross-overlapping Hierarchical Reinforcement Learning in Humanoid Robots

被引:0
|
作者
Chen, Kuihan [1 ]
Liang, Zhiwei [1 ]
Liang, Wenzhao [1 ]
Zhou, Huijie [1 ]
Chen, Li [1 ]
Qin, Shiyan [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Peoples R China
关键词
RoboCup; Soccer robots; Cross-overlapping Hierarchical Reinforcement Learning; baseline-based optimization techniques; optimization framework;
D O I
10.1109/CCDC52312.2021.9602590
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the RoboCup3D project, how to make the humanoid robot with faster running speed and more accurate kicking action is a popular research direction. In this paper, we extend the Overlapping Layered Learning method by proposing a cross-overlapping hierarchical reinforcement learning method, which is based on overlapping layered learning to smooth the action articulation by cross-learning the articulated action parameters or cross-learning the higher-level action parameters to obtain better action execution. The article also introduces the baseline-based optimization technique and elaborates the specific optimization strategy and optimization task. Finally, the effectiveness of cross-overlapping hierarchical reinforcement learning and baseline-based optimization techniques is demonstrated experimentally.
引用
收藏
页码:3340 / 3345
页数:6
相关论文
共 50 条
  • [1] Reinforcement learning for motion control of humanoid robots
    [J]. Iida, S. (iida@ics.nitech.ac.jp), 2004, Institute of Electrical and Electronics Engineers, IEEE; Robotics Society of Japan, RSJ (Institute of Electrical and Electronics Engineers Inc.):
  • [2] Learning to Move an Object by the Humanoid Robots by Using Deep Reinforcement Learning
    Aslan, Simge Nur
    Tasci, Burak
    Ucar, Aysegul
    Guzelis, Cuneyt
    [J]. INTELLIGENT ENVIRONMENTS 2021, 2021, 29 : 143 - 155
  • [3] Motivated Reinforcement Learning for Improved Head Actuation of Humanoid Robots
    Fountain, Jake
    Walker, Josiah
    Budden, David
    Mendes, Alexandre
    Chalup, Stephan K.
    [J]. ROBOCUP 2013: ROBOT WORLD CUP XVII, 2014, 8371 : 268 - 279
  • [4] Analysis of Cost Functions for Reinforcement Learning of Reaching Tasks in Humanoid Robots
    Savevska, Kristina
    Ude, Ales
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [5] Visual Navigation for Biped Humanoid Robots Using Deep Reinforcement Learning
    Lobos-Tsunekawa, Kenzo
    Leiva, Francisco
    Ruiz-del-Solar, Javier
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 3247 - 3254
  • [6] Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning
    Duśko M. Katić
    Aleksandar D. Rodić
    Miomir K. Vukobratović
    [J]. Journal of Intelligent and Robotic Systems, 2008, 51 : 3 - 30
  • [7] Hybrid dynamic control algorithm for humanoid robots based on reinforcement learning
    Katic, Dusko M.
    Rodic, Aleksandar D.
    Vukobratovic, Miomir K.
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2008, 51 (01) : 3 - 30
  • [8] Nonstrict Hierarchical Reinforcement Learning for Interactive Systems and Robots
    Cuayahuitl, Heriberto
    Kruijff-Korbayova, Ivana
    Dethlefs, Nina
    [J]. ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2014, 4 (03)
  • [9] Reinforcement learning with imitative behaviors for humanoid robots navigation: synchronous planning and control
    Wang, Xiaoying
    Zhang, Tong
    [J]. AUTONOMOUS ROBOTS, 2024, 48 (02)
  • [10] DDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots
    Chun, Yeonghun
    Choi, Junghun
    Min, Injoon
    Ahn, Minsung
    Han, Jeakweon
    [J]. 2023 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION, SII, 2023,