Cross-overlapping Hierarchical Reinforcement Learning in Humanoid Robots

Cited by: 0

Authors:

Chen, Kuihan [1]

Liang, Zhiwei [1]

Liang, Wenzhao [1]

Zhou, Huijie [1]

Chen, Li [1]

Qin, Shiyan [1]

Affiliations:

[1] Nanjing University of Posts and Telecommunications, College of Automation, Nanjing 210046, People's Republic of China

Source:

Proceedings of the 33rd Chinese Control and Decision Conference (CCDC 2021) | 2021

Keywords:

RoboCup; Soccer robots; Cross-overlapping Hierarchical Reinforcement Learning; baseline-based optimization techniques; optimization framework

DOI:

10.1109/CCDC52312.2021.9602590

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

In the RoboCup3D project, making humanoid robots run faster and kick more accurately is a popular research direction. This paper extends the Overlapping Layered Learning method by proposing a cross-overlapping hierarchical reinforcement learning method. Building on overlapping layered learning, the method smooths the transitions between actions by cross-learning either the parameters of the connecting actions or the parameters of the higher-level actions, which yields better action execution. The paper also introduces a baseline-based optimization technique and describes its specific optimization strategy and optimization tasks. Finally, experiments demonstrate the effectiveness of both cross-overlapping hierarchical reinforcement learning and the baseline-based optimization technique.


Pages: 3340-3345

Page count: 6
