Hierarchical reinforcement learning with adaptive scheduling for robot control

Cited by: 4

Authors:

Huang, Zhigang [1]

Liu, Quan [1]

Zhu, Fei [1]

Affiliation:

[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023 / Vol. 126

Funding:

National Natural Science Foundation of China;

Keywords:

Hierarchical reinforcement learning; Exploration and exploitation; Scheduling; Sparse reward;

DOI:

10.1016/j.engappai.2023.107130

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Conventional hierarchical reinforcement learning (HRL) relies on discrete options to represent explicitly distinguishable knowledge, which may lead to severe performance bottlenecks. It is possible to represent richer knowledge through continuous options, but reliable scheduling methods are lacking. To design an available scheduling method for continuous options, in this paper, the hierarchical reinforcement learning with adaptive scheduling (HAS) algorithm is proposed. Its low-level controller learns diverse options, while the high-level controller schedules options to learn solutions. It achieves an adaptive balance between exploration and exploitation during the frequent scheduling of continuous options, maximizing the representation potential of continuous options. It builds on multi-step static scheduling and makes switching decisions according to the relative advantages of the previous and the estimated continuous options, enabling the agent to focus on different behaviors at different phases of the task. The expected t-step distance is applied to demonstrate the superiority of adaptive scheduling in terms of exploration. Furthermore, an interruption incentive based on annealing is proposed to alleviate excessive exploration during the early training phase, accelerating the convergence rate. Finally, we apply HAS to robot control with sparse rewards in continuous spaces, and develop a comprehensive experimental analysis scheme. The experimental results not only demonstrate the high performance and robustness of HAS, but also provide evidence that the adaptive scheduling method has a positive effect both on the representation and option policies.
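The switching idea in the abstract can be illustrated with a minimal sketch. It is not the HAS algorithm itself: the abstract gives no formulas, so the linear annealing schedule, the initial bonus of 0.5, the choice to add the incentive to the candidate option's side, and the names `annealed_bonus` and `should_switch` are all assumptions made for illustration. The sketch compares the value of the option currently running with the value of a newly estimated option and interrupts when the relative advantage, plus an incentive that decays to zero over training, is positive.

```python
def annealed_bonus(step: int, total_steps: int, initial: float = 0.5) -> float:
    """Hypothetical linear annealing: equals `initial` at step 0 and 0 at `total_steps`."""
    progress = min(max(step / total_steps, 0.0), 1.0)
    return initial * (1.0 - progress)


def should_switch(q_previous: float, q_candidate: float,
                  step: int, total_steps: int) -> bool:
    """Decide whether to interrupt the running option (illustrative rule only).

    q_previous:  estimated value of continuing the option chosen earlier.
    q_candidate: estimated value of the newly proposed option.
    The sign and form of the incentive are assumptions, not taken from the paper.
    """
    advantage = q_candidate - q_previous
    return advantage + annealed_bonus(step, total_steps) > 0.0


if __name__ == "__main__":
    # Early in training the bonus can outweigh a small disadvantage.
    print(should_switch(1.0, 0.8, step=0, total_steps=100))
    # Late in training only a genuine advantage triggers a switch.
    print(should_switch(1.0, 0.8, step=100, total_steps=100))
```

Under these assumptions the decision reduces to a plain comparison of the two value estimates once the bonus has annealed to zero.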

Pages: 15
