Hierarchical reinforcement learning with adaptive scheduling for robot control

Cited by: 4
Authors
Huang, Zhigang [1]
Liu, Quan [1]
Zhu, Fei [1]
Affiliations
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Hierarchical reinforcement learning; Exploration and exploitation; Scheduling; Sparse reward
DOI
10.1016/j.engappai.2023.107130
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Conventional hierarchical reinforcement learning (HRL) relies on discrete options to represent explicitly distinguishable knowledge, which may lead to severe performance bottlenecks. Continuous options can represent richer knowledge, but reliable methods for scheduling them are lacking. This paper proposes the hierarchical reinforcement learning with adaptive scheduling (HAS) algorithm as a practical scheduling method for continuous options. Its low-level controller learns diverse options, while its high-level controller schedules those options to learn solutions. HAS adaptively balances exploration and exploitation during the frequent scheduling of continuous options, maximizing their representational potential. It builds on multi-step static scheduling and makes switching decisions according to the relative advantages of the previous option and the newly estimated continuous option, enabling the agent to focus on different behaviors at different phases of the task. The expected t-step distance is used to demonstrate the superiority of adaptive scheduling in terms of exploration. Furthermore, an annealing-based interruption incentive is proposed to alleviate excessive exploration during the early training phase, accelerating convergence. Finally, HAS is applied to robot control with sparse rewards in continuous spaces, and a comprehensive experimental analysis scheme is developed. The experimental results demonstrate the high performance and robustness of HAS and provide evidence that adaptive scheduling has a positive effect on both the representation and option policies.
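The switching rule described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the linear annealing schedule, the initial bonus of 0.5, and the assumption that the incentive is added in favor of switching are all hypothetical choices made here for illustration. The abstract states only that switching depends on the relative advantages of the previous and the estimated option, and that an annealed interruption incentive is used.

```python
def annealed_incentive(step, total_steps, initial=0.5):
    """Hypothetical bonus that decays linearly from `initial` to 0.

    Assumes total_steps > 0. The linear schedule is an assumption;
    the abstract says only that the incentive is based on annealing.
    """
    frac = min(max(step / total_steps, 0.0), 1.0)
    return initial * (1.0 - frac)


def should_switch(value_previous, value_candidate, step, total_steps, initial=0.5):
    """Decide whether to interrupt the previous option.

    Assumption: the agent switches when the candidate option's estimated
    value plus the annealed bonus exceeds the previous option's value.
    """
    bonus = annealed_incentive(step, total_steps, initial)
    return value_candidate + bonus > value_previous


# Early in training the bonus can tip the decision toward switching;
# once it has annealed to zero, only the value comparison remains.
early = should_switch(1.0, 0.8, step=0, total_steps=100)
late = should_switch(1.0, 0.8, step=100, total_steps=100)
```

Under these assumptions, the same pair of value estimates leads to a switch at the start of training and to no switch once the bonus has decayed, which is one way to picture an incentive that matters only in the early training phase.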
Pages: 15