HCS-R-HER: Hierarchical reinforcement learning based on cross subtasks rainbow hindsight experience replay

被引:1
|
作者
Zhao, Xiaotong [1 ]
Du, Jingli [1 ]
Wang, Zhihan [1 ]
机构
[1] Xidian Univ, Key Lab Elect Equipment Struct Design, Minist Educ, Xian 710071, Shaanxi, Peoples R China
关键词
Hierarchical reinforcement learning; Hindsight Experience Replay; RL; Continuum robots;
D O I
10.1016/j.jocs.2023.102113
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Sparse reward feedback from the environment is the main challenge for learning goal-oriented tasks based on reinforcement learning. The lack of sufficient exploration also leads to the inability of the agent to robustly learn strategies, especially for hierarchical task control of continuous action space continuum robots, which are more difficult to explore. In this paper, we propose a hierarchical reinforcement learning framework, HCS-R-HER, to accelerate learning by reusing empirical data across subtasks. It uses an upper-level controller, meta-controller, to integrate the underlying targets, and a set of lower-level controllers, controllers, responsible for performing atomic operations. The Oracle perspective mechanism can skip the process of unfinished subtasks, which helps speed up the learning of the meta-controller. The CS-R-HER framework is used to improve the sparsity of the data and accelerate the learning of controllers. Our approach can solve complex tasks or hierarchical tasks more effectively, especially for continuum robot motion environments in continuous action space. Our method is the first time to apply HER to data augmentation for hierarchical tasks and to implement a framework where multiple subgoals are learned together.
引用
收藏
页数:15
相关论文
共 46 条
  • [1] Deep Reinforcement Learning Based on the Hindsight Experience Replay for Autonomous Driving of Mobile Robot
    Park M.
    Hong J.S.
    Kwon N.K.
    [J]. Journal of Institute of Control, Robotics and Systems, 2022, 28 (11): : 1006 - 1012
  • [2] Double Broad Reinforcement Learning Based on Hindsight Experience Replay for Collision Avoidance of Unmanned Surface Vehicles
    Yu, Jiabao
    Chen, Jiawei
    Chen, Ying
    Zhou, Zhiguo
    Duan, Junwei
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (12)
  • [3] Knowledge Transfer for Deep Reinforcement Learning with Hierarchical Experience Replay
    Yin, Haiyan
    Pan, Sinno Jialin
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1640 - 1646
  • [4] SOFT ACTOR-CRITIC REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATOR WITH HINDSIGHT EXPERIENCE REPLAY
    Yan, Tao
    Zhang, Wenan
    Yang, Simon X.
    Yu, Li
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2019, 34 (05): : 536 - 543
  • [5] Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning
    Li, Keyu
    Lu, Ye
    Meng, Max Q. -H.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 346 - 351
  • [6] Forgetful experience replay in hierarchical reinforcement learning from expert demonstrations
    Skrynnik, Alexey
    Staroverov, Aleksey
    Aitygulov, Ermek
    Aksenov, Kirill
    Davydov, Vasilii
    Panov, Aleksandr, I
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [7] A Novel Reinforcement Learning Sampling Method Without Additional Environment Feedback in Hindsight Experience Replay
    Li, Chenxing
    Liu, Yinlong
    Bing, Zhenshan
    Seyler, Jan
    Eivazi, Shahram
    [J]. ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 462 - 473
  • [8] Hindsight Experience Replay Improves Reinforcement Learning for Control of a MIMO Musculoskeletal Model of the Human Arm
    Crowder, Douglas C.
    Abreu, Jessica
    Kirsch, Robert F.
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 1016 - 1025
  • [9] Deep Reinforcement Learning with Experience Replay Based on SARSA
    Zhao, Dongbin
    Wang, Haitao
    Shao, Kun
    Zhu, Yuanheng
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [10] Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay
    Zhou, Yi
    Liu, Zhixiang
    Shi, Huaguang
    Li, Si
    Ning, Nianwen
    Liu, Fuqiang
    Gao, Xiaozhi
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 4887 - 4898