Robotic Control in Adversarial and Sparse Reward Environments: A Robust Goal-Conditioned Reinforcement Learning Approach

被引:1
|
作者
He, Xiangkun [1 ]
Lv, Chen [1 ]
机构
[1] Nanyang Technological University, School of Mechanical and Aerospace Engineering, 639798, Singapore
来源
关键词
Deep neural networks - Job analysis - Perturbation techniques - Robotics - Uncertainty analysis;
D O I
10.1109/TAI.2023.3237665
中图分类号
学科分类号
摘要
With deep neural networks-based function approximators, reinforcement learning holds the promise of learning complex end-to-end robotic controllers that can map high-dimensional sensory information directly to control policies. However, a common challenge, especially for robotics, is sample-efficient learning from sparse rewards, in which an agent is required to find a long sequence of 'correct' actions to achieve a desired outcome. Unfortunately, inevitable perturbations on observations may make this task trickier to solve. Here, this article advances a novel robust goal-conditioned reinforcement learning approach for end-to-end robotic control in adversarial and sparse reward environments. Specifically, a mixed adversarial attack scheme is presented to generate diverse adversarial perturbations on observations by combining white-box and black-box attacks. Meanwhile, a hindsight experience replay technique considering observation perturbations is developed to turn a failed experience into a successful one and generate the policy trajectories perturbed by the mixed adversarial attacks. Additionally, a robust goal-conditioned actor-critic method is proposed to learn goal-conditioned policies and keep the variations of the perturbed policy trajectories within bounds. Finally, the proposed method is evaluated on three tasks with adversarial attacks and sparse reward settings. The results indicate that our scheme can ensure robotic control performance and policy robustness on the adversarial and sparse reward tasks. © 2020 IEEE.
引用
收藏
页码:244 / 253
相关论文
共 50 条
  • [1] Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
    Li, Yunfei
    Gao, Tian
    Yang, Jiaqi
    Xu, Huazhe
    Wu, Yi
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [2] Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
    Ding, Hongyu
    Tang, Yuanze
    Wu, Qing
    Wang, Bo
    Chen, Chunlin
    Wang, Zhi
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (12) : 2233 - 2247
  • [3] Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
    Hongyu Ding
    Yuanze Tang
    Qing Wu
    Bo Wang
    Chunlin Chen
    Zhi Wang
    [J]. IEEE/CAA Journal of Automatica Sinica, 2023, 10 (12) : 2233 - 2247
  • [4] Goal-Conditioned Reinforcement Learning with Imagined Subgoals
    Chane-Sane, Elliot
    Schmid, Cordelia
    Laptev, Ivan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] State Representation Learning for Goal-Conditioned Reinforcement Learning
    Steccanella, Lorenzo
    Jonsson, Anders
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 84 - 99
  • [6] Goal exploration augmentation via pre-trained skills for sparse-reward long-horizon goal-conditioned reinforcement learning
    Lisheng Wu
    Ke Chen
    [J]. Machine Learning, 2024, 113 : 2527 - 2557
  • [7] MURM: Utilization of Multi-Views for Goal-Conditioned Reinforcement Learning in Robotic Manipulation
    Jang, Seongwon
    Jeong, Hyemi
    Yang, Hyunseok
    [J]. ROBOTICS, 2023, 12 (04)
  • [8] Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
    Hansen-Estruch, Philippe
    Zhang, Amy
    Nair, Ashvin
    Yin, Patrick
    Levine, Sergey
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] Goal exploration augmentation via pre-trained skills for sparse-reward long-horizon goal-conditioned reinforcement learning
    Wu, Lisheng
    Chen, Ke
    [J]. MACHINE LEARNING, 2024, 113 (05) : 2527 - 2557
  • [10] Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning
    Feng, Xiaoyun
    Jiang, Li
    Yu, Xudong
    Xu, Haoran
    Sun, Xiaoyan
    Wang, Jie
    Zhan, Xianyuan
    Chan, Wai Kin
    [J]. IEEE TRANSACTIONS ON GAMES, 2024, 16 (01) : 102 - 112