Robotic Control in Adversarial and Sparse Reward Environments: A Robust Goal-Conditioned Reinforcement Learning Approach

被引：1

作者：

He, Xiangkun ^{[1
]}

Lv, Chen ^{[1
]}

机构：

[1] Nanyang Technological University, School of Mechanical and Aerospace Engineering, 639798, Singapore

来源：

IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 01期

关键词：

Deep neural networks - Job analysis - Perturbation techniques - Robotics - Uncertainty analysis;

D O I：

10.1109/TAI.2023.3237665

中图分类号：

学科分类号：

摘要：

With deep neural networks-based function approximators, reinforcement learning holds the promise of learning complex end-to-end robotic controllers that can map high-dimensional sensory information directly to control policies. However, a common challenge, especially for robotics, is sample-efficient learning from sparse rewards, in which an agent is required to find a long sequence of 'correct' actions to achieve a desired outcome. Unfortunately, inevitable perturbations on observations may make this task trickier to solve. Here, this article advances a novel robust goal-conditioned reinforcement learning approach for end-to-end robotic control in adversarial and sparse reward environments. Specifically, a mixed adversarial attack scheme is presented to generate diverse adversarial perturbations on observations by combining white-box and black-box attacks. Meanwhile, a hindsight experience replay technique considering observation perturbations is developed to turn a failed experience into a successful one and generate the policy trajectories perturbed by the mixed adversarial attacks. Additionally, a robust goal-conditioned actor-critic method is proposed to learn goal-conditioned policies and keep the variations of the perturbed policy trajectories within bounds. Finally, the proposed method is evaluated on three tasks with adversarial attacks and sparse reward settings. The results indicate that our scheme can ensure robotic control performance and policy robustness on the adversarial and sparse reward tasks. © 2020 IEEE.

引用

页码：244 / 253

共 50 条

[1] Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Li, Yunfei
Gao, Tian
Yang, Jiaqi
Xu, Huazhe
Wu, Yi
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[2] Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Ding, Hongyu
Tang, Yuanze
Wu, Qing
Wang, Bo
Chen, Chunlin
Wang, Zhi
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (12) : 2233 - 2247
[3] Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuanze Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
[J]. IEEE/CAA Journal of Automatica Sinica, 2023, 10 (12) : 2233 - 2247
[4] Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Chane-Sane, Elliot
Schmid, Cordelia
Laptev, Ivan
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[5] State Representation Learning for Goal-Conditioned Reinforcement Learning
Steccanella, Lorenzo
Jonsson, Anders
[J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 84 - 99
[6] Goal exploration augmentation via pre-trained skills for sparse-reward long-horizon goal-conditioned reinforcement learning
Lisheng Wu
Ke Chen
[J]. Machine Learning, 2024, 113 : 2527 - 2557
[7] MURM: Utilization of Multi-Views for Goal-Conditioned Reinforcement Learning in Robotic Manipulation
Jang, Seongwon
Jeong, Hyemi
Yang, Hyunseok
[J]. ROBOTICS, 2023, 12 (04)
[8] Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Hansen-Estruch, Philippe
Zhang, Amy
Nair, Ashvin
Yin, Patrick
Levine, Sergey
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[9] Goal exploration augmentation via pre-trained skills for sparse-reward long-horizon goal-conditioned reinforcement learning
Wu, Lisheng
Chen, Ke
[J]. MACHINE LEARNING, 2024, 113 (05) : 2527 - 2557
[10] Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning
Feng, Xiaoyun
Jiang, Li
Yu, Xudong
Xu, Haoran
Sun, Xiaoyan
Wang, Jie
Zhan, Xianyuan
Chan, Wai Kin
[J]. IEEE TRANSACTIONS ON GAMES, 2024, 16 (01) : 102 - 112

← 1 2 3 4 5 →