Robust Reinforcement Learning via Progressive Task Sequence

被引：0

作者：

Li, Yike ^{[1
]}

Tian, Yunzhe ^{[1
]}

Tong, Endong ^{[1
]}

Niu, Wenjia ^{[1
]}

Liu, Jiqiang ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Trans, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023 | 2023年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Robust reinforcement learning (RL) has been a challenging problem due to the gap between simulation and the real world. Existing efforts typically address the robust RL problem by solving a maxmin problem. The main idea is to maximize the cumulative reward under the worst-possible perturbations. However, the worst-case optimization either leads to overly conservative solutions or unstable training process, which further affects the policy robustness and generalization performance. In this paper, we tackle this problem from both formulation definition and algorithm design. First, we formulate the robust RL as a max-expectation optimization problem, where the goal is to find an optimal policy under both the worst cases and the non-worst cases. Then, we propose a novel framework DRRL to solve the max-expectation optimization. Given our definition of the feasible tasks, a task generation and sequencing mechanism is introduced to dynamically output tasks at appropriate difficulty level for the current policy. With these progressive tasks, DRRL realizes dynamic multi-task learning to improve the policy robustness and the training stability. Finally, extensive experiments demonstrate that the proposed method exhibits significant performance on the unmanned CarRacing game and multiple high-dimensional MuJoCo environments.

引用

页码：455 / 463

页数：9

共 50 条

[1] Curricular Robust Reinforcement Learning via GAN-Based Perturbation Through Continuously Scheduled Task Sequence
Li, Yike
Tian, Yunzhe
Tong, Endong
Niu, Wenjia
Xiang, Yingxiao
Chen, Tong
Wu, Yalun
Liu, Jiqiang
TSINGHUA SCIENCE AND TECHNOLOGY, 2023, 28 (01): : 27 - 38
[2] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
Yuan, Haoqi
Lu, Zongqing
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[3] Guided Reinforcement Learning via Sequence Learning
Ramamurthy, Rajkumar
Sifa, Rafet
Luebbering, Max
Bauckhage, Christian
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 335 - 345
[4] Towards Robust Knowledge Graph Embedding via Multi-Task Reinforcement Learning
Zhang, Zhao
Zhuang, Fuzhen
Zhu, Hengshu
Li, Chao
Xiong, Hui
He, Qing
Xu, Yongjun
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4321 - 4334
[5] Reinforcement Learning via Auxiliary Task Distillation
Harish, Abhinav Narayan
Heck, Larry
Hanna, Josiah P.
Zsolt
Szot, Andrew
COMPUTER VISION - ECCV 2024, PT LXXXI, 2025, 15139 : 214 - 230
[6] Machining sequence learning via inverse reinforcement learning
Sugisawa, Yasutomo
Takasugi, Keigo
Asakawa, Naoki
PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2022, 73 : 477 - 487
[7] Robust Reinforcement Learning via Genetic Curriculum
Song, Yeeho
Schneider, Jeff
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5560 - 5566
[8] SEQUENCE-TO-SEQUENCE ASR OPTIMIZATION VIA REINFORCEMENT LEARNING
Tjandra, Andros
Sakti, Sakriani
Nakamura, Satoshi
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5829 - 5833
[9] Learning Robust Representation for Reinforcement Learning with Distractions by Reward Sequence Prediction
Zhou, Qi
Wang, Jie
Liu, Qiyuan
Kuang, Yufei
Zhou, Wengang
Li, Houqiang
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2551 - 2562
[10] Robust and efficient task scheduling for robotics applications with reinforcement learning
Tejer, Mateusz
Szczepanski, Rafal
Tarczewski, Tomasz
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127

← 1 2 3 4 5 →