Robust Reinforcement Learning via Progressive Task Sequence

被引:0
|
作者
Li, Yike [1 ]
Tian, Yunzhe [1 ]
Tong, Endong [1 ]
Niu, Wenjia [1 ]
Liu, Jiqiang [1 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Trans, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robust reinforcement learning (RL) has been a challenging problem due to the gap between simulation and the real world. Existing efforts typically address the robust RL problem by solving a maxmin problem. The main idea is to maximize the cumulative reward under the worst-possible perturbations. However, the worst-case optimization either leads to overly conservative solutions or unstable training process, which further affects the policy robustness and generalization performance. In this paper, we tackle this problem from both formulation definition and algorithm design. First, we formulate the robust RL as a max-expectation optimization problem, where the goal is to find an optimal policy under both the worst cases and the non-worst cases. Then, we propose a novel framework DRRL to solve the max-expectation optimization. Given our definition of the feasible tasks, a task generation and sequencing mechanism is introduced to dynamically output tasks at appropriate difficulty level for the current policy. With these progressive tasks, DRRL realizes dynamic multi-task learning to improve the policy robustness and the training stability. Finally, extensive experiments demonstrate that the proposed method exhibits significant performance on the unmanned CarRacing game and multiple high-dimensional MuJoCo environments.
引用
收藏
页码:455 / 463
页数:9
相关论文
共 50 条
  • [1] Curricular Robust Reinforcement Learning via GAN-Based Perturbation Through Continuously Scheduled Task Sequence
    Li, Yike
    Tian, Yunzhe
    Tong, Endong
    Niu, Wenjia
    Xiang, Yingxiao
    Chen, Tong
    Wu, Yalun
    Liu, Jiqiang
    TSINGHUA SCIENCE AND TECHNOLOGY, 2023, 28 (01): : 27 - 38
  • [2] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
    Yuan, Haoqi
    Lu, Zongqing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [3] Guided Reinforcement Learning via Sequence Learning
    Ramamurthy, Rajkumar
    Sifa, Rafet
    Luebbering, Max
    Bauckhage, Christian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 335 - 345
  • [4] Towards Robust Knowledge Graph Embedding via Multi-Task Reinforcement Learning
    Zhang, Zhao
    Zhuang, Fuzhen
    Zhu, Hengshu
    Li, Chao
    Xiong, Hui
    He, Qing
    Xu, Yongjun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4321 - 4334
  • [5] Reinforcement Learning via Auxiliary Task Distillation
    Harish, Abhinav Narayan
    Heck, Larry
    Hanna, Josiah P.
    Zsolt
    Szot, Andrew
    COMPUTER VISION - ECCV 2024, PT LXXXI, 2025, 15139 : 214 - 230
  • [6] Machining sequence learning via inverse reinforcement learning
    Sugisawa, Yasutomo
    Takasugi, Keigo
    Asakawa, Naoki
    PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2022, 73 : 477 - 487
  • [7] Robust Reinforcement Learning via Genetic Curriculum
    Song, Yeeho
    Schneider, Jeff
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5560 - 5566
  • [8] SEQUENCE-TO-SEQUENCE ASR OPTIMIZATION VIA REINFORCEMENT LEARNING
    Tjandra, Andros
    Sakti, Sakriani
    Nakamura, Satoshi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5829 - 5833
  • [9] Learning Robust Representation for Reinforcement Learning with Distractions by Reward Sequence Prediction
    Zhou, Qi
    Wang, Jie
    Liu, Qiyuan
    Kuang, Yufei
    Zhou, Wengang
    Li, Houqiang
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2551 - 2562
  • [10] Robust and efficient task scheduling for robotics applications with reinforcement learning
    Tejer, Mateusz
    Szczepanski, Rafal
    Tarczewski, Tomasz
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127