Action Robust Reinforcement Learning and Applications in Continuous Control

被引:0
|
作者
Tessler, Chen [1 ]
Efroni, Yonathan [1 ]
Mannor, Shie [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, Haifa, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A policy is said to be robust if it maximizes the reward while considering a bad, or even adversarial, model. In this work we formalize two new criteria of robustness to action uncertainty. Specifically, we consider two scenarios in which the agent attempts to perform an action a, and (i) with probability alpha, an alternative adversarial action a is taken, or (ii) an adversary adds a perturbation to the selected action in the case of continuous action space. We show that our criteria are related to common forms of uncertainty in robotics domains, such as the occurrence of abrupt forces, and suggest algorithms in the tabular case. Building on the suggested algorithms, we generalize our approach to deep reinforcement learning (DRL) and provide extensive experiments in the various Mu-JoCo domains. Our experiments show that not only does our approach produce robust policies, but it also improves the performance in the absence of perturbations. This generalization indicates that action-robustness can be thought of as implicit regularization in RL problems.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Hierarchical Deep Reinforcement Learning for Continuous Action Control
    Yang, Zhaoyang
    Merrick, Kathryn
    Jin, Lianwen
    Abbass, Hussein A.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5174 - 5184
  • [2] Continuous action reinforcement learning applied to vehicle suspension control
    Howell, MN
    Frost, GP
    Gordon, TJ
    Wu, QH
    [J]. MECHATRONICS, 1997, 7 (03) : 263 - 276
  • [3] Robust Control in the Worst Case Using Continuous Time Reinforcement Learning
    Perrusquia, Adolfo
    Yu, Wen
    Li, Xiaoou
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 1951 - 1954
  • [4] Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization
    Zhang, Yuan
    Wang, Jianhong
    Boedecker, Joschka
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [5] Robust reinforcement learning control
    Kretchmar, RM
    Young, PM
    Anderson, CW
    Hittle, DC
    Anderson, ML
    Tu, J
    Delnero, CC
    [J]. PROCEEDINGS OF THE 2001 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2001, : 902 - 907
  • [6] Reinforcement learning in continuous action spaces
    van Hasselt, Hado
    Wiering, Marco A.
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 272 - +
  • [7] Convergent Reinforcement Learning Control with Neural Networks and Continuous Action Search
    Lee, Minwoo
    Anderson, Charles W.
    [J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 33 - 40
  • [8] Multi-Task Deep Reinforcement Learning for Continuous Action Control
    Yang, Zhaoyang
    Merrick, Kathryn
    Abbass, Hussein
    Jin, Lianwen
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3301 - 3307
  • [9] Robust Optimal Control of Continuous Time Linear System using Reinforcement Learning
    Sami, Abdul
    Memon, Attaullah Y.
    [J]. 2018 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2018, : 154 - 159
  • [10] Continuous Action Reinforcement Learning for Control-Affine Systems with Unknown Dynamics
    Aleksandra Faust
    Peter Ruymgaart
    Molly Salman
    Rafael Fierro
    Lydia Tapia
    [J]. IEEE/CAA Journal of Automatica Sinica, 2014, 1 (03) : 323 - 336