Action Robust Reinforcement Learning and Applications in Continuous Control

被引:0
|
作者
Tessler, Chen [1 ]
Efroni, Yonathan [1 ]
Mannor, Shie [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, Haifa, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A policy is said to be robust if it maximizes the reward while considering a bad, or even adversarial, model. In this work we formalize two new criteria of robustness to action uncertainty. Specifically, we consider two scenarios in which the agent attempts to perform an action a, and (i) with probability alpha, an alternative adversarial action a is taken, or (ii) an adversary adds a perturbation to the selected action in the case of continuous action space. We show that our criteria are related to common forms of uncertainty in robotics domains, such as the occurrence of abrupt forces, and suggest algorithms in the tabular case. Building on the suggested algorithms, we generalize our approach to deep reinforcement learning (DRL) and provide extensive experiments in the various Mu-JoCo domains. Our experiments show that not only does our approach produce robust policies, but it also improves the performance in the absence of perturbations. This generalization indicates that action-robustness can be thought of as implicit regularization in RL problems.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Robust Deep Reinforcement Learning for Quadcopter Control
    Deshpande, Aditya M.
    Minai, Ali A.
    Kumar, Manish
    IFAC PAPERSONLINE, 2021, 54 (20): : 90 - 95
  • [22] Active Reinforcement Learning for Robust Building Control
    Jang, Doseok
    Yan, Larry
    Spangher, Lucas
    Spanos, Costas J.
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22150 - 22158
  • [23] The driver and the engineer: Reinforcement learning and robust control
    Bernat, Natalie
    Chen, Jiexin
    Matni, Nikolai
    Doyle, John
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3932 - 3939
  • [24] Soft Action Particle Deep Reinforcement Learning for a Continuous Action Space
    Kang, Minjae
    Lee, Kyungjae
    Oh, Songhwai
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 5028 - 5033
  • [25] Continuous-time reinforcement learning for robust control under worst-case uncertainty
    Perrusquia, Adolfo
    Yu, Wen
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2021, 52 (04) : 770 - 784
  • [26] Continuous Control with a Combination of Supervised and Reinforcement Learning
    Kangin, Dmitry
    Pugeault, Nicolas
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 163 - 170
  • [27] Competitive reinforcement learning in continuous control tasks
    Abramson, M
    Pachowicz, P
    Wechsler, H
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1909 - 1914
  • [28] Recursive Compositional Reinforcement Learning for Continuous Control
    Tanik, Guven Orkun
    Ertekin, Seyda
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [29] Reinforcement learning for continuous stochastic control problems
    Munos, R
    Bourgine, P
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1029 - 1035
  • [30] Benchmarking Deep Reinforcement Learning for Continuous Control
    Duan, Yan
    Chen, Xi
    Houthooft, Rein
    Schulman, John
    Abbeel, Pieter
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48