Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network

Authors
Schott, Lucas [1, 2]
Hajri, Hatem [1]
Lamprier, Sylvain [2]
Affiliations
[1] IRT SystemX, Palaiseau, France
[2] Sorbonne Univ, ISIR, Paris, France
Keywords
Deep Reinforcement Learning; Adversarial Training; Robustness
DOI
10.1109/IJCNN55064.2022.9892901
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
To improve the robustness of deep reinforcement learning agents, a line of recent work focuses on producing disturbances of the environment dynamics. Existing approaches in the literature generate such disturbances with environment adversarial reinforcement learning methods. These methods frame the problem as a two-player game between a protagonist agent, which learns to perform a task in an environment, and an adversary agent, which learns to disturb the dynamics of that environment so that the protagonist fails. As an alternative, we propose to build on gradient-based adversarial attacks, which are commonly used for classification tasks, and to apply them to the critic network of the protagonist in order to identify effective disturbances of the environment dynamics. Rather than training an adversary agent, which usually proves very complex and unstable, we leverage the knowledge of the protagonist's critic network to dynamically increase the complexity of the task at each step of the learning process. We show that our method, while faster and lighter, improves the robustness of the policy significantly more than existing methods in the literature.
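The idea of attacking through the critic can be illustrated with a minimal sketch. This is not the authors' implementation: the critic here is a toy closed-form function with hypothetical weights, the attack is a single FGSM-style signed-gradient step (one common gradient-based attack, chosen only for illustration), and it assumes the perturbable environment quantities are exposed as entries of the state vector.

```python
import math

def critic_value(state, action, weights):
    """Toy stand-in for a critic Q(s, a): tanh of a linear score (hypothetical)."""
    inputs = list(state) + list(action)
    score = sum(w * x for w, x in zip(weights, inputs))
    return math.tanh(score)

def critic_grad_state(state, action, weights):
    """Analytic gradient of the toy critic with respect to the state entries."""
    q = critic_value(state, action, weights)
    scale = 1.0 - q * q  # derivative of tanh at the current score
    return [scale * w for w in weights[:len(state)]]

def sign(x):
    """Return -1, 0 or 1 according to the sign of x."""
    return (x > 0) - (x < 0)

def critic_guided_disturbance(state, action, weights, epsilon):
    """One signed-gradient step that moves the state toward lower Q."""
    grad = critic_grad_state(state, action, weights)
    return [s - epsilon * sign(g) for s, g in zip(state, grad)]

# Hypothetical values, chosen only to exercise the functions.
state = [0.5, -0.2, 0.1]
action = [0.3]
weights = [0.8, -0.4, 0.0, 0.5]  # first three apply to the state, last to the action

disturbed = critic_guided_disturbance(state, action, weights, epsilon=0.05)
q_before = critic_value(state, action, weights)
q_after = critic_value(disturbed, action, weights)
print(q_before, q_after)
```

In this sketch the disturbed state is the one the critic rates as worse for the protagonist, within a budget of epsilon per entry. With a learned critic, the analytic gradient would typically be replaced by automatic differentiation.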
Pages: 8