Learning adversarial attack policies through multi-objective reinforcement learning

被引:11
|
作者
Garcia, Javier [1 ]
Majadas, Ruben [1 ]
Fernandez, Fernando [1 ]
机构
[1] Univ Carlos III Madrid, Dept Informat, Avda Univ 30, Madrid 28911, Spain
关键词
Multi-objective reinforcement learning; Adversarial reinforcement learning; LEVEL; STATE;
D O I
10.1016/j.engappai.2020.104021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Reinforcement Learning has shown promising results in learning policies for complex sequential decisionmaking tasks. However, different adversarial attack strategies have revealed the weakness of these policies to perturbations to their observations. Most of these attacks have been built on existing adversarial example crafting techniques used to fool classifiers, where an adversarial attack is considered a success if it makes the classifier outputs any wrong class. The major drawback of these approaches when applied to decision-making tasks is that they are blind for long-term goals. In contrast, this paper suggests that it is more appropriate to view the attack process as a sequential optimization problem, with the aim of learning a sequence of attacks, where the attacker must consider the long-term effects of each attack. In this paper, we propose that such an attack policy must be learned with two objectives in view. On the one hand, the attack must pursue the maximum performance loss of the attacked policy. On the other hand, it also should minimize the cost of the attacks. Therefore, in this paper we propose a novel modelization of the process of learning an attack policy as a Multi-objective Markov Decision Process with two objectives: maximizing the performance loss of the attacked policy and minimizing the cost of the attacks. We also reveal the conflicting nature of these two objectives and use a Multi-objective Reinforcement Learning algorithm to draw the Pareto fronts for four well-known tasks: the GridWorld, the Cartpole, the Mountain car and the Breakout.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning
    Kim, Man-Je
    Park, Hyunsoo
    Ahn, Chang Wook
    [J]. ELECTRONICS, 2022, 11 (07)
  • [22] Decomposition based Multi-Objective Evolutionary Algorithm in XCS for Multi-Objective Reinforcement Learning
    Cheng, Xiu
    Browne, Will N.
    Zhang, Mengjie
    [J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 622 - 629
  • [23] Multi-objective Multiagent Credit Assignment Through Difference Rewards in Reinforcement Learning
    Yliniemi, Logan
    Tumer, Kagan
    [J]. SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 407 - 418
  • [24] A Multi-objective Reinforcement Learning Algorithm for JS']JSSP
    Mendez-Hernandez, Beatriz M.
    Rodriguez-Bazan, Erick D.
    Martinez-Jimenez, Yailen
    Libin, Pieter
    Nowe, Ann
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 567 - 584
  • [25] Multi-Objective Service Composition Using Reinforcement Learning
    Moustafa, Ahmed
    Zhang, Minjie
    [J]. SERVICE-ORIENTED COMPUTING, ICSOC 2013, 2013, 8274 : 298 - 312
  • [26] Taming Lagrangian chaos with multi-objective reinforcement learning
    Calascibetta, Chiara
    Biferale, Luca
    Borra, Francesco
    Celani, Antonio
    Cencini, Massimo
    [J]. EUROPEAN PHYSICAL JOURNAL E, 2023, 46 (03):
  • [27] Multi-Objective Reinforcement Learning for Designing Ethical Environments
    Rodriguez-Soto, Manel
    Lopez-Sanchez, Maite
    Rodriguez-Aguilar, Juan A.
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 545 - 551
  • [28] Multi-Objective Order Scheduling via Reinforcement Learning
    Chen, Sirui
    Tian, Yuming
    An, Lingling
    [J]. ALGORITHMS, 2023, 16 (11)
  • [29] Dynamic Weights in Multi-Objective Deep Reinforcement Learning
    Abels, Axel
    Roijers, Diederik M.
    Lenaerts, Tom
    Nowe, Ann
    Steckelmacher, Denis
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [30] A temporal difference method for multi-objective reinforcement learning
    Ruiz-Montiel, Manuela
    Mandow, Lawrence
    Perez-de-la-Cruz, Jose-Luis
    [J]. NEUROCOMPUTING, 2017, 263 : 15 - 25