Learning adversarial attack policies through multi-objective reinforcement learning

Cited by: 11

Authors:

Garcia, Javier [1]

Majadas, Ruben [1]

Fernandez, Fernando [1]

Affiliations:

[1] Univ Carlos III Madrid, Dept Informat, Avda Univ 30, Madrid 28911, Spain

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2020 / Vol. 96

Keywords:

Multi-objective reinforcement learning; Adversarial reinforcement learning; LEVEL; STATE;

DOI:

10.1016/j.engappai.2020.104021

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Deep Reinforcement Learning has shown promising results in learning policies for complex sequential decision-making tasks. However, different adversarial attack strategies have revealed the vulnerability of these policies to perturbations of their observations. Most of these attacks have been built on existing adversarial example crafting techniques used to fool classifiers, where an adversarial attack is considered a success if it makes the classifier output any wrong class. The major drawback of these approaches when applied to decision-making tasks is that they are blind to long-term goals. In contrast, this paper suggests that it is more appropriate to view the attack process as a sequential optimization problem, with the aim of learning a sequence of attacks, where the attacker must consider the long-term effects of each attack. In this paper, we propose that such an attack policy must be learned with two objectives in view. On the one hand, the attack must pursue the maximum performance loss of the attacked policy. On the other hand, it should also minimize the cost of the attacks. Therefore, we propose a novel formulation of the process of learning an attack policy as a Multi-objective Markov Decision Process with two objectives: maximizing the performance loss of the attacked policy and minimizing the cost of the attacks. We also reveal the conflicting nature of these two objectives and use a Multi-objective Reinforcement Learning algorithm to draw the Pareto fronts for four well-known tasks: GridWorld, Cartpole, Mountain Car and Breakout.
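The trade-off between the two objectives named in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it only shows what a Pareto front over (performance loss, attack cost) pairs means. The names `dominates` and `pareto_front` and all numeric values are hypothetical, chosen for illustration.

```python
from typing import List, Tuple

# Each candidate attack policy is summarized by two numbers (hypothetical):
# (performance_loss, attack_cost). Loss is maximized, cost is minimized.
Outcome = Tuple[float, float]


def dominates(a: Outcome, b: Outcome) -> bool:
    """True if a is no worse than b in both objectives and strictly better in one."""
    no_worse = a[0] >= b[0] and a[1] <= b[1]
    strictly_better = a[0] > b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(outcomes: List[Outcome]) -> List[Outcome]:
    """Return the non-dominated outcomes, sorted by increasing attack cost."""
    front = [o for o in outcomes if not any(dominates(p, o) for p in outcomes)]
    return sorted(set(front), key=lambda o: o[1])


# Made-up outcomes for six candidate attack policies.
candidates = [
    (0.0, 0.0),    # never attack: no loss, no cost
    (10.0, 2.0),
    (8.0, 3.0),    # dominated by (10.0, 2.0)
    (25.0, 5.0),
    (25.0, 9.0),   # dominated by (25.0, 5.0)
    (40.0, 12.0),
]
front = pareto_front(candidates)
print(front)
```

Each point left on the front is a policy for which more performance loss can only be obtained by paying a higher attack cost, which is the conflict between the two objectives that the abstract describes.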


Pages: 11

50 items in total

[21] Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning
Kim, Man-Je
Park, Hyunsoo
Ahn, Chang Wook
[J]. ELECTRONICS, 2022, 11 (07)
[22] Decomposition based Multi-Objective Evolutionary Algorithm in XCS for Multi-Objective Reinforcement Learning
Cheng, Xiu
Browne, Will N.
Zhang, Mengjie
[J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 622 - 629
[23] Multi-objective Multiagent Credit Assignment Through Difference Rewards in Reinforcement Learning
Yliniemi, Logan
Tumer, Kagan
[J]. SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 407 - 418
[24] A Multi-objective Reinforcement Learning Algorithm for JSSP
Mendez-Hernandez, Beatriz M.
Rodriguez-Bazan, Erick D.
Martinez-Jimenez, Yailen
Libin, Pieter
Nowe, Ann
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 567 - 584
[25] Multi-Objective Service Composition Using Reinforcement Learning
Moustafa, Ahmed
Zhang, Minjie
[J]. SERVICE-ORIENTED COMPUTING, ICSOC 2013, 2013, 8274 : 298 - 312
[26] Taming Lagrangian chaos with multi-objective reinforcement learning
Calascibetta, Chiara
Biferale, Luca
Borra, Francesco
Celani, Antonio
Cencini, Massimo
[J]. EUROPEAN PHYSICAL JOURNAL E, 2023, 46 (03)
[27] Multi-Objective Reinforcement Learning for Designing Ethical Environments
Rodriguez-Soto, Manel
Lopez-Sanchez, Maite
Rodriguez-Aguilar, Juan A.
[J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 545 - 551
[28] Multi-Objective Order Scheduling via Reinforcement Learning
Chen, Sirui
Tian, Yuming
An, Lingling
[J]. ALGORITHMS, 2023, 16 (11)
[29] Dynamic Weights in Multi-Objective Deep Reinforcement Learning
Abels, Axel
Roijers, Diederik M.
Lenaerts, Tom
Nowe, Ann
Steckelmacher, Denis
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[30] A temporal difference method for multi-objective reinforcement learning
Ruiz-Montiel, Manuela
Mandow, Lawrence
Perez-de-la-Cruz, Jose-Luis
[J]. NEUROCOMPUTING, 2017, 263 : 15 - 25
