Reward Attack on Stochastic Bandits with Non-stationary Rewards

被引：1

作者：

Yang, Chenye ^{[1
]}

Liu, Guanlin ^{[1
]}

Lai, Lifeng ^{[1
]}

机构：

[1] Univ Calif Davis, Dept Elect & Comp Engn, Davis, CA 95616 USA

来源：

FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF | 2023年

基金：

美国国家科学基金会;

关键词：

bandit; non-stationary reward; attack cost;

D O I：

10.1109/IEEECONF59524.2023.10476992

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we investigate rewards attacks on stochastic multi-armed bandit algorithms with non-stationary environment. The attacker's goal is to force the victim algorithm to choose a suboptimal arm most of the time while incurring a small attack cost. Three main attack scenarios are considered: easy attack scenario, general attack scenario, and general attack scenario with limited information of victim algorithm. These scenarios have different assumptions about the environment and accessible information. We propose three attack strategies, one for each considered scenario, and prove that they are successful in terms of expected target arm selection and attack

引用

页码：1387 / 1393

页数：7

共 50 条

[21] Time-Decaying Bandits for Non-stationary Systems
Komiyama, Junpei
Qin, Tao
WEB AND INTERNET ECONOMICS, 2014, 8877 : 460 - 466
[22] Beam Alignment for mmWave Using Non-Stationary Bandits
Gupta, Ruchir
Lakshmanan, K.
Sah, Abhay Kumar
IEEE COMMUNICATIONS LETTERS, 2020, 24 (11) : 2619 - 2622
[23] Non-Stationary Representation Learning in Sequential Linear Bandits
Qin, Yuzhen
Menara, Tommaso
Oymak, Samet
Ching, Shinung
Pasqualetti, Fabio
IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2022, 1 : 41 - 56
[24] Non-stationary Dueling Bandits for Online Learning to Rank
Lu, Shiyin
Miao, Yuan
Yang, Ping
Hu, Yao
Zhang, Lijun
WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 166 - 174
[25] Non-Stationary Bandits with Auto-Regressive Temporal Dependency
Chen, Qinyi
Golrezaei, Negin
Bouneffouf, Djallel
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[26] Bootstrapping non-stationary stochastic volatility
Boswijk, H. Peter
Cavaliere, Giuseppe
Georgiev, Iliyan
Rahbek, Anders
JOURNAL OF ECONOMETRICS, 2021, 224 (01) : 161 - 180
[27] Stochastic Modeling of Non-Stationary Channels
Gligorevic, Snjezana
2013 7TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP), 2013, : 1677 - 1681
[28] Maximizing Reward in a Non-Stationary Mobile Robot Environment
Dani Goldberg
Maja J. Matarić
Autonomous Agents and Multi-Agent Systems, 2003, 6 : 287 - 316
[29] Maximizing reward in a non-stationary mobile robot environment
Goldberg, D
Mataric, MJ
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2003, 6 (03) : 287 - 316
[30] Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Saha, Aadirupa
Gupta, Shubham
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19027 - 19049

← 1 2 3 4 5 →