Reward Attack on Stochastic Bandits with Non-stationary Rewards

被引:1
|
作者
Yang, Chenye [1 ]
Liu, Guanlin [1 ]
Lai, Lifeng [1 ]
机构
[1] Univ Calif Davis, Dept Elect & Comp Engn, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
bandit; non-stationary reward; attack cost;
D O I
10.1109/IEEECONF59524.2023.10476992
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate rewards attacks on stochastic multi-armed bandit algorithms with non-stationary environment. The attacker's goal is to force the victim algorithm to choose a suboptimal arm most of the time while incurring a small attack cost. Three main attack scenarios are considered: easy attack scenario, general attack scenario, and general attack scenario with limited information of victim algorithm. These scenarios have different assumptions about the environment and accessible information. We propose three attack strategies, one for each considered scenario, and prove that they are successful in terms of expected target arm selection and attack
引用
收藏
页码:1387 / 1393
页数:7
相关论文
共 50 条
  • [21] Time-Decaying Bandits for Non-stationary Systems
    Komiyama, Junpei
    Qin, Tao
    WEB AND INTERNET ECONOMICS, 2014, 8877 : 460 - 466
  • [22] Beam Alignment for mmWave Using Non-Stationary Bandits
    Gupta, Ruchir
    Lakshmanan, K.
    Sah, Abhay Kumar
    IEEE COMMUNICATIONS LETTERS, 2020, 24 (11) : 2619 - 2622
  • [23] Non-Stationary Representation Learning in Sequential Linear Bandits
    Qin, Yuzhen
    Menara, Tommaso
    Oymak, Samet
    Ching, Shinung
    Pasqualetti, Fabio
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2022, 1 : 41 - 56
  • [24] Non-stationary Dueling Bandits for Online Learning to Rank
    Lu, Shiyin
    Miao, Yuan
    Yang, Ping
    Hu, Yao
    Zhang, Lijun
    WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 166 - 174
  • [25] Non-Stationary Bandits with Auto-Regressive Temporal Dependency
    Chen, Qinyi
    Golrezaei, Negin
    Bouneffouf, Djallel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [26] Bootstrapping non-stationary stochastic volatility
    Boswijk, H. Peter
    Cavaliere, Giuseppe
    Georgiev, Iliyan
    Rahbek, Anders
    JOURNAL OF ECONOMETRICS, 2021, 224 (01) : 161 - 180
  • [27] Stochastic Modeling of Non-Stationary Channels
    Gligorevic, Snjezana
    2013 7TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP), 2013, : 1677 - 1681
  • [28] Maximizing Reward in a Non-Stationary Mobile Robot Environment
    Dani Goldberg
    Maja J. Matarić
    Autonomous Agents and Multi-Agent Systems, 2003, 6 : 287 - 316
  • [29] Maximizing reward in a non-stationary mobile robot environment
    Goldberg, D
    Mataric, MJ
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2003, 6 (03) : 287 - 316
  • [30] Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
    Saha, Aadirupa
    Gupta, Shubham
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19027 - 19049