Reward Attack on Stochastic Bandits with Non-stationary Rewards

被引:1
|
作者
Yang, Chenye [1 ]
Liu, Guanlin [1 ]
Lai, Lifeng [1 ]
机构
[1] Univ Calif Davis, Dept Elect & Comp Engn, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
bandit; non-stationary reward; attack cost;
D O I
10.1109/IEEECONF59524.2023.10476992
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate rewards attacks on stochastic multi-armed bandit algorithms with non-stationary environment. The attacker's goal is to force the victim algorithm to choose a suboptimal arm most of the time while incurring a small attack cost. Three main attack scenarios are considered: easy attack scenario, general attack scenario, and general attack scenario with limited information of victim algorithm. These scenarios have different assumptions about the environment and accessible information. We propose three attack strategies, one for each considered scenario, and prove that they are successful in terms of expected target arm selection and attack
引用
收藏
页码:1387 / 1393
页数:7
相关论文
共 50 条
  • [41] Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information
    Auer, Peter
    Chen, Yifang
    Gajane, Pratik
    Lee, Chung-Wei
    Luo, Haipeng
    Ortner, Ronald
    Wei, Chen-Yu
    CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
  • [42] ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits
    Buening, Thomas Kleine
    Saha, Aadirupa
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [43] A Change-Detection-Based Thompson Sampling Framework for Non-Stationary Bandits
    Ghatak, Gourab
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (10) : 1670 - 1676
  • [44] Contextual Multi-Armed Bandits for Non-Stationary Wireless Network Selection
    Martinez, Lluis
    Vidal, Josep
    Cabrera-Bean, Margarita
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 285 - 290
  • [45] Non-Stationary Bandits under Recharging Payoffs: Improved Planning with Sublinear Regret
    Papadigenopoulos, Orestis
    Caramanis, Constantine
    Shakkottai, Sanjay
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [46] A Technical Note on Non-Stationary Parametric Bandits: Existing Mistakes and Preliminary Solutions
    Faury, Louis
    Russac, Yoan
    Abeille, Marc
    Calauzenes, Clement
    ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132
  • [47] Stationary and non-stationary stochastic response of linear fractional viscoelastic systems
    Di Paola, Mario
    Failla, Giuseppe
    Pirrotta, Antonina
    PROBABILISTIC ENGINEERING MECHANICS, 2012, 28 : 85 - 90
  • [48] Non-stationary stochastic embedding for transfer function estimation
    Goodwin, GC
    Braslavsky, JH
    Seron, MM
    AUTOMATICA, 2002, 38 (01) : 47 - 62
  • [49] Tracking the Best Expert in Non-stationary Stochastic Environments
    Wei, Chen-Yu
    Hong, Yi-Te
    Lu, Chi-Jen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [50] Non-stationary Stochastic Network Optimization with Imperfect Estimations
    Liu, Yu
    Liu, Zhenhua
    Yang, Yuanyuan
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 431 - 441