Reward Attack on Stochastic Bandits with Non-stationary Rewards

被引:1
|
作者
Yang, Chenye [1 ]
Liu, Guanlin [1 ]
Lai, Lifeng [1 ]
机构
[1] Univ Calif Davis, Dept Elect & Comp Engn, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
bandit; non-stationary reward; attack cost;
D O I
10.1109/IEEECONF59524.2023.10476992
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate rewards attacks on stochastic multi-armed bandit algorithms with non-stationary environment. The attacker's goal is to force the victim algorithm to choose a suboptimal arm most of the time while incurring a small attack cost. Three main attack scenarios are considered: easy attack scenario, general attack scenario, and general attack scenario with limited information of victim algorithm. These scenarios have different assumptions about the environment and accessible information. We propose three attack strategies, one for each considered scenario, and prove that they are successful in terms of expected target arm selection and attack
引用
收藏
页码:1387 / 1393
页数:7
相关论文
共 50 条
  • [31] Non-stationary Continuum-armed Bandits for Online Hyperparameter Optimization
    Lu, Shiyin
    Zhou, Yu-Hang
    Shi, Jing-Cheng
    Zhu, Wenya
    Yu, Qingtao
    Chen, Qing-Guo
    Da, Qing
    Zhang, Lijun
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 618 - 627
  • [32] Non-Stationary Stochastic Global Optimization Algorithms
    Gomez, Jonatan
    Rivera, Andres
    ALGORITHMS, 2022, 15 (10)
  • [33] Martingale approximation of non-stationary stochastic processes
    Volny, Dalibor
    STOCHASTICS AND DYNAMICS, 2006, 6 (02) : 173 - 183
  • [34] THE SHIFT OPERATOR FOR NON-STATIONARY STOCHASTIC PROCESSES
    GETOOR, RK
    DUKE MATHEMATICAL JOURNAL, 1956, 23 (01) : 175 - 187
  • [35] Stochastic modelling of non-stationary smooth phenomena
    Tornatore, V
    Migliaccio, F
    IV HOTINE-MARUSSI SYMPOSIUM ON MATHEMATICAL GEODESY, 2001, (122): : 77 - 82
  • [36] Reliability aspects of a stochastic non-stationary process
    Burgazzi, L.
    RELIABILITY, RISK AND SAFETY: THEORY AND APPLICATIONS VOLS 1-3, 2010, : 2159 - 2164
  • [37] Stochastic modelling of non-stationary financial assets
    Estevens, Joana
    Rocha, Paulo
    Boto, Joao P.
    Lind, Pedro G.
    CHAOS, 2017, 27 (11)
  • [38] Stochastic Contextual Bandits with Long Horizon Rewards
    Qin, Yuzhen
    Li, Yingcong
    Pasqualetti, Fabio
    Fazel, Maryam
    Oymak, Samet
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9525 - 9533
  • [39] A stochastic harmonic function representation for non-stationary stochastic processes
    Chen, Jianbing
    Kong, Fan
    Peng, Yongbo
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2017, 96 : 31 - 44
  • [40] Stochastic Multi-path Routing Problem with Non-stationary Rewards: Building PayU's Dynamic Routing
    Trivedi, Pankaj
    Singh, Arvind
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1707 - 1712