Anytime Algorithms for Multi-Armed Bandit Problems

Times cited: 10
Author
Kleinberg, Robert [1]
Affiliation
[1] MIT CSAIL, Cambridge, MA 02139 USA
DOI
10.1145/1109557.1109659
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Pages: 928 - 936
Page count: 9
Related papers
50 in total
  • [32] Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems
    Vakili, Sattar
    Liu, Keqin
    Zhao, Qing
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (05) : 759 - 767
  • [33] The Effect of Communication on Noncooperative Multiplayer Multi-Armed Bandit Problems
    Evirgen, Noyan
    Kose, Alper
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 331 - 336
  • [34] An asymptotically optimal strategy for constrained multi-armed bandit problems
    Hyeong Soo Chang
    Mathematical Methods of Operations Research, 2020, 91 : 545 - 557
  • [35] On the Optimality of Perturbations in Stochastic and Adversarial Multi-armed Bandit Problems
    Kim, Baekjin
    Tewari, Ambuj
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [36] Foraging decisions as multi-armed bandit problems: Applying reinforcement learning algorithms to foraging data
    Morimoto, Juliano
    JOURNAL OF THEORETICAL BIOLOGY, 2019, 467 : 48 - 56
  • [37] Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
    Bubeck, Sebastien
    Cesa-Bianchi, Nicolo
    FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 5 (01) : 1 - 122
  • [38] AB Testing for Process Versions with Contextual Multi-armed Bandit Algorithms
    Satyal, Suhrid
    Weber, Ingo
    Paik, Hye-Young
    Di Ciccio, Claudio
    Mendling, Jan
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2018, 2018, 10816 : 19 - 34
  • [39] Dynamic Multi-Armed Bandit with Covariates
    Pavlidis, Nicos G.
    Tasoulis, Dimitris K.
    Adams, Niall M.
    Hand, David J.
    ECAI 2008, PROCEEDINGS, 2008, 178 : 777 - +
  • [40] Distributed Competitive Decision Making Using Multi-Armed Bandit Algorithms
    Almasri, Mahmoud
    Mansour, Ali
    Moy, Christophe
    Assoum, Ammar
    Le Jeune, Denis
    Osswald, Christophe
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 118 (02) : 1165 - 1188