Anytime Algorithms for Multi-Armed Bandit Problems

被引：10

作者：

Kleinberg, Robert ^{[1
]}

机构：

[1] MIT CSAIL, Cambridge, MA 02139 USA

来源：

PROCEEDINGS OF THE SEVENTHEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS | 2006年

关键词：

D O I：

10.1145/1109557.1109659

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

引用

页码：928 / 936

页数：9

共 50 条

[31] An asymptotically optimal strategy for constrained multi-armed bandit problems
Chang, Hyeong Soo
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2020, 91 (03) : 545 - 557
[32] Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems
Vakili, Sattar
Liu, Keqin
Zhao, Qing
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (05) : 759 - 767
[33] The Effect of Communication on Noncooperative Multiplayer Multi-Armed Bandit Problems
Evirgen, Noyan
Kose, Alper
2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 331 - 336
[34] An asymptotically optimal strategy for constrained multi-armed bandit problems
Hyeong Soo Chang
Mathematical Methods of Operations Research, 2020, 91 : 545 - 557
[35] On the Optimality of Perturbations in Stochastic and Adversarial Multi-armed Bandit Problems
Kim, Baekjin
Tewari, Ambuj
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[36] Foraging decisions as multi-armed bandit problems: Applying reinforcement learning algorithms to foraging data
Morimoto, Juliano
JOURNAL OF THEORETICAL BIOLOGY, 2019, 467 : 48 - 56
[37] Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
Bubeck, Sebastien
Cesa-Bianchi, Nicolo
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 5 (01): : 1 - 122
[38] AB Testing for Process Versions with Contextual Multi-armed Bandit Algorithms
Satyal, Suhrid
Weber, Ingo
Paik, Hye-Young
Di Ciccio, Claudio
Mendling, Jan
ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2018, 2018, 10816 : 19 - 34
[39] Dynamic Multi-Armed Bandit with Covariates
Pavlidis, Nicos G.
Tasoulis, Dimitris K.
Adams, Niall M.
Hand, David J.
ECAI 2008, PROCEEDINGS, 2008, 178 : 777 - +
[40] Distributed Competitive Decision Making Using Multi-Armed Bandit Algorithms
Almasri, Mahmoud
Mansour, Ali
Moy, Christophe
Assoum, Ammar
Le Jeune, Denis
Osswald, Christophe
WIRELESS PERSONAL COMMUNICATIONS, 2021, 118 (02) : 1165 - 1188

← 1 2 3 4 5 →