Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information

Cited by: 0

Authors:

Auer, Peter ^{[1]}

Chen, Yifang ^{[2]}

Gajane, Pratik ^{[1]}

Lee, Chung-Wei ^{[2]}

Luo, Haipeng ^{[2]}

Ortner, Ronald ^{[1]}

Wei, Chen-Yu ^{[2]}

Affiliations:

[1] Montan Univ Leoben, Leoben, Austria

[2] Univ Southern Calif, Los Angeles, CA 90007 USA

Source:

CONFERENCE ON LEARNING THEORY, VOL 99 | 2019 / Vol. 99

Funding:

Austrian Science Fund;

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This joint extended abstract introduces and compares the results of (Auer et al., 2019) and (Chen et al., 2019), both of which resolve the problem of achieving optimal dynamic regret for non-stationary bandits without prior information on the non-stationarity. Specifically, Auer et al. (2019) resolve the problem for the traditional multi-armed bandits setting, while Chen et al. (2019) give a solution for the more general contextual bandits setting. Both works extend the key idea of (Auer et al., 2018), which was developed for a simpler two-armed setting.


Pages: 5
