Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information

被引：0

作者：

Auer, Peter ^{[1
]}

Chen, Yifang ^{[2
]}

Gajane, Pratik ^{[1
]}

Lee, Chung-Wei ^{[2
]}

Luo, Haipeng ^{[2
]}

Ortner, Ronald ^{[1
]}

Wei, Chen-Yu ^{[2
]}

机构：

[1] Montan Univ Leoben, Leoben, Austria

[2] Univ Southern Calif, Los Angeles, CA 90007 USA

来源：

CONFERENCE ON LEARNING THEORY, VOL 99 | 2019年 / 99卷

基金：

奥地利科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This joint extended abstract introduces and compares the results of (Auer et al., 2019) and (Chen et al., 2019), both of which resolve the problem of achieving optimal dynamic regret for nonstationary bandits without prior information on the non-stationarity. Specifically, Auer et al. (2019) resolve the problem for the traditional multi-armed bandits setting, while Chen et al. (2019) give a solution for the more general contextual bandits setting. Both works extend the key idea of (Auer et al., 2018) developed for a simpler two-armed setting.

引用

页数：5

共 50 条

[31] Online Learning with Non-Convex Losses and Non-Stationary Regret
Gao, Xiang
Li, Xiaobo
Zhang, Shuzhong
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
[32] Optimal encoding of non-stationary sources
Reif, JH
Storer, JA
INFORMATION SCIENCES, 2001, 135 (1-2) : 87 - 105
[33] Unimodal Bandits with Continuous Arms: Order-optimal Regret without Smoothness
Combes R.
Proutiere A.
Fauquette A.
Performance Evaluation Review, 2020, 48 (01): : 107 - 108
[34] Unimodal Bandits with Continuous Arms: Order-optimal Regret without Smoothness
Combes, Richard
Proutiere, Alexandre
Fauquette, Alexandre
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2020, 4 (01)
[35] Non-stationary Continuum-armed Bandits for Online Hyperparameter Optimization
Lu, Shiyin
Zhou, Yu-Hang
Shi, Jing-Cheng
Zhu, Wenya
Yu, Qingtao
Chen, Qing-Guo
Da, Qing
Zhang, Lijun
WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 618 - 627
[36] Information Theory for Non-Stationary Processes with Stationary Increments
Granero-Belinchon, Carlos
Roux, Stephane G.
Garnier, Nicolas B.
ENTROPY, 2019, 21 (12)
[37] Non-Stationary Dynamic Mode Decomposition
Ferre, John
Rokem, Ariel
Buffalo, Elizabeth A.
Kutz, J. Nathan
Fairhall, Adrienne
IEEE ACCESS, 2023, 11 : 117159 - 117176
[38] Dynamic histograms for non-stationary updates
Lam, E
Salem, K
9TH INTERNATIONAL DATABASE ENGINEERING & APPLICATION SYMPOSIUM, PROCEEDINGS, 2005, : 235 - 243
[39] Optimal inventory policies with non-stationary supply disruptions and advance supply information
Atasoy, Bilge
Gullu, Refik
Tan, Tarkan
DECISION SUPPORT SYSTEMS, 2012, 53 (02) : 269 - 281
[40] Optimal policy for a dynamic, non-stationary, stochastic inventory problem with capacity commitment
Xu, Ningxiong
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 199 (02) : 400 - 408

← 1 2 3 4 5 →