Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information

被引:0
|
作者
Auer, Peter [1 ]
Chen, Yifang [2 ]
Gajane, Pratik [1 ]
Lee, Chung-Wei [2 ]
Luo, Haipeng [2 ]
Ortner, Ronald [1 ]
Wei, Chen-Yu [2 ]
机构
[1] Montan Univ Leoben, Leoben, Austria
[2] Univ Southern Calif, Los Angeles, CA 90007 USA
来源
基金
奥地利科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This joint extended abstract introduces and compares the results of (Auer et al., 2019) and (Chen et al., 2019), both of which resolve the problem of achieving optimal dynamic regret for nonstationary bandits without prior information on the non-stationarity. Specifically, Auer et al. (2019) resolve the problem for the traditional multi-armed bandits setting, while Chen et al. (2019) give a solution for the more general contextual bandits setting. Both works extend the key idea of (Auer et al., 2018) developed for a simpler two-armed setting.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Online Learning with Non-Convex Losses and Non-Stationary Regret
    Gao, Xiang
    Li, Xiaobo
    Zhang, Shuzhong
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [32] Optimal encoding of non-stationary sources
    Reif, JH
    Storer, JA
    INFORMATION SCIENCES, 2001, 135 (1-2) : 87 - 105
  • [33] Unimodal Bandits with Continuous Arms: Order-optimal Regret without Smoothness
    Combes R.
    Proutiere A.
    Fauquette A.
    Performance Evaluation Review, 2020, 48 (01): : 107 - 108
  • [34] Unimodal Bandits with Continuous Arms: Order-optimal Regret without Smoothness
    Combes, Richard
    Proutiere, Alexandre
    Fauquette, Alexandre
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2020, 4 (01)
  • [35] Non-stationary Continuum-armed Bandits for Online Hyperparameter Optimization
    Lu, Shiyin
    Zhou, Yu-Hang
    Shi, Jing-Cheng
    Zhu, Wenya
    Yu, Qingtao
    Chen, Qing-Guo
    Da, Qing
    Zhang, Lijun
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 618 - 627
  • [36] Information Theory for Non-Stationary Processes with Stationary Increments
    Granero-Belinchon, Carlos
    Roux, Stephane G.
    Garnier, Nicolas B.
    ENTROPY, 2019, 21 (12)
  • [37] Non-Stationary Dynamic Mode Decomposition
    Ferre, John
    Rokem, Ariel
    Buffalo, Elizabeth A.
    Kutz, J. Nathan
    Fairhall, Adrienne
    IEEE ACCESS, 2023, 11 : 117159 - 117176
  • [38] Dynamic histograms for non-stationary updates
    Lam, E
    Salem, K
    9TH INTERNATIONAL DATABASE ENGINEERING & APPLICATION SYMPOSIUM, PROCEEDINGS, 2005, : 235 - 243
  • [39] Optimal inventory policies with non-stationary supply disruptions and advance supply information
    Atasoy, Bilge
    Gullu, Refik
    Tan, Tarkan
    DECISION SUPPORT SYSTEMS, 2012, 53 (02) : 269 - 281
  • [40] Optimal policy for a dynamic, non-stationary, stochastic inventory problem with capacity commitment
    Xu, Ningxiong
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 199 (02) : 400 - 408