Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information

Authors
Auer, Peter [1]
Chen, Yifang [2]
Gajane, Pratik [1]
Lee, Chung-Wei [2]
Luo, Haipeng [2]
Ortner, Ronald [1]
Wei, Chen-Yu [2]
Affiliations
[1] Montan Univ Leoben, Leoben, Austria
[2] Univ Southern Calif, Los Angeles, CA 90007 USA
Source
Funding
Austrian Science Fund;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This joint extended abstract introduces and compares the results of Auer et al. (2019) and Chen et al. (2019), both of which resolve the problem of achieving optimal dynamic regret for non-stationary bandits without prior information on the non-stationarity. Specifically, Auer et al. (2019) resolve the problem for the traditional multi-armed bandit setting, while Chen et al. (2019) give a solution for the more general contextual bandit setting. Both works extend the key idea of Auer et al. (2018), which was developed for a simpler two-armed setting.
Pages: 5