Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits

被引:0
|
作者
Besson, Lilian [1 ]
Kaufmann, Emilie [2 ]
Maillard, Odalric-Ambrym [2 ]
Seznec, Julien
机构
[1] IRISA, Inria Rennes, Ens, France
[2] Univ Lille, F-59000 Lille, France
关键词
Multi-Armed Bandits; Change Point Detection; Non-Stationary Bandits; LIKELIHOOD RATIO; REGRET BOUNDS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce GLR-klUCB, a novel algorithm for the piecewise i.i.d. non-stationary bandit problem with bounded rewards. This algorithm combines an efficient bandit algorithm, klUCB, with an efficient, parameter-free, change-point detector, the Bernoulli Generalized Likelihood Ratio Test, for which we provide new theoretical guarantees of independent interest. Unlike previous nonstationary bandit algorithms using a change-point detector, GLR-klUCB does not need to be calibrated based on prior knowledge on the arms' means. We prove that this algorithm can attain a TATT ln(T)) regret in T rounds on some "easy" instances in which there is sufficient delay between two change-points, where A is the number of arms and TT the number of change-points, without prior knowledge of TT. In contrast with recently proposed algorithms that are agnostic to TT, we perform a numerical study showing that GLR-klUCB is also very efficient in practice, beyond easy instances.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits
    Besson, Lilian
    Kaufmann, Emilie
    Maillard, Odalric-Ambrym
    Seznec, Julien
    [J]. Journal of Machine Learning Research, 2022, 23
  • [2] NETWORK INFERENCE AND CHANGE POINT DETECTION FOR PIECEWISE-STATIONARY TIME SERIES
    Yu, Hang
    Li, Chenyang
    Dauwels, Justin
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] NEAR-OPTIMAL ALGORITHMS FOR PIECEWISE-STATIONARY CASCADING BANDITS
    Wang, Lingda
    Zhou, Huozhi
    Li, Bingcong
    Varshney, Lay R.
    Zhao, Zhizhen
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3365 - 3369
  • [4] A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits
    Zhou, Huozhi
    Wang, Lingda
    Varshney, Lav R.
    Lim, Ee-Peng
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6933 - 6940
  • [5] Nearly Optimal Adaptive Procedure with Change Detection for Piecewise-Stationary Bandit
    Cao, Yang
    Wen, Zheng
    Kveton, Branislav
    Xie, Yao
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 418 - 427
  • [6] DETECTION OF A DETERMINISTIC SIGNAL IN PIECEWISE-STATIONARY INTERFERENCE
    GOREV, PV
    KOLDANOV, AP
    [J]. TELECOMMUNICATIONS AND RADIO ENGINEERING, 1986, 40-1 (04) : 79 - 82
  • [7] Change-Point Detection for Variance Piecewise Constant Models
    Adelfio, Giada
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2012, 41 (04) : 437 - 448
  • [8] Change-point detection for piecewise deterministic Markov processes
    Cleynen, Alice
    de Saporta, Benoite
    [J]. AUTOMATICA, 2018, 97 : 234 - 247
  • [9] A Change-Detection Based Framework for Piecewise-Stationary Multi-Armed Bandit Problem
    Liu, Fang
    Lee, Joohyun
    Shroff, Ness
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3651 - 3658
  • [10] General fuzzy piecewise regression analysis with automatic change-point detection
    Yu, JR
    Tzeng, GH
    Li, HL
    [J]. FUZZY SETS AND SYSTEMS, 2001, 119 (02) : 247 - 257