NEAR-OPTIMAL ALGORITHMS FOR PIECEWISE-STATIONARY CASCADING BANDITS

被引:1
|
作者
Wang, Lingda [1 ]
Zhou, Huozhi [1 ]
Li, Bingcong [2 ]
Varshney, Lay R. [1 ]
Zhao, Zhizhen [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Univ Minnesota Twin Cities, Minneapolis, MN USA
关键词
Online Learning; Cascading Bandits; Non-stationary Bandits; Change-point Detection;
D O I
10.1109/ICASSP39728.2021.9414506
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Cascading bandit (CB) is a popular model for web search and online advertising. However, the stationary CB model may be too simple to cope with real-world problems, where user preferences may change over time. Considering piecewise-stationary environments, two efficient algorithms, GLRT-CascadeUCB and GLRT-CascadeKL-UCB, are developed. Comparing with existing works, the proposed algorithms: i) are free of change-point-dependent information for choosing parameters; ii) have fewer tuning parameters; iii) improve regret upper bounds. We also show that the proposed algorithms are optimal up to logarithm terms by deriving a minimax lower bound Omega(root NLT) for piecewise-stationary CB. The efficiency of the proposed algorithms is validated through numerical tests on a real-world benchmark dataset.
引用
收藏
页码:3365 / 3369
页数:5
相关论文
共 50 条
  • [1] A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits
    Zhou, Huozhi
    Wang, Lingda
    Varshney, Lav R.
    Lim, Ee-Peng
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6933 - 6940
  • [2] Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits
    Besson, Lilian
    Kaufmann, Emilie
    Maillard, Odalric-Ambrym
    Seznec, Julien
    [J]. Journal of Machine Learning Research, 2022, 23
  • [3] Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits
    Besson, Lilian
    Kaufmann, Emilie
    Maillard, Odalric-Ambrym
    Seznec, Julien
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [4] Near-Optimal Collaborative Learning in Bandits
    Reda, Clemence
    Vakili, Sattar
    Kaufmann, Emilie
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [5] Nearly Optimal Adaptive Procedure with Change Detection for Piecewise-Stationary Bandit
    Cao, Yang
    Wen, Zheng
    Kveton, Branislav
    Xie, Yao
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 418 - 427
  • [6] Near-Optimal MNL Bandits Under Risk Criteria
    Xi, Guangyu
    Tao, Chao
    Zhou, Yuan
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10397 - 10404
  • [7] An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward
    Yang, Shangdong
    Wang, Hao
    Gao, Yang
    Chen, Xingguo
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 2130 - 2132
  • [8] DETECTION OF A DETERMINISTIC SIGNAL IN PIECEWISE-STATIONARY INTERFERENCE
    GOREV, PV
    KOLDANOV, AP
    [J]. TELECOMMUNICATIONS AND RADIO ENGINEERING, 1986, 40-1 (04) : 79 - 82
  • [9] Speckle coherence of piecewise-stationary stochastic targets
    Morgan, Matthew R.
    Trahey, Gregg E.
    Walker, William F.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (03): : 1721 - 1731
  • [10] The Model of Time Series as a Piecewise-Stationary Process
    Ivanov, N. G.
    Prasolov, A., V
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON APPLICATIONS IN INFORMATION TECHNOLOGY (ICAIT - 2018), 2018, : 150 - 153