Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model

被引:0
|
作者
Li, Chang [1 ]
de Rijke, Maarten [1 ]
机构
[1] Univ Amsterdam, Amsterdam, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-stationarity appears in many online applications such as web search and advertising. In this paper, we study the online learning to rank problem in a non-stationary environment where user preferences change abruptly at an unknown moment in time. We consider the problem of identifying the K most attractive items and propose cascading non-stationary bandits, an online learning variant of the cascading model, where a user browses a ranked list from top to bottom and clicks on the first attractive item. We propose two algorithms for solving this non-stationary problem: CascadeDUCB and CascadeSWUCB. We analyze their performance and derive gap-dependent upper bounds on the n-step regret of these algorithms. We also establish a lower bound on the regret for cascading nonstationary bandits and show that both algorithms match the lower bound up to a logarithmic factor. Finally, we evaluate their performance on a real-world web search click dataset.
引用
收藏
页码:2859 / 2865
页数:7
相关论文
共 50 条
  • [1] Non-stationary Dueling Bandits for Online Learning to Rank
    Lu, Shiyin
    Miao, Yuan
    Yang, Ping
    Hu, Yao
    Zhang, Lijun
    [J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 166 - 174
  • [2] Non-stationary Bandits with Knapsacks
    Liu, Shang
    Jiang, Jiashuo
    Li, Xiaocheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [3] Learning Contextual Bandits in a Non-stationary Environment
    Wu, Qingyun
    Iyer, Naveen
    Wang, Hongning
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 495 - 504
  • [4] Online learning of non-stationary sequences
    Monteleoni, C
    Jaakkola, T
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 1093 - 1100
  • [5] Unifying Clustered and Non-stationary Bandits
    Li, Chuanhao
    Wu, Qingyun
    Wang, Hongning
    [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [6] Non-Stationary Representation Learning in Sequential Linear Bandits
    Qin, Yuzhen
    Menara, Tommaso
    Oymak, Samet
    Ching, Shinung
    Pasqualetti, Fabio
    [J]. IEEE Open Journal of Control Systems, 2022, 1 : 41 - 56
  • [7] Online Learning for Non-Stationary A/B Tests
    Medina, Andres Munoz
    Vassilvitiskii, Sergei
    Yin, Dong
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 317 - 326
  • [8] Online Non-stationary Boosting
    Pocock, Adam
    Yiapanis, Paraskevas
    Singer, Jeremy
    Lujan, Mikel
    Brown, Gavin
    [J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2010, 5997 : 205 - 214
  • [9] Non-stationary lognormal model development and comparison with the non-stationary GEV model
    Aissaoui-Fqayeh, I.
    El-Adlouni, S.
    Ouarda, T. B. M. J.
    St-Hilaire, A.
    [J]. HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2009, 54 (06): : 1141 - 1156
  • [10] A Simple Approach for Non-stationary Linear Bandits
    Zhao, Peng
    Zhang, Lijun
    Jiang, Yuan
    Zhou, Zhi-Hua
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 746 - 754