Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model

Cited by: 0

Authors:

Li, Chang [1]

de Rijke, Maarten [1]

Affiliations:

[1] Univ Amsterdam, Amsterdam, Netherlands

Source:

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Non-stationarity appears in many online applications such as web search and advertising. In this paper, we study the online learning to rank problem in a non-stationary environment where user preferences change abruptly at an unknown moment in time. We consider the problem of identifying the K most attractive items and propose cascading non-stationary bandits, an online learning variant of the cascading model, where a user browses a ranked list from top to bottom and clicks on the first attractive item. We propose two algorithms for solving this non-stationary problem: CascadeDUCB and CascadeSWUCB. We analyze their performance and derive gap-dependent upper bounds on the n-step regret of these algorithms. We also establish a lower bound on the regret for cascading non-stationary bandits and show that both algorithms match the lower bound up to a logarithmic factor. Finally, we evaluate their performance on a real-world web search click dataset.
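
The cascade click behavior described in the abstract, and the general idea of forgetting old feedback through a sliding window, can be illustrated with a minimal Python sketch. This is not the authors' implementation of CascadeSWUCB: the class name `SlidingWindowCascadeUCB`, the function `cascade_click`, the exploration constant 1.5, and the choice to measure the window in rounds are all assumptions made for illustration.

```python
import math
import random
from collections import deque


def cascade_click(ranked_items, attraction, rng):
    """Simulate one user under the cascade model.

    The user scans the list top to bottom and clicks the first item
    found attractive. Returns the clicked position, or None if no click.
    """
    for pos, item in enumerate(ranked_items):
        if rng.random() < attraction[item]:
            return pos
    return None


class SlidingWindowCascadeUCB:
    """Illustrative sliding-window UCB ranker (hypothetical, not the paper's code)."""

    def __init__(self, n_items, k, window):
        self.n_items = n_items
        self.k = k
        self.window = window      # number of most recent rounds to remember
        self.history = deque()    # one list of (item, reward) pairs per round

    def rank(self, t):
        """Return the k items with the highest UCB computed from the window."""
        clicks = [0.0] * self.n_items
        pulls = [0] * self.n_items
        for observations in self.history:
            for item, reward in observations:
                clicks[item] += reward
                pulls[item] += 1
        horizon = max(min(t, self.window), 2)

        def ucb(item):
            if pulls[item] == 0:
                return float("inf")  # unobserved items are explored first
            mean = clicks[item] / pulls[item]
            # Assumed confidence radius; the constant 1.5 is illustrative.
            return mean + math.sqrt(1.5 * math.log(horizon) / pulls[item])

        return sorted(range(self.n_items), key=ucb, reverse=True)[: self.k]

    def update(self, ranked_items, click_pos):
        """Record feedback: items up to the click are observed, the rest are not."""
        last = click_pos if click_pos is not None else len(ranked_items) - 1
        observations = [
            (item, 1.0 if pos == click_pos else 0.0)
            for pos, item in enumerate(ranked_items[: last + 1])
        ]
        self.history.append(observations)
        if len(self.history) > self.window:
            self.history.popleft()  # forget feedback older than the window
```

A simulation would call `rank(t)`, pass the result to `cascade_click` with the current attraction probabilities, and feed the outcome to `update`. Swapping the attraction probabilities at some round mimics the abrupt preference change the abstract describes. A discounted variant would down-weight old observations instead of dropping them.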


Pages: 2859-2865

Number of pages: 7

50 items in total

[1] Non-stationary Dueling Bandits for Online Learning to Rank
Lu, Shiyin
Miao, Yuan
Yang, Ping
Hu, Yao
Zhang, Lijun
[J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 166 - 174
[2] Non-stationary Bandits with Knapsacks
Liu, Shang
Jiang, Jiashuo
Li, Xiaocheng
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[3] Learning Contextual Bandits in a Non-stationary Environment
Wu, Qingyun
Iyer, Naveen
Wang, Hongning
[J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 495 - 504
[4] Online learning of non-stationary sequences
Monteleoni, C
Jaakkola, T
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 1093 - 1100
[5] Unifying Clustered and Non-stationary Bandits
Li, Chuanhao
Wu, Qingyun
Wang, Hongning
[J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[6] Non-stationary Bandits with Heavy Tail
Pan, Weici
Liu, Zhenhua
[J]. Performance Evaluation Review, 2024, 52 (02): 33 - 35
[7] Non-Stationary Representation Learning in Sequential Linear Bandits
Qin, Yuzhen
Menara, Tommaso
Oymak, Samet
Ching, Shinung
Pasqualetti, Fabio
[J]. IEEE Open Journal of Control Systems, 2022, 1 : 41 - 56
[8] Online Learning for Non-Stationary A/B Tests
Medina, Andres Munoz
Vassilvitskii, Sergei
Yin, Dong
[J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 317 - 326
[9] Online Non-stationary Boosting
Pocock, Adam
Yiapanis, Paraskevas
Singer, Jeremy
Lujan, Mikel
Brown, Gavin
[J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2010, 5997 : 205 - 214
[10] Non-stationary lognormal model development and comparison with the non-stationary GEV model
Aissaoui-Fqayeh, I.
El-Adlouni, S.
Ouarda, T. B. M. J.
St-Hilaire, A.
[J]. HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2009, 54 (06): 1141 - 1156
