Cascading Contextual Assortment Bandits

被引：0

作者：

Choi, Hyun-jun ^{[1
]}

Udwani, Rajan ^{[2
]}

Oh, Min-hwan ^{[1
]}

机构：

[1] Seoul Natl Univ, Seoul, South Korea

[2] Univ Calif Berkeley, Berkeley, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a new combinatorial bandit model, the cascading contextual assortment bandit. This model serves as a generalization of both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a T-step regret upper-bound of (O) over tilde (1/kappa d root T), sharper than existing bounds for cascading contextual bandits by eliminating dependence on cascade length K. To improve the dependence on problem-dependent constant., we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves (O) over tilde (d root T) without dependence on kappa in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.

引用

页数：12

共 50 条

[31] Practical Contextual Bandits with Regression Oracles
Foster, Dylan J.
Agarwal, Alekh
Dudik, Miroslav
Luo, Haipeng
Schapire, Robert E.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[32] Robust Contextual Bandits via Bootstrapping
Tang, Qiao
Xie, Hong
Xia, Yunni
Lee, Jia
Zhu, Qingsheng
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12182 - 12189
[33] Adversarial Attacks on Linear Contextual Bandits
Garcelon, Evrard
Roziere, Baptiste
Meunier, Laurent
Tarbouriech, Jean
Teytaud, Olivier
Lazaric, Alessandro
Pirotta, Matteo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
[34] Safe Exploration for Optimizing Contextual Bandits
Jagerman, Rolf
Markov, Ilya
De Rijke, Maarten
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 38 (03)
[35] Adversarial Contextual Bandits Go Kernelized
Neu, Gergely
Olkhovskaya, Julia
Vakili, Sattar
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
[36] Practical Contextual Bandits with Feedback Graphs
Zhang, Mengxiao
Zhang, Yuheng
Vrousgou, Olga
Luo, Haipeng
Mineiro, Paul
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[37] Meritocratic Fairness for Infinite and Contextual Bandits
Joseph, Matthew
Kearns, Michael
Morgenstern, Jamie
Neel, Seth
Roth, Aaron
PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018, : 158 - 163
[38] Adaptive metamorphic testing with contextual bandits
Spieker, Helge
Gotlieb, Arnaud
JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 165
[39] Efficient Kernel UCB for Contextual Bandits
Zenati, Houssam
Bietti, Alberto
Diemert, Eustache
Mairal, Julien
Martin, Matthieu
Gaillard, Pierre
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5689 - 5720
[40] Shuffle Private Linear Contextual Bandits
Chowdhury, Sayak Ray
Zhou, Xingyu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,

← 1 2 3 4 5 →