Cascading Contextual Assortment Bandits

被引：0

作者：

Choi, Hyun-jun ^{[1
]}

Udwani, Rajan ^{[2
]}

Oh, Min-hwan ^{[1
]}

机构：

[1] Seoul Natl Univ, Seoul, South Korea

[2] Univ Calif Berkeley, Berkeley, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a new combinatorial bandit model, the cascading contextual assortment bandit. This model serves as a generalization of both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a T-step regret upper-bound of (O) over tilde (1/kappa d root T), sharper than existing bounds for cascading contextual bandits by eliminating dependence on cascade length K. To improve the dependence on problem-dependent constant., we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves (O) over tilde (d root T) without dependence on kappa in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.

引用

页数：12

共 50 条

[41] Smoothness-Adaptive Contextual Bandits
Gur, Yonatan
Momeni, Ahmadreza
Wager, Stefan
OPERATIONS RESEARCH, 2022, 70 (06) : 3198 - 3216
[42] Differentially Private Contextual Linear Bandits
Shariff, Roshan
Sheffet, Or
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[43] Model Selection for Generic Contextual Bandits
Ghosh, Avishek
Sankararaman, Abishek
Ramchandran, Kannan
IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (01) : 656 - 675
[44] AdaLinUCB: Opportunistic Learning for Contextual Bandits
Guo, Xueying
Wang, Xiaoxiao
Liu, Xin
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2420 - 2427
[45] Contextual Bandits with Online Neural Regression
Deb, Rohan
Ban, Yikun
Zuo, Shiliang
He, Jingrui
Banerjee, Arindam
arXiv, 2023,
[46] Transferable Contextual Bandits with Prior Observations
Labille, Kevin
Huang, Wen
Wu, Xintao
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 398 - 410
[47] Exploratory Search of GANs with Contextual Bandits
Kropotov, Ivan
Medlar, Alan
Glowacka, Dorota
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3157 - 3161
[48] Langevin Monte Carlo for Contextual Bandits
Xu, Pan
Zheng, Hongkai
Mazumdar, Eric
Azizzadenesheli, Kamyar
Anandkumar, Anima
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[49] Data Poisoning Attacks in Contextual Bandits
Ma, Yuzhe
Jun, Kwang-Sung
Li, Lihong
Zhu, Xiaojin
DECISION AND GAME THEORY FOR SECURITY, GAMESEC 2018, 2018, 11199 : 186 - 204
[50] Contextual Bandits With Cross-Learning
Balseiro, Santiago
Golrezaei, Negin
Mahdian, Mohammad
Mirrokni, Vahab
Schneider, Jon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32

← 1 2 3 4 5 →