Cascading Contextual Assortment Bandits

被引：0

作者：

Choi, Hyun-jun ^{[1
]}

Udwani, Rajan ^{[2
]}

Oh, Min-hwan ^{[1
]}

机构：

[1] Seoul Natl Univ, Seoul, South Korea

[2] Univ Calif Berkeley, Berkeley, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a new combinatorial bandit model, the cascading contextual assortment bandit. This model serves as a generalization of both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a T-step regret upper-bound of (O) over tilde (1/kappa d root T), sharper than existing bounds for cascading contextual bandits by eliminating dependence on cascade length K. To improve the dependence on problem-dependent constant., we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves (O) over tilde (d root T) without dependence on kappa in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.

引用

页数：12

共 50 条

[11] Thompson Sampling Algorithms for Cascading Bandits
Zhong, Zixin
Chueng, Wang Chi
Tan, Vincent Y. F.
JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[12] Cost-Aware Cascading Bandits
Gan, Chao
Zhou, Ruida
Yang, Jing
Shen, Cong
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 3692 - 3706
[13] Model selection for contextual bandits
Foster, Dylan J.
Krishnamurthy, Akshay
Luo, Haipeng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[14] Conservative Contextual Linear Bandits
Kazerouni, Abbas
Ghavamzadeh, Mohammad
Abbasi-Yadkori, Yasin
Van Roy, Benjamin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[15] Contextual bandits with similarity information
Slivkins, A. (slivkins@microsoft.com), 1600, Microtome Publishing (15):
[16] Contextual Bandits with Similarity Information
Slivkins, Aleksandrs
JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 2533 - 2568
[17] Expected Improvement for Contextual Bandits
Hung Tran-The
Gupta, Sunil
Sana, Santu
Tuan Truong
Tran-Thanh, Long
Venkatesh, Svetha
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[18] Nonparametric Stochastic Contextual Bandits
Guan, Melody Y.
Jiang, Heinrich
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3119 - 3125
[19] Balanced Linear Contextual Bandits
Dimakopoulou, Maria
Zhou, Zhengyuan
Athey, Susan
Imbens, Guido
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3445 - 3453
[20] Linear Contextual Bandits with Knapsacks
Agrawal, Shipra
Devanur, Nikhil R.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29

← 1 2 3 4 5 →