Cascading Contextual Assortment Bandits

被引:0
|
作者
Choi, Hyun-jun [1 ]
Udwani, Rajan [2 ]
Oh, Min-hwan [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Univ Calif Berkeley, Berkeley, CA USA
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new combinatorial bandit model, the cascading contextual assortment bandit. This model serves as a generalization of both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a T-step regret upper-bound of (O) over tilde (1/kappa d root T), sharper than existing bounds for cascading contextual bandits by eliminating dependence on cascade length K. To improve the dependence on problem-dependent constant., we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves (O) over tilde (d root T) without dependence on kappa in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Thompson Sampling Algorithms for Cascading Bandits
    Zhong, Zixin
    Chueng, Wang Chi
    Tan, Vincent Y. F.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [12] Cost-Aware Cascading Bandits
    Gan, Chao
    Zhou, Ruida
    Yang, Jing
    Shen, Cong
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 3692 - 3706
  • [13] Model selection for contextual bandits
    Foster, Dylan J.
    Krishnamurthy, Akshay
    Luo, Haipeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [14] Conservative Contextual Linear Bandits
    Kazerouni, Abbas
    Ghavamzadeh, Mohammad
    Abbasi-Yadkori, Yasin
    Van Roy, Benjamin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [15] Contextual bandits with similarity information
    Slivkins, A. (slivkins@microsoft.com), 1600, Microtome Publishing (15):
  • [16] Contextual Bandits with Similarity Information
    Slivkins, Aleksandrs
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 2533 - 2568
  • [17] Expected Improvement for Contextual Bandits
    Hung Tran-The
    Gupta, Sunil
    Sana, Santu
    Tuan Truong
    Tran-Thanh, Long
    Venkatesh, Svetha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [18] Nonparametric Stochastic Contextual Bandits
    Guan, Melody Y.
    Jiang, Heinrich
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3119 - 3125
  • [19] Balanced Linear Contextual Bandits
    Dimakopoulou, Maria
    Zhou, Zhengyuan
    Athey, Susan
    Imbens, Guido
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3445 - 3453
  • [20] Linear Contextual Bandits with Knapsacks
    Agrawal, Shipra
    Devanur, Nikhil R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29