Cascading Contextual Assortment Bandits

被引:0
|
作者
Choi, Hyun-jun [1 ]
Udwani, Rajan [2 ]
Oh, Min-hwan [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Univ Calif Berkeley, Berkeley, CA USA
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new combinatorial bandit model, the cascading contextual assortment bandit. This model serves as a generalization of both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a T-step regret upper-bound of (O) over tilde (1/kappa d root T), sharper than existing bounds for cascading contextual bandits by eliminating dependence on cascade length K. To improve the dependence on problem-dependent constant., we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves (O) over tilde (d root T) without dependence on kappa in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Contextual Combinatorial Cascading Bandits
    Li, Shuai
    Wang, Baoxiang
    Zhang, Shengyu
    Chen, Wei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [2] Online Clustering of Contextual Cascading Bandits
    Li, Shuai
    Zhang, Shengyu
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3554 - 3561
  • [3] Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
    Yang, Hantao
    Liu, Xutong
    Wang, Zhiyong
    Xie, Hong
    Lui, John C. S.
    Lian, Defu
    Chen, Enhong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20596 - 20603
  • [4] Combinatorial Cascading Bandits
    Kveton, Branislav
    Wen, Zheng
    Ashkan, Azin
    Szepesvari, Csaba
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [5] Minimax Regret for Cascading Bandits
    Vial, Daniel
    Sanghavi, Sujay
    Shakkottai, Sanjay
    Srikant, R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Contextual Blocking Bandits
    Basu, Soumya
    Papadigenopoulos, Orestis
    Caramanis, Constantine
    Shakkottai, Sanjay
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 271 - +
  • [7] Regularized Contextual Bandits
    Fontaine, Xavier
    Berthet, Quentin
    Perchet, Vianney
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [8] Semiparametric Contextual Bandits
    Krishnamurthy, Akshay
    Wu, Zhiwei Steven
    Syrgkanis, Vasilis
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [9] A Thompson Sampling Algorithm for Cascading Bandits
    Cheung, Wang Chi
    Tan, Vincent Y. F.
    Zhong, Zixin
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 438 - 447
  • [10] Cost-aware Cascading Bandits
    Zhou, Ruida
    Gan, Chao
    Yang, Jing
    Shen, Cong
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3228 - 3234