Cascading Contextual Assortment Bandits

被引:0
|
作者
Choi, Hyun-jun [1 ]
Udwani, Rajan [2 ]
Oh, Min-hwan [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Univ Calif Berkeley, Berkeley, CA USA
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new combinatorial bandit model, the cascading contextual assortment bandit. This model serves as a generalization of both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a T-step regret upper-bound of (O) over tilde (1/kappa d root T), sharper than existing bounds for cascading contextual bandits by eliminating dependence on cascade length K. To improve the dependence on problem-dependent constant., we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves (O) over tilde (d root T) without dependence on kappa in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Smoothness-Adaptive Contextual Bandits
    Gur, Yonatan
    Momeni, Ahmadreza
    Wager, Stefan
    OPERATIONS RESEARCH, 2022, 70 (06) : 3198 - 3216
  • [42] Differentially Private Contextual Linear Bandits
    Shariff, Roshan
    Sheffet, Or
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [43] Model Selection for Generic Contextual Bandits
    Ghosh, Avishek
    Sankararaman, Abishek
    Ramchandran, Kannan
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (01) : 656 - 675
  • [44] AdaLinUCB: Opportunistic Learning for Contextual Bandits
    Guo, Xueying
    Wang, Xiaoxiao
    Liu, Xin
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2420 - 2427
  • [45] Contextual Bandits with Online Neural Regression
    Deb, Rohan
    Ban, Yikun
    Zuo, Shiliang
    He, Jingrui
    Banerjee, Arindam
    arXiv, 2023,
  • [46] Transferable Contextual Bandits with Prior Observations
    Labille, Kevin
    Huang, Wen
    Wu, Xintao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 398 - 410
  • [47] Exploratory Search of GANs with Contextual Bandits
    Kropotov, Ivan
    Medlar, Alan
    Glowacka, Dorota
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3157 - 3161
  • [48] Langevin Monte Carlo for Contextual Bandits
    Xu, Pan
    Zheng, Hongkai
    Mazumdar, Eric
    Azizzadenesheli, Kamyar
    Anandkumar, Anima
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [49] Data Poisoning Attacks in Contextual Bandits
    Ma, Yuzhe
    Jun, Kwang-Sung
    Li, Lihong
    Zhu, Xiaojin
    DECISION AND GAME THEORY FOR SECURITY, GAMESEC 2018, 2018, 11199 : 186 - 204
  • [50] Contextual Bandits With Cross-Learning
    Balseiro, Santiago
    Golrezaei, Negin
    Mahdian, Mohammad
    Mirrokni, Vahab
    Schneider, Jon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32