Cascading Contextual Assortment Bandits

Authors
Choi, Hyun-jun [1]
Udwani, Rajan [2]
Oh, Min-hwan [1]
Affiliations
[1] Seoul Natl Univ, Seoul, South Korea
[2] Univ Calif Berkeley, Berkeley, CA USA
Funding
National Research Foundation, Singapore
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a new combinatorial bandit model, the cascading contextual assortment bandit. This model generalizes both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a $T$-step regret upper bound of $\tilde{O}(\frac{1}{\kappa} d \sqrt{T})$, which is sharper than existing bounds for cascading contextual bandits because it eliminates the dependence on the cascade length $K$. To improve the dependence on the problem-dependent constant $\kappa$, we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves $\tilde{O}(d \sqrt{T})$ regret without dependence on $\kappa$ in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.
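The abstract does not spell out the interaction protocol, so the following is only a minimal sketch of one plausible reading of a model that combines cascading and assortment feedback. It assumes that the user views a sequence of assortments in order, that within each assortment the choice follows a multinomial logit (MNL) model with an outside "no choice" option of weight 1, and that the cascade stops at the first chosen item. The function name `simulate_cascade`, the linear utility `features[i] · theta`, and all parameter names are hypothetical and are not taken from the paper.

```python
import math
import random


def simulate_cascade(assortments, features, theta, rng):
    """Simulate one user pass over a cascade of assortments (illustrative only).

    assortments: list of lists of item indices, in display order.
    features:    list of feature vectors, one per item (assumed linear utility).
    theta:       parameter vector of the assumed MNL model.
    rng:         a random.Random instance.

    Returns (position, item) for the first chosen item, or (None, None)
    if the user leaves every assortment without choosing.
    """
    for position, assortment in enumerate(assortments):
        # Assumed MNL weights: exp(x_i . theta) for each displayed item.
        weights = [
            math.exp(sum(x * w for x, w in zip(features[i], theta)))
            for i in assortment
        ]
        # The extra weight 1.0 is the outside option (no item chosen here).
        options = range(len(assortment) + 1)
        choice = rng.choices(options, weights=weights + [1.0])[0]
        if choice < len(assortment):
            # The user picked an item, so the cascade stops at this position.
            return position, assortment[choice]
        # Otherwise the user moves on to the next assortment.
    return None, None


if __name__ == "__main__":
    rng = random.Random(0)
    features = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [1.0, 1.0]]
    assortments = [[0, 1], [2, 3]]
    print(simulate_cascade(assortments, features, [0.3, -0.2], rng))
```

Under these assumptions, a single assortment recovers an MNL assortment bandit round, and assortments of size one recover a cascading bandit round, which is the sense in which such a model can generalize both.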
Pages: 12