Efficient Kernel UCB for Contextual Bandits

被引:0
|
作者
Zenati, Houssam [1 ,2 ]
Bietti, Alberto [3 ]
Diemert, Eustache [1 ]
Mairal, Julien [2 ]
Martin, Matthieu [1 ]
Gaillard, Pierre [2 ]
机构
[1] Criteo AI Lab, Ann Arbor, MI 48104 USA
[2] INRIA, Grenoble, France
[3] NYU, Ctr Data Sci, New York, NY 10003 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle the computational efficiency of kernelized UCB algorithms in contextual bandits. While standard methods require a O(CT3) complexity where T is the horizon and the constant C is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems. Specifically, our method relies on incremental Nystrom approximations of the joint kernel embedding of contexts and actions. This allows us to achieve a complexity of O(CTm2) where m is the number of Nystrom points. To recover the same regret as the standard kernelized UCB algorithm, m needs to be of order of the effective dimension of the problem, which is at most O(root T) and nearly constant in some cases.
引用
收藏
页码:5689 / 5720
页数:32
相关论文
共 50 条
  • [41] Efficient Explorative Key-Term Selection Strategies for Conversational Contextual Bandits
    Wang, Zhiyong
    Liu, Xutong
    Li, Shuai
    Lui, John C. S.
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10288 - 10295
  • [42] Cornering Stationary and Restless Mixing Bandits with Remix-UCB
    Audiffren, Julien
    Ralaivola, Liva
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [43] UCB-based Algorithms for Multinomial Logistic Regression Bandits
    Amani, Sanae
    Thrampoulidis, Christos
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [44] An Environmentally Sensitive Jamming Bandits Using Improved UCB Method
    Zheng, Yuzhuo
    Wang, Jun
    Mao, Shaoqing
    Han, Dongmei
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 295 - 299
  • [45] Contextual Bandits with Cross-Learning
    Balseiro, Santiago
    Golrezaei, Negin
    Mahdian, Mohammad
    Mirrokni, Vahab
    Schneider, Jon
    MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (03) : 1607 - 1629
  • [46] Signal detection models as contextual bandits
    Sherratt, Thomas N.
    O'Neill, Erica
    ROYAL SOCIETY OPEN SCIENCE, 2023, 10 (06):
  • [47] Neural Contextual Bandits without Regret
    Kassraie, Parnian
    Krause, Andreas
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 240 - 278
  • [48] Stochastic Conservative Contextual Linear Bandits
    Lin, Jiabin
    Lee, Xian Yeow
    Jubery, Talukder
    Moothedath, Shana
    Sarkar, Soumik
    Ganapathysubramanian, Baskar
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 7321 - 7326
  • [49] Practical Contextual Bandits with Regression Oracles
    Foster, Dylan J.
    Agarwal, Alekh
    Dudik, Miroslav
    Luo, Haipeng
    Schapire, Robert E.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [50] Robust Contextual Bandits via Bootstrapping
    Tang, Qiao
    Xie, Hong
    Xia, Yunni
    Lee, Jia
    Zhu, Qingsheng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12182 - 12189