Efficient Kernel UCB for Contextual Bandits

被引:0
|
作者
Zenati, Houssam [1 ,2 ]
Bietti, Alberto [3 ]
Diemert, Eustache [1 ]
Mairal, Julien [2 ]
Martin, Matthieu [1 ]
Gaillard, Pierre [2 ]
机构
[1] Criteo AI Lab, Ann Arbor, MI 48104 USA
[2] INRIA, Grenoble, France
[3] NYU, Ctr Data Sci, New York, NY 10003 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle the computational efficiency of kernelized UCB algorithms in contextual bandits. While standard methods require a O(CT3) complexity where T is the horizon and the constant C is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems. Specifically, our method relies on incremental Nystrom approximations of the joint kernel embedding of contexts and actions. This allows us to achieve a complexity of O(CTm2) where m is the number of Nystrom points. To recover the same regret as the standard kernelized UCB algorithm, m needs to be of order of the effective dimension of the problem, which is at most O(root T) and nearly constant in some cases.
引用
收藏
页码:5689 / 5720
页数:32
相关论文
共 50 条
  • [31] Linear Contextual Bandits with Knapsacks
    Agrawal, Shipra
    Devanur, Nikhil R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [32] Contextual Bandits in A Collaborative Environment
    Wu, Qingyun
    Wang, Huazheng
    Gu, Quanquan
    Wang, Hongning
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 529 - 538
  • [33] Action Centered Contextual Bandits
    Greenewald, Kristjan
    Tewari, Ambuj
    Klasnja, Predrag
    Murphy, Susan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [34] Federated Linear Contextual Bandits
    Huang, Ruiquan
    Wu, Weiqiang
    Yang, Jing
    Shen, Cong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [35] Cascading Contextual Assortment Bandits
    Choi, Hyun-jun
    Udwani, Rajan
    Oh, Min-hwan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [36] Contextual Combinatorial Cascading Bandits
    Li, Shuai
    Wang, Baoxiang
    Zhang, Shengyu
    Chen, Wei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [37] Contextual Bandits with Stochastic Experts
    Sen, Rajat
    Shanmugam, Karthikeyan
    Shakkottai, Sanjay
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [38] Adapting to Misspecification in Contextual Bandits
    Foster, Dylan J.
    Gentile, Claudio
    Mohri, Mehryar
    Zimmert, Julian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [39] Efficient Client Selection Based on Contextual Combinatorial Multi-Arm Bandits
    Shi, Fang
    Lin, Weiwei
    Fan, Lisheng
    Lai, Xiazhi
    Wang, Xiumin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (08) : 5265 - 5277
  • [40] Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination
    Foster, Dylan J.
    Krishnamurthy, Akshay
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34