Efficient Kernel UCB for Contextual Bandits

Cited by: 0
Authors
Zenati, Houssam [1, 2]
Bietti, Alberto [3]
Diemert, Eustache [1]
Mairal, Julien [2]
Martin, Matthieu [1]
Gaillard, Pierre [2]
Affiliations
[1] Criteo AI Lab, Ann Arbor, MI 48104 USA
[2] INRIA, Grenoble, France
[3] NYU, Ctr Data Sci, New York, NY 10003 USA
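Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract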
In this paper, we tackle the computational efficiency of kernelized UCB algorithms in contextual bandits. While standard methods require an $O(CT^3)$ complexity, where $T$ is the horizon and the constant $C$ is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems. Specifically, our method relies on incremental Nyström approximations of the joint kernel embedding of contexts and actions. This allows us to achieve a complexity of $O(CTm^2)$, where $m$ is the number of Nyström points. To recover the same regret as the standard kernelized UCB algorithm, $m$ needs to be of the order of the effective dimension of the problem, which is at most $O(\sqrt{T})$ and nearly constant in some cases.
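To make the idea in the abstract concrete, the following is a minimal sketch of a UCB score computed in an m-dimensional Nyström feature space. It is an illustration under stated assumptions, not the authors' algorithm: the abstract describes incremental Nyström approximations, whereas this sketch uses a fixed, hypothetical set of anchor points and recomputes everything from scratch. The RBF kernel, the function names (`rbf_kernel`, `nystrom_features`, `ucb_scores`) and the parameters `gamma`, `lam`, `beta` and `jitter` are all assumptions introduced here. Each input row is assumed to be a concatenated (context, action) vector.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Pairwise RBF kernel between rows of A and rows of B (assumed kernel choice).
    sq = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def nystrom_features(X, anchors, gamma=1.0, jitter=1e-8):
    # Map rows of X to m-dimensional features z(x) = K_mm^{-1/2} k_m(x),
    # where the m anchors play the role of the Nystrom points.
    K_mm = rbf_kernel(anchors, anchors, gamma) + jitter * np.eye(len(anchors))
    evals, evecs = np.linalg.eigh(K_mm)
    evals = np.maximum(evals, jitter)  # guard against round-off
    inv_sqrt = evecs @ np.diag(1.0 / np.sqrt(evals)) @ evecs.T
    return rbf_kernel(X, anchors, gamma) @ inv_sqrt

def ucb_scores(Z_hist, y_hist, Z_cand, lam=1.0, beta=1.0):
    # Ridge regression in feature space plus an exploration bonus.
    # Only an m x m system is solved, instead of a t x t kernel system.
    m = Z_hist.shape[1]
    A_inv = np.linalg.inv(Z_hist.T @ Z_hist + lam * np.eye(m))
    theta = A_inv @ Z_hist.T @ y_hist
    mean = Z_cand @ theta
    var = np.einsum("ij,jk,ik->i", Z_cand, A_inv, Z_cand)
    return mean + beta * np.sqrt(np.maximum(var, 0.0))
```

A hypothetical round would call `nystrom_features` on the observed (context, action) pairs and on the candidate pairs for the current context, then play the candidate with the largest value returned by `ucb_scores`. Because the matrix being inverted is m x m, its cost does not grow with the number of rounds, which is the intuition behind the dependence on m rather than T in the per-round cost.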
Pages: 5689-5720
Number of pages: 32