Efficient Kernel UCB for Contextual Bandits

Cited by: 0

Authors:

Zenati, Houssam [1,2]

Bietti, Alberto [3]

Diemert, Eustache [1]

Mairal, Julien [2]

Martin, Matthieu [1]

Gaillard, Pierre [2]

Affiliations:

[1] Criteo AI Lab, Ann Arbor, MI 48104 USA

[2] INRIA, Grenoble, France

[3] NYU, Ctr Data Sci, New York, NY 10003 USA

Source:

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022 / Vol. 151

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we tackle the computational efficiency of kernelized UCB algorithms in contextual bandits. While standard methods require O(CT^3) complexity, where T is the horizon and the constant C is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems. Specifically, our method relies on incremental Nyström approximations of the joint kernel embedding of contexts and actions. This allows us to achieve a complexity of O(CTm^2), where m is the number of Nyström points. To recover the same regret as the standard kernelized UCB algorithm, m needs to be of the order of the effective dimension of the problem, which is at most O(√T) and nearly constant in some cases.
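To illustrate why a Nyström approximation brings the per-round cost down to roughly m^2, here is a minimal sketch of a UCB score computed in an m-dimensional Nyström feature space. It is not the authors' algorithm: the anchor points are fixed rather than chosen incrementally, the Gaussian kernel, the class name `NystromUCB`, and the parameters `lam`, `beta`, and `gamma` are all assumptions made for illustration, and the confidence width omits any correction for the approximation error. Each input row is assumed to be a concatenated (context, action) vector.

```python
import numpy as np

def rbf(X, Z, gamma=1.0):
    # Gaussian kernel matrix between rows of X and rows of Z (assumed kernel).
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

class NystromUCB:
    """Hypothetical sketch: linear UCB on fixed Nystrom features."""

    def __init__(self, anchors, lam=1.0, beta=1.0, gamma=1.0):
        self.Z, self.beta, self.gamma = anchors, beta, gamma
        m = len(anchors)
        K_mm = rbf(anchors, anchors, gamma) + 1e-8 * np.eye(m)
        # K_mm^{-1/2} via a symmetric eigendecomposition.
        w, V = np.linalg.eigh(K_mm)
        self.W = (V / np.sqrt(np.maximum(w, 1e-10))) @ V.T
        self.A_inv = np.eye(m) / lam   # inverse of (lam * I + Phi^T Phi)
        self.b = np.zeros(m)           # running sum of reward * phi

    def features(self, X):
        # Nystrom feature map phi(x) = K_mm^{-1/2} k(Z, x), shape (n, m).
        return rbf(X, self.Z, self.gamma) @ self.W

    def ucb(self, X):
        Phi = self.features(X)
        mean = Phi @ (self.A_inv @ self.b)
        var = np.einsum("ij,jk,ik->i", Phi, self.A_inv, Phi)
        return mean + self.beta * np.sqrt(np.maximum(var, 0.0))

    def update(self, x, reward):
        # Sherman-Morrison rank-one update: O(m^2) per round.
        phi = self.features(x[None, :])[0]
        Av = self.A_inv @ phi
        self.A_inv -= np.outer(Av, Av) / (1.0 + phi @ Av)
        self.b += reward * phi

# Usage: score candidate (context, action) rows, play the best, then update.
rng = np.random.default_rng(0)
model = NystromUCB(anchors=rng.normal(size=(5, 3)), beta=2.0)
candidates = rng.normal(size=(4, 3))
best = int(np.argmax(model.ucb(candidates)))
model.update(candidates[best], reward=1.0)
```

Because the m x m inverse is maintained by rank-one updates, no step in this sketch touches a T x T kernel matrix, which is the intuition behind a total cost that scales with T m^2 rather than T^3.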


Pages: 5689-5720

Page count: 32

Related records (50 in total):

[1] Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
Foster, Dylan J.
Rakhlin, Alexander
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[2] Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
Foster, Dylan J.
Rakhlin, Alexander
25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
[3] Interconnected Neural Linear Contextual Bandits with UCB Exploration
Chen, Yang
Xie, Miao
Liu, Jiamou
Zhao, Kaiqi
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 169 - 181
[4] An Efficient Algorithm for Deep Stochastic Contextual Bandits
Zhu, Tan
Liang, Guannan
Zhu, Chunjiang
Li, Haining
Bi, Jinbo
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11193 - 11201
[5] Kernel Methods for Cooperative Multi-Agent Contextual Bandits
Dubey, Abhimanyu
Pentland, Alex
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[6] Kernel Methods for Cooperative Multi-Agent Contextual Bandits
Dubey, Abhimanyu
Pentland, Alex
25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
[7] Thresholding Bandits with Augmented UCB
Mukherjee, Subhojyoti
Purushothama, Naveen Kolar
Sudarsanam, Nandan
Ravindran, Balaraman
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2515 - 2521
[8] Communication Efficient Distributed Learning for Kernelized Contextual Bandits
Li, Chuanhao
Wang, Huazheng
Wang, Mengdi
Wang, Hongning
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[9] Contextual bandits with surrogate losses: Margin bounds and efficient algorithms
Foster, Dylan J.
Krishnamurthy, Akshay
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[10] Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Saha, Aadirupa
Krishnamurthy, Akshay
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
