Contextual bandits with surrogate losses: Margin bounds and efficient algorithms

被引：0

作者：

Foster, Dylan J. ^{[1
]}

Krishnamurthy, Akshay ^{[2
]}

机构：

[1] Cornell Univ, Ithaca, NY 14853 USA

[2] Microsoft Res, Nyc, NY USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We use surrogate losses to obtain several new regret bounds and new algorithms for contextual bandit learning. Using the ramp loss, we derive new margin-based regret bounds in terms of standard sequential complexity measures of a benchmark class of real-valued regression functions. Using the hinge loss, we derive an efficient algorithm with a root dT-type mistake bound against benchmark policies induced by d-dimensional regressors. Under realizability assumptions, our results also yield classical regret bounds.

引用

页数：12

共 50 条

[1] Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Saha, Aadirupa
Krishnamurthy, Akshay
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
[2] Efficient Kernel UCB for Contextual Bandits
Zenati, Houssam
Bietti, Alberto
Diemert, Eustache
Mairal, Julien
Martin, Matthieu
Gaillard, Pierre
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5689 - 5720
[3] Optimal Algorithms for Stochastic Contextual Preference Bandits
Saha, Aadirupa
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[4] Surrogate Regret Bounds for Polyhedral Losses
Frongillo, Rafael
Waggoner, Bo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
[5] A Reduction from Linear Contextual Bandits Lower Bounds to Estimations Lower Bounds
He, Jiahao
Zhang, Jiheng
Zhang, Rachel Q.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[6] Efficient Algorithms for Extreme Bandits
Baudry, Dorian
Russac, Yoan
Kaufmann, Emilie
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[7] An Efficient Algorithm for Deep Stochastic Contextual Bandits
Zhu, Tan
Liang, Guannan
Zhu, Chunjiang
Li, Haining
Bi, Jinbo
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11193 - 11201
[8] Mostly Exploration-Free Algorithms for Contextual Bandits
Bastani, Hamsa
Bayati, Mohsen
Khosravi, Khashayar
MANAGEMENT SCIENCE, 2021, 67 (03) : 1329 - 1349
[9] Instance-optimal PAC Algorithms for Contextual Bandits
Li, Zhaoqi
Ratliff, Lillian
Nassif, Houssam
Jamieson, Kevin
Jain, Lalit
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[10] Generalized Contextual Bandits With Latent Features: Algorithms and Applications
Xu, Xiongxiao
Xie, Hong
Lui, John C. S.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4763 - 4775

← 1 2 3 4 5 →