Contextual bandits with surrogate losses: Margin bounds and efficient algorithms

被引:0
|
作者
Foster, Dylan J. [1 ]
Krishnamurthy, Akshay [2 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Microsoft Res, Nyc, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We use surrogate losses to obtain several new regret bounds and new algorithms for contextual bandit learning. Using the ramp loss, we derive new margin-based regret bounds in terms of standard sequential complexity measures of a benchmark class of real-valued regression functions. Using the hinge loss, we derive an efficient algorithm with a root dT-type mistake bound against benchmark policies induced by d-dimensional regressors. Under realizability assumptions, our results also yield classical regret bounds.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
    Saha, Aadirupa
    Krishnamurthy, Akshay
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [2] Efficient Kernel UCB for Contextual Bandits
    Zenati, Houssam
    Bietti, Alberto
    Diemert, Eustache
    Mairal, Julien
    Martin, Matthieu
    Gaillard, Pierre
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5689 - 5720
  • [3] Optimal Algorithms for Stochastic Contextual Preference Bandits
    Saha, Aadirupa
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Surrogate Regret Bounds for Polyhedral Losses
    Frongillo, Rafael
    Waggoner, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [5] A Reduction from Linear Contextual Bandits Lower Bounds to Estimations Lower Bounds
    He, Jiahao
    Zhang, Jiheng
    Zhang, Rachel Q.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [6] Efficient Algorithms for Extreme Bandits
    Baudry, Dorian
    Russac, Yoan
    Kaufmann, Emilie
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [7] An Efficient Algorithm for Deep Stochastic Contextual Bandits
    Zhu, Tan
    Liang, Guannan
    Zhu, Chunjiang
    Li, Haining
    Bi, Jinbo
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11193 - 11201
  • [8] Mostly Exploration-Free Algorithms for Contextual Bandits
    Bastani, Hamsa
    Bayati, Mohsen
    Khosravi, Khashayar
    MANAGEMENT SCIENCE, 2021, 67 (03) : 1329 - 1349
  • [9] Instance-optimal PAC Algorithms for Contextual Bandits
    Li, Zhaoqi
    Ratliff, Lillian
    Nassif, Houssam
    Jamieson, Kevin
    Jain, Lalit
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] Generalized Contextual Bandits With Latent Features: Algorithms and Applications
    Xu, Xiongxiao
    Xie, Hong
    Lui, John C. S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4763 - 4775