Tight First- and Second-Order Regret Bounds for Adversarial Linear Bandits

被引:0
|
作者
Ito, Shinji [1 ]
Hirahara, Shuichi [2 ]
Soma, Tasuku [3 ]
Yoshida, Yuichi [2 ]
机构
[1] NEC Corp Ltd, Tokyo, Japan
[2] Natl Inst Informat, Tokyo, Japan
[3] Univ Tokyo, Tokyo, Japan
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose novel algorithms with first- and second-order regret bounds for adversarial linear bandits. These regret bounds imply that our algorithms perform well when there is an action achieving a small cumulative loss or the loss has a small variance. In addition, we need only assumptions weaker than those of existing algorithms; our algorithms work on discrete action sets as well as continuous ones without a priori knowledge about losses, and they run efficiently if a linear optimization oracle for the action set is available. These results are obtained by combining optimistic online optimization, continuous multiplicative weight update methods, and a novel technique that we refer to as distribution truncation. We also show that the regret bounds of our algorithms are tight up to polylogarithmic factors.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] First- and Second-Order Bounds for Adversarial Linear Contextual Bandits
    Olkhovskaya, Julia
    Mayo, Jack
    van Erven, Tim
    Neu, Gergely
    Wei, Chen-Yu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] On first- and second-order conditions for error bounds
    Huang, LR
    Ng, KF
    SIAM JOURNAL ON OPTIMIZATION, 2004, 14 (04) : 1057 - 1073
  • [3] Hybrid Regret Bounds for Combinatorial Semi-Bandits and Adversarial Linear Bandits
    Ito, Shinji
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Tight Regret Bounds for Infinite-armed Linear Contextual Bandits
    Li, Yingkai
    Wang, Yining
    Chen, Xi
    Zhou, Yuan
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 370 - 378
  • [5] Gaslighting, First- and Second-Order
    Catapang Podosky, Paul-Mikhail
    HYPATIA-A JOURNAL OF FEMINIST PHILOSOPHY, 2021, 36 (01): : 207 - 227
  • [6] Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds
    Mitra, Aritra
    Adibi, Arman
    Pappas, George J.
    Hassani, Hamed
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] FIRST- AND SECOND-ORDER CORRECTIONS TO MAGNETIC MOMENT FOR LINEAR MULTIPOLES
    HOWARD, JE
    PHYSICS OF FLUIDS, 1968, 11 (07) : 1569 - &
  • [8] Integration of first- and second-order orientation
    Allen, HA
    Hess, RF
    Mansouri, B
    Dakin, SC
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2003, 20 (06) : 974 - 986
  • [9] First- and second-order perturbations of hypersurfaces
    Mars, M
    CLASSICAL AND QUANTUM GRAVITY, 2005, 22 (16) : 3325 - 3347
  • [10] First- and second-order Poisson spots
    Kelly, William R.
    Shirley, Eric L.
    Migdall, Alan L.
    Polyakov, Sergey V.
    Hendrix, Kurt
    AMERICAN JOURNAL OF PHYSICS, 2009, 77 (08) : 713 - 720