Tight First- and Second-Order Regret Bounds for Adversarial Linear Bandits

被引:0
|
作者
Ito, Shinji [1 ]
Hirahara, Shuichi [2 ]
Soma, Tasuku [3 ]
Yoshida, Yuichi [2 ]
机构
[1] NEC Corp Ltd, Tokyo, Japan
[2] Natl Inst Informat, Tokyo, Japan
[3] Univ Tokyo, Tokyo, Japan
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose novel algorithms with first- and second-order regret bounds for adversarial linear bandits. These regret bounds imply that our algorithms perform well when there is an action achieving a small cumulative loss or the loss has a small variance. In addition, we need only assumptions weaker than those of existing algorithms; our algorithms work on discrete action sets as well as continuous ones without a priori knowledge about losses, and they run efficiently if a linear optimization oracle for the action set is available. These results are obtained by combining optimistic online optimization, continuous multiplicative weight update methods, and a novel technique that we refer to as distribution truncation. We also show that the regret bounds of our algorithms are tight up to polylogarithmic factors.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] First- and second-order methods for semidefinite programming
    Monteiro, RDC
    MATHEMATICAL PROGRAMMING, 2003, 97 (1-2) : 209 - 244
  • [22] First- and Second-Order Logic of Mass Terms
    Peter Roeper
    Journal of Philosophical Logic, 2004, 33 : 261 - 297
  • [23] First- and second-order Greeks in the Heston model
    Chan, Jiun Hong
    Joshi, Mark
    Zhu, Dan
    JOURNAL OF RISK, 2015, 17 (04): : 19 - 69
  • [24] First- and second-order dynamic equations with impulse
    Atici, F. M.
    Biles, D. C.
    ADVANCES IN DIFFERENCE EQUATIONS, 2005, 2005 (02) : 119 - 132
  • [25] INSURANCE WITH BORROWING: FIRST- AND SECOND-ORDER APPROXIMATIONS
    Borovkov, A. A.
    ADVANCES IN APPLIED PROBABILITY, 2009, 41 (04) : 1141 - 1160
  • [26] Microkinetics of the first- and second-order phase transitions
    Stepanov, VA
    PHASE TRANSITIONS, 2005, 78 (7-8) : 607 - 619
  • [27] CONCOMITANT FIRST- AND SECOND-ORDER NUCLEOPHILIC SUBSTITUTION
    CASAPIERI, P
    SWART, ER
    JOURNAL OF THE CHEMICAL SOCIETY, 1961, (OCT): : 4342 - &
  • [28] Testing first- and second-order stochastic dominance
    Xu, K
    Fisher, G
    Willson, D
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 1996, 29 : S562 - S564
  • [29] First- and second-order processing in transient stereopsis
    Edwards, M
    Pope, DR
    Schor, CM
    VISION RESEARCH, 2000, 40 (19) : 2645 - 2651
  • [30] Survey measures of first- and second-order competences
    Danneels, Erwin
    STRATEGIC MANAGEMENT JOURNAL, 2016, 37 (10) : 2174 - 2188