Kernel-Based Methods for Bandit Convex Optimization

被引:67
|
作者
Bubeck, Sebastien [1 ]
Lee, Yin Tat [1 ]
Eldan, Ronen [2 ]
机构
[1] Microsoft Res, Cambridge, MA 02142 USA
[2] Weizmann Inst Sci, Rehovot, Israel
关键词
multi-armed bandit; online learning; convex optimization;
D O I
10.1145/3055399.3055403
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the adversarial convex bandit problem and we build the first poly(T)-time algorithm with poly(n)root T-regret for this problem. To do so we introduce three new ideas in the derivative-free optimization literature: (i) kernel methods, (ii) a generalization of Bernoulli convolutions, and (iii) a new annealing schedule for exponential weights (with increasing learning rate). The basic version of our algorithm achieves (O) over tilde (n(9.5)root T)-regret, and we show that a simple variant of this algorithm can be run in poly(n log(T))-time per step at the cost of an additional poly(n)T degrees((1)) factor in the regret. These results improve upon the (O) over tilde (n(11)root T)-regret and exp(poly(T))-time result of the first two authors, and the log(T)(poly(n))root T-regret and log(T)(poly(n))-time result of Hazan and Li. Furthermore we conjecture that another variant of the algorithm could achieve (O) over tilde (n(1.5)root T)-regret, and moreover that this regret is unimprovable (the current best lower bound being Omega(n root T)and it is achieved with linear functions). For the simpler situation of zeroth order stochastic convex optimization this corresponds to the conjecture that the optimal query complexity is of order n(3)/epsilon(2).
引用
收藏
页码:72 / 85
页数:14
相关论文
共 50 条
  • [41] A Kernel-Based Indicator for Multi/Many-Objective Optimization
    Cai, Xinye
    Xiao, Yushun
    Li, Zhenhua
    Sun, Qi
    Xu, Hanchuan
    Li, Miqing
    Ishibuchi, Hisao
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (04) : 602 - 615
  • [42] Bayesian Optimization of Hyperparameters in Kernel-Based Delay Rational Models
    Treviso, Felipe
    Trinchero, Riccardo
    Canavero, Flavio G.
    [J]. SPI 2021: 25TH IEEE WORKSHOP ON SIGNAL AND POWER INTEGRITY, 2021,
  • [43] An experimental evaluation of linear and kernel-based methods for face recognition
    Gupta, H
    Agrawal, AK
    Pruthi, T
    Shekhar, C
    Chellappa, R
    [J]. SIXTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2002, : 13 - 18
  • [44] Input space versus feature space in kernel-based methods
    Schölkopf, B
    Mika, S
    Burges, CJC
    Knirsch, P
    Müller, KR
    Rätsch, G
    Smola, AJ
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1000 - 1017
  • [45] Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods
    Liang, Kevin J.
    Wang, Guoyin
    Li, Yitong
    Henao, Ricardo
    Carin, Lawrence
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [46] Classifying 'drug-likeness' with kernel-based learning methods
    Müller, KR
    Rätsch, G
    Sonnenburg, S
    Mika, S
    Grimm, M
    Heinrich, N
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (02) : 249 - 253
  • [47] Kernel-Based Aggregating Learning System for Online Portfolio Optimization
    Wang, Xin
    Sun, Tao
    Liu, Zhi
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [48] Robust EM kernel-based methods for linear system identification
    Bottegal, Giulio
    Aravkin, Aleksandr Y.
    Hjalmarsson, Hakan
    Pillonetto, Gianluigi
    [J]. AUTOMATICA, 2016, 67 : 114 - 126
  • [49] Kernel-based learning methods for stochastic partial differential equations
    Ye, Qi
    [J]. ENGINEERING ANALYSIS WITH BOUNDARY ELEMENTS, 2024, 169
  • [50] On asymptotic properties of hyperparameter estimators for kernel-based regularization methods
    Mu, Biqiang
    Chen, Tianshi
    Ljung, Lennart
    [J]. AUTOMATICA, 2018, 94 : 381 - 395