Kernel-Based Methods for Bandit Convex Optimization

被引：67

作者：

Bubeck, Sebastien ^{[1
]}

Lee, Yin Tat ^{[1
]}

Eldan, Ronen ^{[2
]}

机构：

[1] Microsoft Res, Cambridge, MA 02142 USA

[2] Weizmann Inst Sci, Rehovot, Israel

来源：

STOC'17: PROCEEDINGS OF THE 49TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING | 2017年

关键词：

multi-armed bandit; online learning; convex optimization;

D O I：

10.1145/3055399.3055403

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We consider the adversarial convex bandit problem and we build the first poly(T)-time algorithm with poly(n)root T-regret for this problem. To do so we introduce three new ideas in the derivative-free optimization literature: (i) kernel methods, (ii) a generalization of Bernoulli convolutions, and (iii) a new annealing schedule for exponential weights (with increasing learning rate). The basic version of our algorithm achieves (O) over tilde (n(9.5)root T)-regret, and we show that a simple variant of this algorithm can be run in poly(n log(T))-time per step at the cost of an additional poly(n)T degrees((1)) factor in the regret. These results improve upon the (O) over tilde (n(11)root T)-regret and exp(poly(T))-time result of the first two authors, and the log(T)(poly(n))root T-regret and log(T)(poly(n))-time result of Hazan and Li. Furthermore we conjecture that another variant of the algorithm could achieve (O) over tilde (n(1.5)root T)-regret, and moreover that this regret is unimprovable (the current best lower bound being Omega(n root T)and it is achieved with linear functions). For the simpler situation of zeroth order stochastic convex optimization this corresponds to the conjecture that the optimal query complexity is of order n(3)/epsilon(2).

引用

页码：72 / 85

页数：14

共 50 条

[41] A Kernel-Based Indicator for Multi/Many-Objective Optimization
Cai, Xinye
Xiao, Yushun
Li, Zhenhua
Sun, Qi
Xu, Hanchuan
Li, Miqing
Ishibuchi, Hisao
[J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (04) : 602 - 615
[42] Bayesian Optimization of Hyperparameters in Kernel-Based Delay Rational Models
Treviso, Felipe
Trinchero, Riccardo
Canavero, Flavio G.
[J]. SPI 2021: 25TH IEEE WORKSHOP ON SIGNAL AND POWER INTEGRITY, 2021,
[43] An experimental evaluation of linear and kernel-based methods for face recognition
Gupta, H
Agrawal, AK
Pruthi, T
Shekhar, C
Chellappa, R
[J]. SIXTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2002, : 13 - 18
[44] Input space versus feature space in kernel-based methods
Schölkopf, B
Mika, S
Burges, CJC
Knirsch, P
Müller, KR
Rätsch, G
Smola, AJ
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1000 - 1017
[45] Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods
Liang, Kevin J.
Wang, Guoyin
Li, Yitong
Henao, Ricardo
Carin, Lawrence
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[46] Classifying 'drug-likeness' with kernel-based learning methods
Müller, KR
Rätsch, G
Sonnenburg, S
Mika, S
Grimm, M
Heinrich, N
[J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (02) : 249 - 253
[47] Kernel-Based Aggregating Learning System for Online Portfolio Optimization
Wang, Xin
Sun, Tao
Liu, Zhi
[J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
[48] Robust EM kernel-based methods for linear system identification
Bottegal, Giulio
Aravkin, Aleksandr Y.
Hjalmarsson, Hakan
Pillonetto, Gianluigi
[J]. AUTOMATICA, 2016, 67 : 114 - 126
[49] Kernel-based learning methods for stochastic partial differential equations
Ye, Qi
[J]. ENGINEERING ANALYSIS WITH BOUNDARY ELEMENTS, 2024, 169
[50] On asymptotic properties of hyperparameter estimators for kernel-based regularization methods
Mu, Biqiang
Chen, Tianshi
Ljung, Lennart
[J]. AUTOMATICA, 2018, 94 : 381 - 395

← 1 2 3 4 5 →