Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit

被引:2
|
作者
Li, Ke [1 ]
Yang, Yun [1 ]
Narisetty, Naveen N. [1 ]
机构
[1] Univ Illinois, Dept Stat, Champaign, IL 61820 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2021年 / 15卷 / 02期
关键词
Contextual linear bandit; high-dimension; minimax regret; sparsity; upper confidence bound; VARIABLE SELECTION;
D O I
10.1214/21-EJS1909
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, we consider the multi-armed bandit problem with high-dimensional features. First, we prove a minimax lower bound, O((log d)(alpha+1/2) T (1-alpha/2) + logT), for the cumulative regret, in terms of horizon T, dimension d and a margin parameter alpha is an element of [0, 1], which controls the separation between the optimal and the sub-optimal arms. This new lower bound unifies existing regret bound results that have different dependencies on T due to the use of different values of margin parameter a explicitly implied by their assumptions. Second, we propose a simple and computationally efficient algorithm inspired by the general Upper Confidence Bound (UCB) strategy that achieves a regret upper bound matching the lower bound. The proposed algorithm uses a properly centered l(1)-ball as the confidence set in contrast to the commonly used ellipsoid confidence set. In addition, the algorithm does not require any forced sampling step and is thereby adaptive to the practically unknown margin parameter. Simulations and a real data analysis are conducted to compare the proposed method with existing ones in the literature.
引用
收藏
页码:5652 / 5695
页数:44
相关论文
共 50 条
  • [41] Low-Rank Bandit Methods for High-Dimensional Dynamic Pricing
    Mueller, Jonas
    Syrgkanis, Vasilis
    Taddy, Matt
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [42] Bayes Optimal Learning in High-Dimensional Linear Regression With Network Side Information
    Nandy, Sagnik
    Sen, Subhabrata
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2025, 71 (01) : 565 - 591
  • [43] An upper bound on the number of high-dimensional permutations
    Nathan Linial
    Zur Luria
    Combinatorica, 2014, 34 : 471 - 486
  • [44] Practical bound for dimensionality in high-dimensional entanglement
    Romero, Jacquiline
    Padgett, Miles
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (17) : 6122 - 6123
  • [45] An upper bound on the number of high-dimensional permutations
    Linial, Nathan
    Luria, Zur
    COMBINATORICA, 2014, 34 (04) : 471 - 486
  • [46] Disordered high-dimensional optimal control
    Urbani, Pierfrancesco
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2021, 54 (32)
  • [47] Hallucinating optimal high-dimensional subspaces
    Arandjelovic, Ognjen
    PATTERN RECOGNITION, 2014, 47 (08) : 2662 - 2672
  • [48] Spatial bound whale optimization algorithm: an efficient high-dimensional feature selection approach
    Jingwei Too
    Majdi Mafarja
    Seyedali Mirjalili
    Neural Computing and Applications, 2021, 33 : 16229 - 16250
  • [49] Spatial bound whale optimization algorithm: an efficient high-dimensional feature selection approach
    Too, Jingwei
    Mafarja, Majdi
    Mirjalili, Seyedali
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (23): : 16229 - 16250
  • [50] An adaptive algorithm based on high-dimensional function approximation to obtain optimal designs
    Seufert, Philipp
    Schwientek, Jan
    Bortz, Michael
    arXiv, 2021,