Efficient and Robust High-Dimensional Linear Contextual Bandits

被引:0
|
作者
Chen, Cheng [1 ]
Luo, Luo [2 ]
Zhang, Weinan [1 ]
Yu, Yong [1 ]
Lian, Yijiang [3 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Baidu, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The linear contextual bandits is a sequential decision-making problem where an agent decides among sequential actions given their corresponding contexts. Since large-scale data sets become more and more common, we study the linear contextual bandits in high-dimensional situations. Recent works focus on employing matrix sketching methods to accelerating contextual bandits. However, the matrix approximation error will bring additional terms to the regret bound. In this paper we first propose a novel matrix sketching method which is called Spectral Compensation Frequent Directions (SCFD). Then we propose an efficient approach for contextual bandits by adopting SCFD to approximate the covariance matrices. By maintaining and manipulating sketched matrices, our method only needs O(md) space and O (md) update time in each round, where d is the dimensionality of the data and m is the sketching size. Theoretical analysis reveals that our method has better regret bounds than previous methods in high-dimensional cases. Experimental results demonstrate the effectiveness of our algorithm and verify our theoretical guarantees.
引用
收藏
页码:4259 / 4265
页数:7
相关论文
共 50 条
  • [41] Semiparametric efficient estimation in high-dimensional partial linear regression models
    Fu, Xinyu
    Huang, Mian
    Yao, Weixin
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2024, 51 (03) : 1259 - 1287
  • [42] Robust transfer learning for high-dimensional quantile regression model with linear constraints
    Longjie Cao
    Yunquan Song
    [J]. Applied Intelligence, 2024, 54 : 1263 - 1274
  • [43] Robust PCA for high-dimensional data
    Hubert, M
    Rousseeuw, PJ
    Verboven, S
    [J]. DEVELOPMENTS IN ROBUST STATISTICS, 2003, : 169 - 179
  • [44] On robust regression with high-dimensional predictors
    El Karoui, Noureddine
    Bean, Derek
    Bickel, Peter J.
    Lim, Chinghway
    Yu, Bin
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (36) : 14557 - 14562
  • [45] Robust transfer learning for high-dimensional quantile regression model with linear constraints
    Cao, Longjie
    Song, Yunquan
    [J]. APPLIED INTELLIGENCE, 2024, 54 (02) : 1263 - 1274
  • [46] High-Dimensional Distributionally Robust Mean-Variance Efficient Portfolio Selection
    Zhang, Zhonghui
    Jing, Huarui
    Kao, Chihwa
    [J]. MATHEMATICS, 2023, 11 (05)
  • [47] Stochastic Linear Contextual Bandits with Diverse Contexts
    Wu, Weiqiang
    Yang, Jing
    Shen, Cong
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [48] Design of Experiments for Stochastic Contextual Linear Bandits
    Zanette, Andrea
    Dong, Kefan
    Lee, Jonathan
    Brunskill, Emma
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [49] Discussion of "A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression"
    Li, Xiudi
    Shojaie, Ali
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1717 - 1719
  • [50] Comment on "A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression"
    Loh, Po-Ling
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1715 - 1716