Efficient and Robust High-Dimensional Linear Contextual Bandits

Authors
Chen, Cheng [1]
Luo, Luo [2]
Zhang, Weinan [1]
Yu, Yong [1]
Lian, Yijiang [3]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Baidu, Beijing, Peoples R China
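Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract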
The linear contextual bandit problem is a sequential decision-making problem in which an agent chooses among sequential actions given their corresponding contexts. As large-scale data sets become increasingly common, we study linear contextual bandits in high-dimensional settings. Recent works focus on employing matrix sketching methods to accelerate contextual bandits. However, the matrix approximation error introduces additional terms into the regret bound. In this paper, we first propose a novel matrix sketching method called Spectral Compensation Frequent Directions (SCFD). We then propose an efficient approach for contextual bandits that adopts SCFD to approximate the covariance matrices. By maintaining and manipulating sketched matrices, our method needs only O(md) space and O(md) update time in each round, where d is the dimensionality of the data and m is the sketching size. Theoretical analysis reveals that our method has better regret bounds than previous methods in high-dimensional cases. Experimental results demonstrate the effectiveness of our algorithm and verify our theoretical guarantees.
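For orientation, the kind of covariance sketching the abstract refers to can be illustrated with the standard Frequent Directions algorithm. The sketch below is not SCFD, whose details this record does not give; it is the plain baseline, written with NumPy. The function name `fd_update` and the sizes `n`, `d`, `m` are illustrative assumptions. It keeps an m-by-d matrix B so that B^T B approximates the d-by-d covariance A^T A, and it tracks the total spectral mass removed, which bounds the approximation error. This naive version recomputes an SVD on every row, so it costs O(m^2 d) per update rather than the O(md) stated in the abstract.

```python
import numpy as np


def fd_update(sketch, x):
    """One Frequent Directions step: fold row x into an (m, d) sketch.

    Returns the new sketch and `delta`, the amount subtracted from every
    squared singular value in this step.
    """
    m, d = sketch.shape
    stacked = np.vstack([sketch, x])  # shape (m + 1, d)
    _, s, vt = np.linalg.svd(stacked, full_matrices=False)
    # With m + 1 singular values, s[m] is the smallest one; shrinking by its
    # square zeroes the last direction, so m rows suffice again.
    delta = s[m] ** 2 if s.size > m else 0.0
    k = min(m, s.size)
    shrunk = np.sqrt(np.maximum(s[:k] ** 2 - delta, 0.0))
    new_sketch = np.zeros((m, d))
    new_sketch[:k] = shrunk[:, None] * vt[:k]
    return new_sketch, delta


# Illustrative sizes (assumptions): n rounds, dimension d, sketch size m.
rng = np.random.default_rng(0)
n, d, m = 200, 30, 8
A = rng.standard_normal((n, d))  # one context vector per round

B = np.zeros((m, d))
total_delta = 0.0
for row in A:
    B, delta = fd_update(B, row)
    total_delta += delta

# Spectral-norm error of the sketched covariance and its known bounds.
err = np.linalg.norm(A.T @ A - B.T @ B, 2)
bound = np.linalg.norm(A, "fro") ** 2 / (m + 1)
print(f"error={err:.3f}  total_delta={total_delta:.3f}  bound={bound:.3f}")
```

The sketch uses O(md) memory regardless of the number of rounds. The accumulated `total_delta` is the approximation error that, per the abstract, adds extra terms to the regret bound when a sketch replaces the exact covariance matrix.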
Pages: 4259 - 4265
Number of pages: 7