Sparse Linear Contextual Bandits via Relevance Vector Machines

被引:0
|
作者
Gilton, Davis [1 ]
Willett, Rebecca [1 ]
机构
[1] Univ Wisconsin, Elect & Comp Engn, Madison, WI 53706 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a linear multi-armed bandit algorithm that exploits sparsity in the underlying unknown weight vector controlling rewards. In linear multi-armed bandits, a user chooses a sequence of (slot machine) "arms" to pull, and each arm pull results in the user receiving a stochastic reward with mean equal to the inner product between a known feature vector associated with the arm and an unknown weight vector. While linear bandit algorithms have been widely considered in the literature, relatively little is known about how to exploit sparsity in the weight vector. This paper describes a novel approach that leverages ideas from linear Thompson sampling and relevance vector machines, resulting in a scalable approach that adapts to the unknown sparse support. Theoretical regret bounds highlight the proposed algorithm's performance as a function of the sparsity level, and simulations illustrate the advantages of the proposed method over several competing approaches.
引用
收藏
页码:518 / 522
页数:5
相关论文
共 50 条
  • [1] Privacy Amplification via Shuffling for Linear Contextual Bandits
    Garcelon, Evrard
    Chaudhuri, Kamalika
    Perchet, Vianney
    Pirotta, Matteo
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [2] Optimal Multitask Linear Regression and Contextual Bandits under Sparse Heterogeneity
    Huang, Xinmeng
    Xu, Kan
    Lee, Donghwan
    Hassani, Hamed
    Bastani, Hamsa
    Dobriban, Edgar
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2025,
  • [3] Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits
    Chakraborty, Sunrit
    Roy, Saptarshi
    Tewari, Ambuj
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [4] Contextual Bandits With Hidden Features to Online Recommendation via Sparse Interactions
    Yang, Shangdong
    Zhang, Chenyu
    Gao, Yang
    Wang, Hao
    IEEE INTELLIGENT SYSTEMS, 2020, 35 (05) : 62 - 71
  • [5] Relevance Vector Machines: Sparse Classification Methods for QSAR
    Burden, Frank R.
    Winkler, David A.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (08) : 1529 - 1534
  • [6] Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits
    Ren, Zhimei
    Zhou, Zhengyuan
    MANAGEMENT SCIENCE, 2024, 70 (02) : 1315 - 1342
  • [7] Conservative Contextual Linear Bandits
    Kazerouni, Abbas
    Ghavamzadeh, Mohammad
    Abbasi-Yadkori, Yasin
    Van Roy, Benjamin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [8] Balanced Linear Contextual Bandits
    Dimakopoulou, Maria
    Zhou, Zhengyuan
    Athey, Susan
    Imbens, Guido
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3445 - 3453
  • [9] Linear Contextual Bandits with Knapsacks
    Agrawal, Shipra
    Devanur, Nikhil R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [10] Federated Linear Contextual Bandits
    Huang, Ruiquan
    Wu, Weiqiang
    Yang, Jing
    Shen, Cong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34