Sparse Linear Contextual Bandits via Relevance Vector Machines

被引:0
|
作者
Gilton, Davis [1 ]
Willett, Rebecca [1 ]
机构
[1] Univ Wisconsin, Elect & Comp Engn, Madison, WI 53706 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a linear multi-armed bandit algorithm that exploits sparsity in the underlying unknown weight vector controlling rewards. In linear multi-armed bandits, a user chooses a sequence of (slot machine) "arms" to pull, and each arm pull results in the user receiving a stochastic reward with mean equal to the inner product between a known feature vector associated with the arm and an unknown weight vector. While linear bandit algorithms have been widely considered in the literature, relatively little is known about how to exploit sparsity in the weight vector. This paper describes a novel approach that leverages ideas from linear Thompson sampling and relevance vector machines, resulting in a scalable approach that adapts to the unknown sparse support. Theoretical regret bounds highlight the proposed algorithm's performance as a function of the sparsity level, and simulations illustrate the advantages of the proposed method over several competing approaches.
引用
收藏
页码:518 / 522
页数:5
相关论文
共 50 条
  • [31] Interconnected Neural Linear Contextual Bandits with UCB Exploration
    Chen, Yang
    Xie, Miao
    Liu, Jiamou
    Zhao, Kaiqi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 169 - 181
  • [32] Learning in Generalized Linear Contextual Bandits with Stochastic Delays
    Zhou, Zhengyuan
    Xu, Renyuan
    Blanchet, Jose
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [33] The Contextual Lasso: Sparse Linear Models via Deep Neural Networks
    Thompson, Ryan
    Dezfouli, Amir
    Kohn, Robert
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [34] Provably Optimal Algorithms for Generalized Linear Contextual Bandits
    Li, Lihong
    Lu, Yu
    Zhou, Dengyong
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [35] Sparse Least Squares Support Vector Machines via Genetic Algorithms
    Silva, Juliana Peixoto
    da Rocha Neto, Ajalmar R.
    2013 1ST BRICS COUNTRIES CONGRESS ON COMPUTATIONAL INTELLIGENCE AND 11TH BRAZILIAN CONGRESS ON COMPUTATIONAL INTELLIGENCE (BRICS-CCI & CBIC), 2013, : 248 - 253
  • [36] Efficient Batched Algorithm for Contextual Linear Bandits with Large Action Space via Soft Elimination
    Hanna, Osama A.
    Yang, Lin F.
    Fragouli, Christina
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [37] Automatic Ad Format Selection via Contextual Bandits
    Tang, Liang
    Rosales, Romer
    Singh, Ajit
    Agarwal, Deepak
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1587 - 1594
  • [38] Distributed Online Learning via Cooperative Contextual Bandits
    Tekin, Cem
    van der Schaar, Mihaela
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (14) : 3700 - 3714
  • [39] Massive data discrimination via linear support vector machines
    Bradley, PS
    Mangasarian, OL
    OPTIMIZATION METHODS & SOFTWARE, 2000, 13 (01): : 1 - 10
  • [40] Algorithms for Sparse Support Vector Machines
    Landeros, Alfonso
    Lange, Kenneth
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (03) : 1097 - 1108