Sparse Linear Contextual Bandits via Relevance Vector Machines

Cited by: 0

Authors:

Gilton, Davis [1]

Willett, Rebecca [1]

Affiliations:

[1] Univ Wisconsin, Elect & Comp Engn, Madison, WI 53706 USA

Source:

2017 INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA) | 2017


DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, methods];

Discipline classification code:

081202;

Abstract:

This paper describes a linear multi-armed bandit algorithm that exploits sparsity in the underlying unknown weight vector controlling rewards. In linear multi-armed bandits, a user chooses a sequence of (slot machine) "arms" to pull, and each arm pull results in the user receiving a stochastic reward with mean equal to the inner product between a known feature vector associated with the arm and an unknown weight vector. While linear bandit algorithms have been widely considered in the literature, relatively little is known about how to exploit sparsity in the weight vector. This paper describes a novel approach that leverages ideas from linear Thompson sampling and relevance vector machines, resulting in a scalable approach that adapts to the unknown sparse support. Theoretical regret bounds highlight the proposed algorithm's performance as a function of the sparsity level, and simulations illustrate the advantages of the proposed method over several competing approaches.
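The reward model in the abstract can be made concrete with a small simulation. The sketch below is only an illustration of that model under stated assumptions, not the paper's algorithm: the Gaussian noise, the dimensions, the support indices, and all function names (`make_sparse_weights`, `mean_reward`, `pull`, `pseudo_regret`) are hypothetical choices made here. It builds a sparse weight vector, draws noisy rewards whose mean is the inner product of an arm's feature vector with the weights, and measures the regret of a uniform-random baseline policy.

```python
import random

def make_sparse_weights(dim, support, rng):
    """Weight vector that is zero outside the given support indices."""
    theta = [0.0] * dim
    for i in support:
        theta[i] = rng.uniform(0.5, 1.0)  # nonzero entries (assumed range)
    return theta

def mean_reward(features, theta):
    """Mean reward of an arm: inner product of its features with theta."""
    return sum(x * w for x, w in zip(features, theta))

def pull(features, theta, rng, noise_std=0.1):
    """Stochastic reward: mean reward plus zero-mean Gaussian noise (assumed)."""
    return mean_reward(features, theta) + rng.gauss(0.0, noise_std)

def pseudo_regret(chosen, arms, theta):
    """Cumulative gap between the best arm's mean and the chosen arms' means."""
    best = max(mean_reward(x, theta) for x in arms)
    return sum(best - mean_reward(arms[a], theta) for a in chosen)

rng = random.Random(0)
dim, n_arms = 20, 5
theta = make_sparse_weights(dim, support=[2, 7, 11], rng=rng)
arms = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_arms)]

# Uniform-random baseline policy: it ignores the observed rewards entirely.
chosen = [rng.randrange(n_arms) for _ in range(100)]
rewards = [pull(arms[a], theta, rng) for a in chosen]
regret = pseudo_regret(chosen, arms, theta)
```

A bandit algorithm would replace the random choice of `chosen` with a rule that uses past rewards. Exploiting sparsity means taking advantage of the fact that only 3 of the 20 coordinates of `theta` are nonzero.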


Pages: 518-522

Number of pages: 5

50 items in total

[1] Privacy Amplification via Shuffling for Linear Contextual Bandits
Garcelon, Evrard
Chaudhuri, Kamalika
Perchet, Vianney
Pirotta, Matteo
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
[2] Optimal Multitask Linear Regression and Contextual Bandits under Sparse Heterogeneity
Huang, Xinmeng
Xu, Kan
Lee, Donghwan
Hassani, Hamed
Bastani, Hamsa
Dobriban, Edgar
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2025
[3] Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits
Chakraborty, Sunrit
Roy, Saptarshi
Tewari, Ambuj
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[4] Contextual Bandits With Hidden Features to Online Recommendation via Sparse Interactions
Yang, Shangdong
Zhang, Chenyu
Gao, Yang
Wang, Hao
IEEE INTELLIGENT SYSTEMS, 2020, 35 (05) : 62 - 71
[5] Relevance Vector Machines: Sparse Classification Methods for QSAR
Burden, Frank R.
Winkler, David A.
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (08) : 1529 - 1534
[6] Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits
Ren, Zhimei
Zhou, Zhengyuan
MANAGEMENT SCIENCE, 2024, 70 (02) : 1315 - 1342
[7] Conservative Contextual Linear Bandits
Kazerouni, Abbas
Ghavamzadeh, Mohammad
Abbasi-Yadkori, Yasin
Van Roy, Benjamin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[8] Balanced Linear Contextual Bandits
Dimakopoulou, Maria
Zhou, Zhengyuan
Athey, Susan
Imbens, Guido
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3445 - 3453
[9] Linear Contextual Bandits with Knapsacks
Agrawal, Shipra
Devanur, Nikhil R.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[10] Federated Linear Contextual Bandits
Huang, Ruiquan
Wu, Weiqiang
Yang, Jing
Shen, Cong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
