Efficient Online Bayesian Inference for Neural Bandits

被引：0

作者：

Duran-Martin, Gerardo ^{[1
]}

Kara, Aleyna ^{[2
]}

Murphy, Kevin ^{[3
]}

机构：

[1] Queen Mary Univ, London, England

[2] Bogazici Univ, Bogazici, Turkey

[3] Google Res, Mountain View, CA USA

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022年 / 151卷

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present a new algorithm for online (sequential) inference in Bayesian neural networks, and show its suitability for tackling contextual bandit problems. The key idea is to combine the extended Kalman filter (which locally linearizes the likelihood function at each time step) with a (learned or random) low-dimensional affine subspace for the parameters; the use of a subspace enables us to scale our algorithm to models with similar to 1M parameters. While most other neural bandit methods need to store the entire past dataset in order to avoid the problem of "catastrophic forgetting", our approach uses constant memory. This is possible because we represent uncertainty about all the parameters in the model, not just the final linear layer. We show good results on the "Deep Bayesian Bandit Showdown" benchmark, as well as MNIST and a recommender system.

引用

页码：6002 / 6021

页数：20

共 50 条

[1] Probabilistic Bayesian Neural Networks for Efficient Inference
Ishak, Md
Alawad, Mohammed
PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 724 - 729
[2] Contextual Bandits with Online Neural Regression
Deb, Rohan
Ban, Yikun
Zuo, Shiliang
He, Jingrui
Banerjee, Arindam
arXiv, 2023,
[3] Efficient Sensory Encoding and Bayesian Inference with Heterogeneous Neural Populations
Ganguli, Deep
Simoncelli, Eero P.
NEURAL COMPUTATION, 2014, 26 (10) : 2103 - 2134
[4] Online Multi-Armed Bandits with Adaptive Inference
Dimakopoulou, Maria
Ren, Zhimei
Zhou, Zhengyuan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[5] Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Guo, Dalin
Ktena, Sofia Ira
Huszar, Ferenc
Myana, Pranay Kumar
Kneier, Michael
Das, Sourav
Shi, Wenzhe
Tejani, Alykhan
RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2020, : 456 - 461
[6] Efficient Priors for Scalable Variational Inference in Bayesian Deep Neural Networks
Krishnan, Ranganath
Subedar, Mahesh
Tickoo, Omesh
Labs, Intel
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 773 - 777
[7] Bayesian neural network with efficient priors for online quality prediction
Zhang, Xu
Zou, Yuanyuan
Li, Shaoyuan
DIGITAL CHEMICAL ENGINEERING, 2022, 2
[8] Bayesian inference in neural networks
Paige, RL
Butler, RW
BIOMETRIKA, 2001, 88 (03) : 623 - 641
[9] Bayesian inference in neural networks
Marzban, C
FIRST CONFERENCE ON ARTIFICIAL INTELLIGENCE, 1998, : J25 - J30
[10] Bayesian inference in neural networks
Marzban, C
14TH CONFERENCE ON PROBABILITY AND STATISTICS IN THE ATMOSPHERIC SCIENCES, 1998, : J97 - J102

← 1 2 3 4 5 →