Efficient Online Bayesian Inference for Neural Bandits

被引:0
|
作者
Duran-Martin, Gerardo [1 ]
Kara, Aleyna [2 ]
Murphy, Kevin [3 ]
机构
[1] Queen Mary Univ, London, England
[2] Bogazici Univ, Bogazici, Turkey
[3] Google Res, Mountain View, CA USA
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a new algorithm for online (sequential) inference in Bayesian neural networks, and show its suitability for tackling contextual bandit problems. The key idea is to combine the extended Kalman filter (which locally linearizes the likelihood function at each time step) with a (learned or random) low-dimensional affine subspace for the parameters; the use of a subspace enables us to scale our algorithm to models with similar to 1M parameters. While most other neural bandit methods need to store the entire past dataset in order to avoid the problem of "catastrophic forgetting", our approach uses constant memory. This is possible because we represent uncertainty about all the parameters in the model, not just the final linear layer. We show good results on the "Deep Bayesian Bandit Showdown" benchmark, as well as MNIST and a recommender system.
引用
收藏
页码:6002 / 6021
页数:20
相关论文
共 50 条
  • [1] Probabilistic Bayesian Neural Networks for Efficient Inference
    Ishak, Md
    Alawad, Mohammed
    PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 724 - 729
  • [2] Contextual Bandits with Online Neural Regression
    Deb, Rohan
    Ban, Yikun
    Zuo, Shiliang
    He, Jingrui
    Banerjee, Arindam
    arXiv, 2023,
  • [3] Efficient Sensory Encoding and Bayesian Inference with Heterogeneous Neural Populations
    Ganguli, Deep
    Simoncelli, Eero P.
    NEURAL COMPUTATION, 2014, 26 (10) : 2103 - 2134
  • [4] Online Multi-Armed Bandits with Adaptive Inference
    Dimakopoulou, Maria
    Ren, Zhimei
    Zhou, Zhengyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
    Guo, Dalin
    Ktena, Sofia Ira
    Huszar, Ferenc
    Myana, Pranay Kumar
    Kneier, Michael
    Das, Sourav
    Shi, Wenzhe
    Tejani, Alykhan
    RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2020, : 456 - 461
  • [6] Efficient Priors for Scalable Variational Inference in Bayesian Deep Neural Networks
    Krishnan, Ranganath
    Subedar, Mahesh
    Tickoo, Omesh
    Labs, Intel
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 773 - 777
  • [7] Bayesian neural network with efficient priors for online quality prediction
    Zhang, Xu
    Zou, Yuanyuan
    Li, Shaoyuan
    DIGITAL CHEMICAL ENGINEERING, 2022, 2
  • [8] Bayesian inference in neural networks
    Paige, RL
    Butler, RW
    BIOMETRIKA, 2001, 88 (03) : 623 - 641
  • [9] Bayesian inference in neural networks
    Marzban, C
    FIRST CONFERENCE ON ARTIFICIAL INTELLIGENCE, 1998, : J25 - J30
  • [10] Bayesian inference in neural networks
    Marzban, C
    14TH CONFERENCE ON PROBABILITY AND STATISTICS IN THE ATMOSPHERIC SCIENCES, 1998, : J97 - J102