Efficient Online Bayesian Inference for Neural Bandits

被引:0
|
作者
Duran-Martin, Gerardo [1 ]
Kara, Aleyna [2 ]
Murphy, Kevin [3 ]
机构
[1] Queen Mary Univ, London, England
[2] Bogazici Univ, Bogazici, Turkey
[3] Google Res, Mountain View, CA USA
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a new algorithm for online (sequential) inference in Bayesian neural networks, and show its suitability for tackling contextual bandit problems. The key idea is to combine the extended Kalman filter (which locally linearizes the likelihood function at each time step) with a (learned or random) low-dimensional affine subspace for the parameters; the use of a subspace enables us to scale our algorithm to models with similar to 1M parameters. While most other neural bandit methods need to store the entire past dataset in order to avoid the problem of "catastrophic forgetting", our approach uses constant memory. This is possible because we represent uncertainty about all the parameters in the model, not just the final linear layer. We show good results on the "Deep Bayesian Bandit Showdown" benchmark, as well as MNIST and a recommender system.
引用
收藏
页码:6002 / 6021
页数:20
相关论文
共 50 条
  • [21] Bayesian inference - The future of online fraud protection
    Excell, D., 1600, Elsevier Ltd (2012):
  • [22] Online reinforcement learning control by Bayesian inference
    Xia, Zhongpu
    Zhao, Dongbin
    IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (12): : 1331 - 1338
  • [23] Inference for Batched Bandits
    Zhang, Kelly W.
    Janson, Lucas
    Murphy, Susan A.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [24] Efficient Bayesian inference with latent Hamiltonian neural networks in No-U-Turn Sampling
    Dhulipala, Somayajulu L. N.
    Che, Yifeng
    Shields, Michael D.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 492
  • [25] Sample-efficient neural likelihood-free Bayesian inference of implicit HMMs
    Ghosh, Sanmitra
    Birre, Paul J.
    De Angelis, Daniela
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [26] Sampling-Tree Model: Efficient Implementation of Distributed Bayesian Inference in Neural Networks
    Yu, Zhaofei
    Chen, Feng
    Liu, Jian K.
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, 12 (03) : 497 - 510
  • [27] Hierarchical Bayesian Bandits
    Hong, Joey
    Kveton, Branislav
    Zaheer, Manzil
    Ghavamzadeh, Mohammad
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [28] BAYESIAN NONPARAMETRIC BANDITS
    CLAYTON, MK
    BERRY, DA
    ANNALS OF STATISTICS, 1985, 13 (04): : 1523 - 1534
  • [29] Efficient Lifting for Online Probabilistic Inference
    Nath, Aniruddh
    Domingos, Pedro
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1193 - 1198
  • [30] Bayesian inference with finitely wide neural networks
    Lu, Chi-Ken
    PHYSICAL REVIEW E, 2023, 108 (01)