Hierarchical Bayesian Bandits

Cited by: 0
Authors
Hong, Joey [1, 4]
Kveton, Branislav [2, 4]
Zaheer, Manzil [3 ]
Ghavamzadeh, Mohammad [4 ]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Amazon, Seattle, WA USA
[3] Google DeepMind, Mountain View, CA 94043 USA
[4] Google Res, Mountain View, CA 94043 USA
Keywords
ALLOCATION;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Meta-, multi-task, and federated learning can all be viewed as solving similar tasks, drawn from a distribution that reflects task similarities. We provide a unified view of all these problems as learning to act in a hierarchical Bayesian bandit. We propose and analyze a natural hierarchical Thompson sampling algorithm (HierTS) for this class of problems. Our regret bounds hold for many variants of the problems, including when the tasks are solved sequentially or in parallel, and show that the regret decreases with a more informative prior. Our proofs rely on a novel total variance decomposition that can be applied beyond our models. Our theory is complemented by experiments, which show that the hierarchy helps with knowledge sharing among the tasks. This confirms that hierarchical Bayesian bandits are a universal and statistically efficient tool for learning to act with similar bandit tasks.
Pages: 18
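The abstract names hierarchical Thompson sampling (HierTS) without spelling out its steps. The following is a minimal sketch of one way such a sampler could look, not the authors' implementation. It assumes a K-armed Gaussian bandit in which each arm has a hyper-parameter mu_i ~ N(mu_q, sigma_q^2), each task draws theta_{s,i} ~ N(mu_i, sigma_0^2), and rewards are N(theta_{s,i}, sigma^2) with all variances known. The function names `hyper_posterior` and `hier_ts_choose_arm` and every default value are hypothetical.

```python
import numpy as np


def hyper_posterior(counts, sums, mu_q=0.0, sigma_q=1.0, sigma_0=0.5, sigma=1.0):
    """Posterior mean and variance of each arm's hyper-parameter mu_i.

    counts, sums: float arrays of shape (num_tasks, num_arms) holding the
    number of pulls and the summed rewards of every arm in every task.
    """
    # With theta_{s,i} integrated out, the mean of n rewards is distributed
    # N(mu_i, sigma_0^2 + sigma^2 / n), so a task contributes precision
    # n / (n * sigma_0^2 + sigma^2), which is zero for unpulled arms.
    denom = counts * sigma_0**2 + sigma**2
    precision = 1.0 / sigma_q**2 + (counts / denom).sum(axis=0)
    mean = (mu_q / sigma_q**2 + (sums / denom).sum(axis=0)) / precision
    return mean, 1.0 / precision


def hier_ts_choose_arm(counts, sums, task, rng,
                       mu_q=0.0, sigma_q=1.0, sigma_0=0.5, sigma=1.0):
    """Pick an arm for `task` by two-level posterior sampling."""
    # Step 1: sample the hyper-parameters from their posterior (all tasks).
    hyper_mean, hyper_var = hyper_posterior(
        counts, sums, mu_q, sigma_q, sigma_0, sigma)
    mu = rng.normal(hyper_mean, np.sqrt(hyper_var))
    # Step 2: sample this task's parameters given mu and the task's own data.
    precision = 1.0 / sigma_0**2 + counts[task] / sigma**2
    mean = (mu / sigma_0**2 + sums[task] / sigma**2) / precision
    theta = rng.normal(mean, np.sqrt(1.0 / precision))
    # Step 3: act greedily with respect to the sampled task parameters.
    return int(np.argmax(theta))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    num_tasks, num_arms = 3, 4
    # Simulated environment drawn from the assumed hierarchy.
    true_mu = rng.normal(0.0, 1.0, size=num_arms)
    true_theta = rng.normal(true_mu, 0.5, size=(num_tasks, num_arms))
    counts = np.zeros((num_tasks, num_arms))
    sums = np.zeros((num_tasks, num_arms))
    for t in range(300):
        task = t % num_tasks  # tasks interleaved round-robin
        arm = hier_ts_choose_arm(counts, sums, task, rng)
        reward = rng.normal(true_theta[task, arm], 1.0)
        counts[task, arm] += 1
        sums[task, arm] += reward
    print(counts)
```

Because the hyper-posterior pools the statistics of every task, pulls in one task tighten the prior used in the others, which is the knowledge sharing the abstract refers to. The round-robin schedule in the demo is only one of the interaction orders the abstract mentions.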