Hierarchical Bayesian Bandits

被引:0
|
作者
Hong, Joey [1 ,4 ]
Kveton, Branislav [2 ,4 ]
Zaheer, Manzil [3 ]
Ghavamzadeh, Mohammad [4 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Amazon, Seattle, WA USA
[3] Google DeepMind, Mountain View, CA 94043 USA
[4] Google Res, Mountain View, CA 94043 USA
来源
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022年 / 151卷
关键词
ALLOCATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-, multi-task, and federated learning can be all viewed as solving similar tasks, drawn from a distribution that reflects task similarities. We provide a unified view of all these problems, as learning to act in a hierarchical Bayesian bandit. We propose and analyze a natural hierarchical Thompson sampling algorithm (HierTS) for this class of problems. Our regret bounds hold for many variants of the problems, including when the tasks are solved sequentially or in parallel; and show that the regret decreases with a more informative prior. Our proofs rely on a novel total variance decomposition that can be applied beyond our models. Our theory is complemented by experiments, which show that the hierarchy helps with knowledge sharing among the tasks. This confirms that hierarchical Bayesian bandits are a universal and statistically-efficient tool for learning to act with similar bandit tasks.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] NONPARAMETRIC BAYESIAN MULTIARMED BANDITS FOR SINGLE-CELL EXPERIMENT DESIGN
    Camerlenghi, Federico
    Dumitrascu, Bianca
    Ferrari, Federico
    Engelhardt, Barbara E.
    Favaro, Stefano
    ANNALS OF APPLIED STATISTICS, 2020, 14 (04): : 2003 - 2019
  • [22] BayesOpt: A Bayesian Optimization Library for Nonlinear Optimization, Experimental Design and Bandits
    Martinez-Cantin, Ruben
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3735 - 3739
  • [23] Hierarchical Bayesian Reservoir Memory
    Nouri, Ali
    Nikmehr, Hooman
    2009 14TH INTERNATIONAL COMPUTER CONFERENCE, 2009, : 581 - 586
  • [24] Hierarchical Approximate Bayesian Computation
    Brandon M. Turner
    Trisha Van Zandt
    Psychometrika, 2014, 79 : 185 - 209
  • [25] Bayesian hierarchical curve registration
    Telesca, Donatello
    Inoue, Lurdes Y. T.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (481) : 328 - 339
  • [26] Interactive Bayesian Hierarchical Clustering
    Vikram, Sharad
    Dasgupta, Sanjoy
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [27] Bayesian Network Structure Inference with an Hierarchical Bayesian Model
    Werhli, Adriano Velasque
    ADVANCES IN ARTIFICIAL INTELLIGENCE - SBIA 2010, 2010, 6404 : 92 - 101
  • [28] Bayesian Hierarchical Pointing Models
    Zhao, Hang
    Gu, Sophia
    Yu, Chun
    Bi, Xiaojun
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, UIST 2022, 2022,
  • [29] Hierarchical Bayesian models of delusion
    Williams, Daniel
    CONSCIOUSNESS AND COGNITION, 2018, 61 : 129 - 147
  • [30] Bayesian hierarchical ordinal regression
    Paquet, U
    Holden, S
    Naish-Guzman, A
    ARTIFICIAL NEURAL NETWORKS: FORMAL MODELS AND THEIR APPLICATIONS - ICANN 2005, PT 2, PROCEEDINGS, 2005, 3697 : 267 - 272