Hierarchical Bayesian Bandits

被引:0
|
作者
Hong, Joey [1 ,4 ]
Kveton, Branislav [2 ,4 ]
Zaheer, Manzil [3 ]
Ghavamzadeh, Mohammad [4 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Amazon, Seattle, WA USA
[3] Google DeepMind, Mountain View, CA 94043 USA
[4] Google Res, Mountain View, CA 94043 USA
关键词
ALLOCATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-, multi-task, and federated learning can be all viewed as solving similar tasks, drawn from a distribution that reflects task similarities. We provide a unified view of all these problems, as learning to act in a hierarchical Bayesian bandit. We propose and analyze a natural hierarchical Thompson sampling algorithm (HierTS) for this class of problems. Our regret bounds hold for many variants of the problems, including when the tasks are solved sequentially or in parallel; and show that the regret decreases with a more informative prior. Our proofs rely on a novel total variance decomposition that can be applied beyond our models. Our theory is complemented by experiments, which show that the hierarchy helps with knowledge sharing among the tasks. This confirms that hierarchical Bayesian bandits are a universal and statistically-efficient tool for learning to act with similar bandit tasks.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Bayesian hierarchical classes analysis
    Leenen, Iwin
    Van Mechelen, Iven
    Gelman, Andrew
    De Knop, Stijn
    PSYCHOMETRIKA, 2008, 73 (01) : 39 - 64
  • [32] BayesOpt: A Bayesian optimization library for nonlinear optimization, experimental design and bandits
    Martinez-Cantin, Ruben
    Journal of Machine Learning Research, 2015, 15 : 3735 - 3739
  • [33] Bayesian hierarchical dictionary learning
    Waniorek, N.
    Calvetti, D.
    Somersalo, E.
    INVERSE PROBLEMS, 2023, 39 (02)
  • [34] PAC-Bayesian lifelong learning for multi-armed bandits
    Hamish Flynn
    David Reeb
    Melih Kandemir
    Jan Peters
    Data Mining and Knowledge Discovery, 2022, 36 : 841 - 876
  • [35] Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles
    Kim, Jung-hun
    Yun, Se-Young
    Jeong, Minchan
    Nam, Junhyun
    Shin, Jinwoo
    Combes, Richard
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [36] PAC-Bayesian lifelong learning for multi-armed bandits
    Flynn, Hamish
    Reeb, David
    Kandemir, Melih
    Peters, Jan
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (02) : 841 - 876
  • [37] Hierarchical Approximate Bayesian Computation
    Turner, Brandon M.
    Van Zandt, Trisha
    PSYCHOMETRIKA, 2014, 79 (02) : 185 - 209
  • [38] Bayesian nonparametric hierarchical modeling
    Dunson, David B.
    BIOMETRICAL JOURNAL, 2009, 51 (02) : 273 - 284
  • [39] Bayesian Hierarchical Classes Analysis
    Iwin Leenen
    Iven Van Mechelen
    Andrew Gelman
    Stijn De Knop
    Psychometrika, 2008, 73 : 39 - 64
  • [40] Bayesian Analysis of Hierarchical Effects
    Chandukala, Sandeep R.
    Dotson, Jeffrey P.
    Brazell, Jeff D.
    Allenby, Greg M.
    MARKETING SCIENCE, 2011, 30 (01) : 123 - 133