Hierarchical Bayesian Bandits

被引：0

作者：

Hong, Joey ^{[1
,4
]}

Kveton, Branislav ^{[2
,4
]}

Zaheer, Manzil ^{[3
]}

Ghavamzadeh, Mohammad ^{[4
]}

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

[2] Amazon, Seattle, WA USA

[3] Google DeepMind, Mountain View, CA 94043 USA

[4] Google Res, Mountain View, CA 94043 USA

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022年 / 151卷

关键词：

ALLOCATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Meta-, multi-task, and federated learning can be all viewed as solving similar tasks, drawn from a distribution that reflects task similarities. We provide a unified view of all these problems, as learning to act in a hierarchical Bayesian bandit. We propose and analyze a natural hierarchical Thompson sampling algorithm (HierTS) for this class of problems. Our regret bounds hold for many variants of the problems, including when the tasks are solved sequentially or in parallel; and show that the regret decreases with a more informative prior. Our proofs rely on a novel total variance decomposition that can be applied beyond our models. Our theory is complemented by experiments, which show that the hierarchy helps with knowledge sharing among the tasks. This confirms that hierarchical Bayesian bandits are a universal and statistically-efficient tool for learning to act with similar bandit tasks.

引用

页数：18

共 50 条

[31] Bayesian hierarchical classes analysis
Leenen, Iwin
Van Mechelen, Iven
Gelman, Andrew
De Knop, Stijn
PSYCHOMETRIKA, 2008, 73 (01) : 39 - 64
[32] BayesOpt: A Bayesian optimization library for nonlinear optimization, experimental design and bandits
Martinez-Cantin, Ruben
Journal of Machine Learning Research, 2015, 15 : 3735 - 3739
[33] Bayesian hierarchical dictionary learning
Waniorek, N.
Calvetti, D.
Somersalo, E.
INVERSE PROBLEMS, 2023, 39 (02)
[34] PAC-Bayesian lifelong learning for multi-armed bandits
Hamish Flynn
David Reeb
Melih Kandemir
Jan Peters
Data Mining and Knowledge Discovery, 2022, 36 : 841 - 876
[35] Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles
Kim, Jung-hun
Yun, Se-Young
Jeong, Minchan
Nam, Junhyun
Shin, Jinwoo
Combes, Richard
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
[36] PAC-Bayesian lifelong learning for multi-armed bandits
Flynn, Hamish
Reeb, David
Kandemir, Melih
Peters, Jan
DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (02) : 841 - 876
[37] Hierarchical Approximate Bayesian Computation
Turner, Brandon M.
Van Zandt, Trisha
PSYCHOMETRIKA, 2014, 79 (02) : 185 - 209
[38] Bayesian nonparametric hierarchical modeling
Dunson, David B.
BIOMETRICAL JOURNAL, 2009, 51 (02) : 273 - 284
[39] Bayesian Hierarchical Classes Analysis
Iwin Leenen
Iven Van Mechelen
Andrew Gelman
Stijn De Knop
Psychometrika, 2008, 73 : 39 - 64
[40] Bayesian Analysis of Hierarchical Effects
Chandukala, Sandeep R.
Dotson, Jeffrey P.
Brazell, Jeff D.
Allenby, Greg M.
MARKETING SCIENCE, 2011, 30 (01) : 123 - 133

← 1 2 3 4 5 →