共 50 条
- [21] Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [22] An Online Kernel Selection Wrapper via Multi-Armed Bandit Model 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1307 - 1312
- [23] Regret Analysis for RL using Renewal Bandit Feedback 2022 IEEE INFORMATION THEORY WORKSHOP (ITW), 2022, : 137 - 142
- [26] Worst-case regret analysis of computationally budgeted online kernel selection Machine Learning, 2022, 111 : 937 - 976
- [28] Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [29] Online Multiclass Boosting with Bandit Feedback 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89