23 items in total
- [1] Clustering-Guided GP-UCB for Bayesian Optimization 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2461 - 2465
- [2] Improving GP-UCB Algorithm by Harnessing Decomposed Feedback MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 1167 : 555 - 569
- [4] Logarithmic Regret from Sublinear Hints ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [5] Nonstationary Stochastic Bandits: UCB Policies and Minimax Regret IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2024, 3 : 128 - 142
- [6] Feedback graph regret bounds for Thompson Sampling and UCB ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 592 - 614
- [7] Constrained Online Learning in Networks with Sublinear Regret and Fit 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5486 - 5493
- [8] Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [10] Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2361 - 2369