50 entries in total
- [31] Optimism in Face of a Context: Regret Guarantees for Stochastic Contextual MDP. Thirty-Seventh AAAI Conference on Artificial Intelligence, vol. 37, no. 7, 2023, pp. 8510-8517
- [32] Multi-task Representation Learning with Stochastic Linear Bandits. International Conference on Artificial Intelligence and Statistics, vol. 206, 2023
- [33] Non-Stationary Representation Learning in Sequential Linear Bandits. IEEE Open Journal of Control Systems, vol. 1, 2022, pp. 41-56
- [34] Online Linear Quadratic Tracking With Regret Guarantees. IEEE Control Systems Letters, vol. 7, 2023, pp. 3950-3955
- [35] Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits. Conference on Learning Theory, vol. 178, 2022
- [36] Hybrid Regret Bounds for Combinatorial Semi-Bandits and Adversarial Linear Bandits. Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021
- [37] Optimal Regret Bounds for Collaborative Learning in Bandits. International Conference on Algorithmic Learning Theory, vol. 237, 2024
- [38] Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits. Advances in Neural Information Processing Systems 29 (NIPS 2016), 2016
- [39] Constant or Logarithmic Regret in Asynchronous Multiplayer Bandits with Limited Communication. International Conference on Artificial Intelligence and Statistics, vol. 238, 2024