共 50 条
- [4] Near-optimal Per-Action Regret Bounds for Sleeping Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [5] Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [6] Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [7] Feedback graph regret bounds for Thompson Sampling and UCB ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 592 - 614
- [8] Society of Agents: Regret Bounds of Concurrent Thompson Sampling ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [9] NUMERICAL EVALUATION OF SAMPLING BOUNDS FOR NEAR-OPTIMAL RECONSTRUCTION IN COMPRESSED SENSING 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
- [10] Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,