共 50 条
- [1] Near-optimal Regret Bounds for Reinforcement Learning [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1563 - 1600
- [2] Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [5] Near-Optimal No-Regret Learning in General Games [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [6] Kernelized Reinforcement Learning with Order Optimal Regret Bounds [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [8] Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [9] Near-Optimal Reinforcement Learning in Polynomial Time [J]. Machine Learning, 2002, 49 : 209 - 232
- [10] Near-optimal reinforcement learning in polynomial time [J]. MACHINE LEARNING, 2002, 49 (2-3) : 209 - 232