共 50 条
- [1] Near-Optimal Sample Complexity Bounds for Constrained MDPs [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [2] Near-Optimal Interdiction of Factored MDPs [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
- [3] Near-optimal Reinforcement Learning in Factored MDPs [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
- [4] NEAR-OPTIMAL BOUNDS FOR PHASE SYNCHRONIZATION [J]. SIAM JOURNAL ON OPTIMIZATION, 2018, 28 (02) : 989 - 1016
- [5] Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [6] Near-optimal Regret Bounds for Reinforcement Learning [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1563 - 1600
- [9] Near-optimal regret bounds for reinforcement learning [J]. Journal of Machine Learning Research, 2010, 11 : 1563 - 1600