共 50 条
- [1] Minimax Regret for Stochastic Shortest Path [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [2] Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [3] Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [4] Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
- [5] Learning Stochastic Shortest Path with Linear Function Approximation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [6] Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [7] Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [9] Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits [J]. CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
- [10] Nearly Optimal Regret for Stochastic Linear Bandits with Heavy-Tailed Payoffs [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2936 - 2942