50 entries in total
- [31] Near-Optimal Offline Reinforcement Learning via Double Variance Reduction ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [32] The benefit of receding horizon control: Near-optimal policies for stochastic inventory control OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2020, 97
- [34] Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [35] Autoregressive Policies for Continuous Control Deep Reinforcement Learning PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019: 2754-2762