共 50 条
- [1] A Provably Efficient Sample Collection Strategy for Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [3] Provably Good Batch Reinforcement Learning Without Great Exploration [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [4] Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [5] Distributional Reinforcement Learning for Efficient Exploration [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [6] Gap-Dependent Unsupervised Exploration for Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [7] Provably Efficient Causal Reinforcement Learning with Confounded Observational Data [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [8] Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [9] Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [10] Provably Efficient Offline Reinforcement Learning in Regular Decision Processes [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,