50 entries in total
- [1] Autonomous Exploration for Navigating in MDPs Using Blackbox RL Algorithms [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023 : 3714 - 3722
- [2] Near-Optimal Sample Complexity Bounds for Constrained MDPs [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022
- [3] Towards Tight Bounds on the Sample Complexity of Average-reward MDPs [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [4] Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7667 - 7674
- [5] Layered State Discovery for Incremental Autonomous Exploration [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [6] Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [7] On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [10] Improved bounds on the sample complexity of learning [J]. PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2000 : 309 - 318