50 items in total
- [1] Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [2] Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132
- [3] Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [5] Learning to Branch with Tree MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [6] Reinforcement learning for MDPs with constraints MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 646 - 653
- [7] Efficient reinforcement learning in factored MDPs IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999: 740 - 747
- [9] Multitask reinforcement learning on the distribution of MDPs 2003 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN ROBOTICS AND AUTOMATION, VOLS I-III, PROCEEDINGS, 2003: 1108 - 1113
- [10] Expedited Learning in MDPs with Side Information 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018: 1941 - 1948