50 items in total
- [31] Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [36] Reinforcement Learning through Global Stochastic Search in N-MDPs [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2011, 6912 : 326 - 340
- [37] Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited [J]. ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132
- [39] Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [40] Model-based reinforcement learning in factored-state MDPs [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007 : 103 - 110