共 50 条
- [1] The complexity of Policy Iteration is exponential for discounted Markov Decision Processes 2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5997 - 6002
- [4] Policy Gradient for Rectangular Robust Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] Discounted Markov decision processes with fuzzy costs Annals of Operations Research, 2020, 295 : 769 - 786