50 entries in total
- [4] Accelerated modified policy iteration algorithms for Markov decision processes [J]. Mathematical Methods of Operations Research, 2013, 78: 61-76
- [5] The complexity of Policy Iteration is exponential for discounted Markov Decision Processes [J]. 2012 IEEE 51st Annual Conference on Decision and Control (CDC), 2012: 5997-6002