50 entries in total
- [24] A pause control approach to the value iteration scheme in average Markov decision processes [J]. Systems and Control Letters, 1998, 33 (04): 209-219
- [28] A method for speeding up value iteration in partially observable Markov decision processes [J]. Uncertainty in Artificial Intelligence, Proceedings, 1999: 696-703
- [29] Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes [J]. SODA'18: Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, 2018: 770-787