共 50 条
- [33] Approximate robust policy iteration for discounted infinite-horizon Markov decision processes with uncertain stationary parametric tiransition matrices 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2052 - 2057
- [34] Adaptive Sampling for Best Policy Identification in Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [35] Approximate equivalence of Markov decision processes LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 581 - 594
- [37] Verification of General Markov Decision Processes by Approximate Similarity Relations and Policy Refinement QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2016, 2016, 9826 : 227 - 243
- [39] Temporal logic control of general Markov decision processes by approximate policy refinement IFAC PAPERSONLINE, 2018, 51 (16): : 73 - 78