50 entries in total
- [2] Robust Anytime Learning of Markov Decision Processes [J]. Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022
- [7] Policy iteration for robust nonstationary Markov decision processes [J]. Optimization Letters, 2016, 10 : 1613 - 1628
- [8] Policy Gradient for Rectangular Robust Markov Decision Processes [J]. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [9] Robust Average-Reward Markov Decision Processes [J]. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol. 37 No. 12, 2023 : 15215 - 15223
- [10] Policy iteration for robust nonstationary Markov decision processes [J]. Optimization Letters, 2016, 10 (08) : 1613 - 1628