共 50 条
- [41] Geometric Policy Iteration for Markov Decision Processes [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2070 - 2078
- [42] Policy set iteration for Markov decision processes [J]. AUTOMATICA, 2013, 49 (12) : 3687 - 3689
- [43] AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 713 - 722
- [46] Finite State Approximations of Markov Decision Processes with General State and Action Spaces [J]. 2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 3589 - 3594
- [49] An optimistic value iteration for mean-variance optimization in discounted Markov decision processes [J]. RESULTS IN CONTROL AND OPTIMIZATION, 2022, 8
- [50] Kernel Taylor-Based Value Function Approximation for Continuous-State Markov Decision Processes [J]. ROBOTICS: SCIENCE AND SYSTEMS XVI, 2020,