共 50 条
- [44] Mean Field Approximation of the Policy Iteration Algorithm for Graph-based Markov Decision Processes ECAI 2006, PROCEEDINGS, 2006, 141 : 595 - +
- [46] Adaptive Approximate Policy Iteration 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 523 - 531
- [48] Navigating to the Best Policy in Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [50] Efficient Policy Representation for Markov Decision Processes SMART TECHNOLOGIES IN URBAN ENGINEERING, STUE-2022, 2023, 536 : 151 - 162