50 items in total
- [41] Existence of Optimal Stationary Policies in Average Reward Markov Decision-Processes with a Recurrent State. Applied Mathematics and Optimization, 1992, 26(2): 171-194
- [42] Computing semi-stationary optimal policies for multichain semi-Markov decision processes. Annals of Operations Research, 2020, 287: 843-865
- [45] Learning Policies for Markov Decision Processes in Continuous Spaces. 2018 IEEE Conference on Decision and Control (CDC), 2018: 4751-4758
- [46] Finding Safe Zones of Markov Decision Processes Policies. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [48] Efficient Policies for Stationary Possibilistic Markov Decision Processes. Symbolic and Quantitative Approaches to Reasoning with Uncertainty, ECSQARU 2017, 2017, 10369: 306-317