50 entries in total
- [42] A Rollout Algorithm for Multichain Markov Decision Processes with Average Cost [J]. Positive Systems, Proceedings, 2009, 389 : 151 - 162
- [44] Constrained continuous-time Markov decision processes with average criteria [J]. Mathematical Methods of Operations Research, 2008, 67 : 323 - 340
- [48] Constrained Markov Decision Processes with Total Expected Cost Criteria [J]. Proceedings of the 12th EAI International Conference on Performance Evaluation Methodologies and Tools (VALUETOOLS 2019), 2019 : 191 - 192
- [49] Conditions for the uniqueness of optimal policies of discounted Markov decision processes [J]. Mathematical Methods of Operations Research, 2004, 60 : 415 - 436
- [50] Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes [J]. Thirty-Fourth AAAI Conference on Artificial Intelligence, the Thirty-Second Innovative Applications of Artificial Intelligence Conference and the Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, 2020, 34 : 9794 - 9801