50 items in total
- [2] Non-randomized policies for constrained Markov decision processes [J]. Mathematical Methods of Operations Research, 2007, 66 : 165 - 179
- [4] Optimal policies for constrained average-cost Markov decision processes [J]. TOP, 2011, 19 (01) : 107 - 120
- [5] Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes [J]. Thirty-Fourth AAAI Conference on Artificial Intelligence, the Thirty-Second Innovative Applications of Artificial Intelligence Conference and the Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, 2020, 34 : 9794 - 9801
- [6] On constrained Markov decision processes [J]. Operations Research Letters, 1996, 19 (01) : 25 - 28
- [7] Learning in Constrained Markov Decision Processes [J]. IEEE Transactions on Control of Network Systems, 2023, 10 (01) : 441 - 453
- [9] Optimal Decision Tree Policies for Markov Decision Processes [J]. Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023, 2023 : 5457 - 5465
- [10] Dynamic programming in constrained Markov decision processes [J]. Control and Cybernetics, 2006, 35 (03) : 645 - 660