50 entries in total
- [2] On constrained Markov decision processes [J]. OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
- [3] Regret bounds for sleeping experts and bandits [J]. MACHINE LEARNING, 2010, 80 (2-3) : 245 - 272
- [4] A Dual Approach to Constrained Markov Decision Processes with Entropy Regularization [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [5] Learning in Constrained Markov Decision Processes [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01) : 441 - 453
- [6] A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023 : 3353 - 3359
- [7] Dynamic programming in constrained Markov decision processes [J]. CONTROL AND CYBERNETICS, 2006, 35 (03) : 645 - 660
- [9] Markov decision processes with constrained stopping times [J]. PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000 : 706 - 710
- [10] Reinforcement Learning for Constrained Markov Decision Processes [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130