20 items in total
- [2] Safe Q-Learning Method Based on Constrained Markov Decision Processes. IEEE Access, 2019, 7: 165007-165017
- [3] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes. 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton), 2012: 400-405
- [6] Non-randomized policies for constrained Markov decision processes. Mathematical Methods of Operations Research, 2007, 66: 165-179
- [7] Risk-aware Q-Learning for Markov Decision Processes. 2017 IEEE 56th Annual Conference on Decision and Control (CDC), 2017
- [8] On Q-learning Convergence for Non-Markov Decision Processes. Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018: 2546-2552