共 50 条
- [4] Online Learning in Kernelized Markov Decision Processes 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [5] Risk-aware Q-Learning for Markov Decision Processes 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
- [6] On Q-learning Convergence for Non-Markov Decision Processes PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2546 - 2552
- [7] Safe Q-Learning Method Based on Constrained Markov Decision Processes IEEE ACCESS, 2019, 7 : 165007 - 165017
- [8] An Aggregation Procedure for Large-Scale Markov Decision Processes PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON MATHEMATICAL METHODS IN ECONOMICS 2004, 2004, : 9 - 15
- [10] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes 2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 400 - 405