50 items in total
- [2] Gradient-Aware Model-Based Policy Search [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3801 - 3808
- [3] A Gradient-Aware Line Sampling Algorithm for LiDAR Scanners [J]. IEEE SENSORS JOURNAL, 2020, 20 (16) : 9283 - 9292
- [5] On constrained Markov decision processes [J]. OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
- [6] Potential based optimization algorithm of constrained Markov decision processes [J]. Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 433 - 436
- [7] A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3353 - 3359
- [9] A Structure-aware Online Learning Algorithm for Markov Decision Processes [J]. PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 71 - 78
- [10] Learning in Constrained Markov Decision Processes [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01) : 441 - 453