50 items in total
- [1] Learning Parameterized Policies for Markov Decision Processes through Demonstrations [J]. 2016 IEEE 55th Conference on Decision and Control (CDC), 2016: 7087 - 7092
- [2] Learning Policies for Markov Decision Processes in Continuous Spaces [J]. 2018 IEEE Conference on Decision and Control (CDC), 2018: 4751 - 4758
- [4] Variance minimization of parameterized Markov decision processes [J]. Discrete Event Dynamic Systems, 2018, 28 : 63 - 81
- [5] Variance minimization of parameterized Markov decision processes [J]. Discrete Event Dynamic Systems-Theory and Applications, 2018, 28 (01): 63 - 81
- [6] Learning deterministic policies in partially observable Markov decision processes [J]. Intelligent Autonomous Systems: IAS-5, 1998: 250 - 257
- [8] Parameterized Penalties in the Dual Representation of Markov Decision Processes [J]. 2012 IEEE 51st Annual Conference on Decision and Control (CDC), 2012: 870 - 876
- [9] Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes [J]. Thirty-Fourth AAAI Conference on Artificial Intelligence, the Thirty-Second Innovative Applications of Artificial Intelligence Conference and the Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, 2020, 34: 9794 - 9801