50 entries in total
- [41] Learning Representation and Control in Markov Decision Processes: New Frontiers [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 1 (04): 403 - 565
- [43] PAC learning for Markov decision processes and dynamic games [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2004: 468 - 468
- [46] Reinforcement Learning for Cost-Aware Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [47] From perturbation analysis to Markov decision processes and reinforcement learning [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (1-2): 9 - 39
- [48] Learning Parameterized Policies for Markov Decision Processes through Demonstrations [J]. 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016: 7087 - 7092
- [49] Toward Implicit Learning for the Compositional Verification of Markov Decision Processes [J]. VERIFICATION AND EVALUATION OF COMPUTER AND COMMUNICATION SYSTEMS, 2018, 11181: 200 - 217
- [50] Learning factored representations for partially observable Markov decision processes [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12: 1050 - 1056