50 items in total
- [42] A reinforcement learning based algorithm for Markov decision processes [J]. 2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005: 199 - 204
- [43] Learning and Planning with Timing Information in Markov Decision Processes [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015: 111 - 120
- [44] Learning Adversarial Markov Decision Processes with Delayed Feedback [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 7281 - 7289
- [47] Combining Learning Algorithms: An Approach to Markov Decision Processes [J]. ENTERPRISE INFORMATION SYSTEMS, ICEIS 2012, 2013, 141 : 172 - 188
- [48] A sensitivity view of Markov decision processes and reinforcement learning [J]. MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
- [49] Online Learning in Markov Decision Processes with Continuous Actions [J]. ALGORITHMIC LEARNING THEORY, ALT 2015, 2015, 9355 : 302 - 316
- [50] THE COMPOSITIONAL CONSTRUCTION OF MARKOV PROCESSES II [J]. RAIRO-THEORETICAL INFORMATICS AND APPLICATIONS, 2011, 45 (01): 117 - 142