共 50 条
- [41] Counterexample Explanation by Learning Small Strategies in Markov Decision Processes [J]. COMPUTER AIDED VERIFICATION, PT I, 2015, 9206 : 158 - 177
- [42] REINFORCEMENT LEARNING OF NON-MARKOV DECISION-PROCESSES [J]. ARTIFICIAL INTELLIGENCE, 1995, 73 (1-2) : 271 - 306
- [43] Active learning of dynamic Bayesian networks in Markov decision processes [J]. ABSTRACTION, REFORMULATION, AND APPROXIMATION, PROCEEDINGS, 2007, 4612 : 273 - +
- [44] Learning deterministic policies in partially observable Markov decision processes [J]. INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 250 - 257
- [45] PAC learning for Markov decision processes and dynamic. games [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2004, : 468 - 468
- [47] Learning Representation and Control in Markov Decision Processes: New Frontiers [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 1 (04): : 403 - 565
- [48] Online Learning in Markov Decision Processes with Changing Cost Sequences [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
- [49] Learning and Planning in Average-Reward Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7665 - 7676
- [50] From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning [J]. Discrete Event Dynamic Systems, 2003, 13 : 9 - 39