共 50 条
- [22] Delayed Nondeterminism in Continuous-Time Markov Decision Processes [J]. FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATIONAL STRUCTURES, PROCEEDINGS, 2009, 5504 : 364 - +
- [23] Framework for solving time-delayed Markov Decision Processes [J]. PHYSICAL REVIEW RESEARCH, 2023, 5 (03):
- [24] Online EXP3 Learning in Adversarial Bandits with Delayed Feedback [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [25] Online Learning of Safety function for Markov Decision Processes [J]. 2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
- [26] Learning Policies for Markov Decision Processes in Continuous Spaces [J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 4751 - 4758
- [27] Active Learning of Markov Decision Processes for System Verification [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 289 - 294
- [28] Active learning in partially observable Markov decision processes [J]. MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 601 - 608